fair_drones

FAIR² Drone Dataset Card: Quick-Start Guide

For researchers creating or documenting drone-based wildlife datasets

Below is a streamlined checklist for creating a WildFAIRx compliant drone dataset card. Review the full dataset card template for detailed guidance.

Before You Start


Choose Your Template

Select the appropriate template based on your primary use case:


YAML Front Matter


Dataset Overview


Data Structure

!!! warning “Consistent Naming Conventions”

Ensure file naming is consistent across videos, telemetry files, and occurrence data:
- Use the same date format throughout (e.g., `YYYY_MM_DD` or `DD_MM_YY`)
- Match session/flight identifiers exactly between CSV files and data folders
- Document any naming changes (e.g., `session_1` → `flight_1`) to maintain data linkages
- Test that scripts can locate files using your naming pattern

Darwin Core Metadata

Event Records

For each survey location/date, document:

!!! tip “Extracting GPS Data from Telemetry”

If your videos have embedded telemetry (SRT files, EXIF data, or flight logs), you can extract GPS coordinates programmatically:
- Use `exiftool` for EXIF GPS data from images
- Parse DJI `.SRT` files for frame-level GPS coordinates
- Extract launch points, min/max bounds, and altitude ranges
- Aggregate video-level data to session/mission-level events
- See [KABR scripts](https://github.com/Imageomics/kabr-behavior-telemetry/tree/main/scripts) for Python examples

Occurrence Records


Platform Specifications


Mission Parameters


Annotations

Format and Coverage

Quality Metrics

Annotation Difficulty


Dataset Statistics


Limitations and Considerations


Licensing


Citation


Acknowledgements


Optional Sections


Validation Checklist

WildFAIRx Compliance

Darwin Core Compliance

Practical Completeness


Common Mistakes to Avoid

Vague Descriptions ❌ “We used a drone to collect wildlife images” ✓ “DJI Matrice 300 RTK with Zenmuse H20T camera flew grid patterns at 60m AGL”

Missing Geographic Precision ❌ “Collected in Tanzania” ✓ “Serengeti National Park, Mara Region (-2.3456, 34.8123 ±5m)”

Unclear Sampling Effort ❌ “Multiple flights” ✓ “45 missions totaling 30 flight hours, covering 2,500 hectares”

Incomplete Species Names ❌ “elephants, zebras, giraffes” ✓ “Loxodonta africana, Equus quagga, Giraffa camelopardalis”

Undocumented Splits ❌ “Split into train/val/test” ✓ “Stratified by location and season: missions 1-300 (train), 301-350 (val), 351-400 (test)”

Hidden Biases ❌ “Representative wildlife dataset” ✓ “Dry season only; large-bodied species overrepresented; morning flights bias against nocturnal species”


Time-Saving Tips

Before Starting:

While Completing:

Automation Opportunities:

For Missing Information:


Resources

Validation Tools

External Resources


Getting Help

!!! question “Questions, Comments, or Concerns?

For assistance:


Ready to start?

  1. Download the appropriate template for your task
  2. Gather your information using the checklist above
  3. Set aside 2-3 hours to complete the card
  4. Follow the template section by section
  5. Validate your completed card
  6. Publish your WildFAIRx compliant dataset!