FAIRy

BIOFAIR Open Mic 2025

BIOFAIR Open Mic • October 30, 2025

Abstract

An overview of FAIRy's approach to local-first dataset pre-checking and how it supports the BIOFAIR Data Network roadmap, specifically contributing to the Stocktaking & Gap Analysis initiative. This talk covers the tool's capabilities, the rulepack system for domain-specific validation, and how it helps researchers prepare data before submission to repositories like GEO, ENA, and Zenodo.

Full presentation slides and event recording available on BCoN website →

Key takeaways

  • FAIRy contributes to the Stocktaking & Gap Analysis slice of the BIOFAIR Data Network roadmap by assessing dataset readiness before it reaches a repository
  • A local-first validator and packager that compares what's in a submission to what's required, flagging gaps with pass / warn / fail
  • Generates shareable attestations that can be used to demonstrate readiness without exposing raw data
  • Open source and extensible: the first rulepack targets ENA, with the ability for teams to author their own repository-specific rulepacks to fit local policies
  • Helps reviewers and submitters identify the top 5–7 critical checks needed for successful repository submissions

Usage snippet

Example commands for running FAIRy pre-flight checks, generating readiness reports, and exporting validated data packages.

# Run FAIRy pre-flight check
fairy check --rulepack geo-seq-bulk metadata.tsv samples.tsv

# Generate readiness report
fairy report --output readiness-report.md

# Export validated data package
fairy export --format zenodo

Interested in an ENA rulepack pilot?

Email hello@datadabra.com