Why RO-Crate

Logo and words RO-Crate

RO-Crate is a community-driven, lightweight approach to packaging research data with its metadata, ensuring that data collections are retained in context and remain useful for years to come.

Traditional data archiving often results in "decontextualized" data—fragmented files stored in repositories where the vital context (who created it, what tools were used, how it should be cited) is lost, recorded in a disconnected document, or buried file system. This makes reproduction and reuse of research difficult, contributing to a "reproducibility crisis".

RO-Crate addresses these challenges by bundling data with machine-readable and human-readable metadata using standard web technologies. This ensures that metadata is not just a sidecar file but is kept with the data. When a user downloads a "crate" (a single ZIP file), they receive the data alongside its full context, ensuring it remains meaningful even years later.

Ensuring FAIR Data Principles

    RO-Crate is a practical implementation of the FAIR principles, making research:
  • Findable: Crates use persistent identifiers (PIDs) like DOIs and ORCIDs, and their linked metadata makes them easily discoverable by both humans and search engines.
  • Accessible: Standardized web protocols (JSON-LD, Schema.org) ensure that metadata can be retrieved automatically.
  • Interoperable: By using widely used web standards, RO-Crates can be used across different platforms and disciplines without requiring specialized software.
  • Reusable: Detailed provenance information—including the researchers involved, the steps taken, and the software used—is captured within the crate to facilitate reproduction.

Lightweight and Extensible

    Unlike complex, "heavyweight" metadata standards, RO-Crate is designed to be developer-friendly and easy to adopt.
  • JSON-LD Based: Uses familiar web technologies that integrate easily with modern web applications like CrateSpace.
  • Domain Extensible: While providing a core set of general-purpose structures, it can be extended with "profiles" to meet the specific needs of different research communities (e.g., bioinformatics, digital humanities).
  • Standardised Packaging: Crates can be distributed as simple ZIP files, making them compatible with almost any repository or storage system.

By adopting RO-Crate, CrateSpace provides more than just storage; it provides a FAIR-compliant research data management solution. It ensures that the valuable outputs of research are preserved, cited, and reused with their full integrity intact, fulfilling the requirements for modern, high-quality data stewardship.