DataLad extension for semantic metadata handling

This extension equips DataLad with additional commands for extracting, aggregating, and reporting metadata, enabling alternative metadata workflows.

What is in it for users?

What is in it for developers?


High-level API commands

meta_extract([dataset, path, sources, …])
    Run one or more metadata extractors on a dataset or file.

meta_aggregate([path, dataset, recursive, …])
    Aggregate metadata of one or more (sub)datasets for later reporting.

meta_report([path, dataset, reporton, recursive])
    Query a dataset’s aggregated metadata for dataset and file metadata.
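
The three commands above can be chained from Python. The following is a minimal sketch, not the extension's documented usage: it uses only the keyword names shown in the listing (`dataset`, `recursive`), and it assumes that the commands are exposed as functions on an API object (presumably `datalad.api` once the extension is installed). The helper name `run_metadata_workflow` is hypothetical. Aggregating before reporting follows the command descriptions; running the extraction step first is an illustrative choice.

```python
from typing import Any, Dict


def run_metadata_workflow(api: Any, dataset_path: str) -> Dict[str, Any]:
    """Call the three high-level commands in extract, aggregate, report order.

    `api` is any object that provides meta_extract, meta_aggregate and
    meta_report as callables (assumption: `datalad.api` with the extension
    installed). Return values are passed through unchanged, because their
    structure is not described in this overview.
    """
    results: Dict[str, Any] = {}

    # Run metadata extractors on the dataset. Which extractors run when
    # `sources` is omitted is not stated in this overview.
    results["extract"] = api.meta_extract(dataset=dataset_path)

    # Aggregate metadata of the dataset and its subdatasets for later reporting.
    results["aggregate"] = api.meta_aggregate(dataset=dataset_path, recursive=True)

    # Query the aggregated metadata for dataset and file metadata.
    results["report"] = api.meta_report(dataset=dataset_path, recursive=True)

    return results
```

With the extension installed, a call might look like `run_metadata_workflow(datalad.api, "/path/to/dataset")`, where the path is a placeholder. Passing the API object as an argument keeps the helper easy to exercise with a stand-in object when DataLad is not available.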

Metadata extractors


DataLad is developed as part of the US-German collaboration in computational neuroscience (CRCNS) project “DataGit: converging catalogues, warehouses, and deployment logistics into a federated ‘data distribution’” (Halchenko/Hanke), co-funded by the US National Science Foundation (NSF 1429999) and the German Federal Ministry of Education and Research (BMBF 01GQ1411). Additional support is provided by the German federal state of Saxony-Anhalt and the European Regional Development Fund (ERDF) through the project “Center for Behavioral Brain Sciences, Imaging Platform”.

DataLad is built atop the git-annex software that is being developed and maintained by Joey Hess.
