This folder contains citation collection configuration and outputs for DANDI dandisets using citations-collector.
- dandi-full.yaml: Collection configuration using BibTeX source (../dandi.bib)
- dandi-full-citations.tsv: Discovered citations (auto-generated)
- Makefile: Pipeline automation for discovery → merge → PDF fetching
- nwb-data-reuse.tsv: Manual curation from NWB Data Reuse Notion table
- comparison-report.md: Comparison between manual and auto-discovered citations
Run the full citation collection pipeline:
    make all

This will:
- Discover citations from all configured sources (CrossRef, OpenCitations, DataCite, OpenAlex)
- Detect and merge preprints with their published versions
- Fetch open-access PDFs into the pdfs/ directory

Individual steps can also be run separately:

    make discover   # Step 1: Discover citations only
    make merge      # Step 2: Detect/merge preprints only
    make pdfs       # Step 3: Fetch PDFs only
    make status     # Show current statistics
    make clean      # Remove generated files

For reproducible execution tracking:
    datalad run -m "Update DANDI citations" --output dandi-full-citations.tsv --output pdfs/ make all

The collection now uses a BibTeX source pointing to ../dandi.bib, which is maintained by the dandi-citations tool. This allows:
- External maintenance of dandiset metadata in BibTeX format
- Automatic extraction of item IDs and version numbers from DOIs
- Integration with existing bibliography management workflows
The BibTeX file contains all DANDI dandisets with versioned DOIs. The citation collector:
- Parses dandi.bib using a regex pattern to extract item_id and flavor_id from DOIs
- Discovers citations for each dandiset version from multiple sources
- Deduplicates and merges preprints with published versions
- Fetches open-access PDFs where available
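
The parsing step in the list above can be illustrated with a minimal Python sketch. It is not the pattern used by citations-collector; the actual regex is whatever is configured in dandi-full.yaml. The sketch assumes versioned DANDI DOIs shaped like 10.48324/dandi.<six-digit id>/<version>, and that flavor_id corresponds to the version part. The names DOI_PATTERN and parse_dandi_doi are hypothetical, and the DOI string is only an example of the assumed format.

```python
import re

# Hypothetical pattern for illustration; the real one lives in dandi-full.yaml.
# Assumes versioned DOIs of the form 10.48324/dandi.<6-digit id>/<version>.
DOI_PATTERN = re.compile(
    r"10\.48324/dandi\.(?P<item_id>\d{6})/(?P<flavor_id>[0-9.]+)"
)


def parse_dandi_doi(text):
    """Return (item_id, flavor_id) from text containing a versioned DOI, or None."""
    match = DOI_PATTERN.search(text)
    if match is None:
        return None
    return match.group("item_id"), match.group("flavor_id")


# A BibTeX-style field written in the assumed format.
print(parse_dandi_doi("doi = {10.48324/dandi.000003/0.210812.1448}"))
```

Named groups keep the mapping from DOI parts to item_id and flavor_id explicit, so a different DOI layout would only require changing the pattern.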
For interactive exploration of the TSV file, we recommend visidata:

    visidata dandi-full-citations.tsv
    # Or directly from GitHub:
    visidata https://raw.githubusercontent.com/dandi/dandi-bib/refs/heads/master/citations/dandi-full-citations.tsv

Edit dandi-full.yaml to customize:
- Citation sources to query
- PDF output directory
- Email for API polite pools
- Regex pattern for BibTeX parsing
See the citations-collector documentation for full configuration options.