|
| 1 | +# Digital Object Identifiers |
| 2 | + |
| 3 | +A Digital Object Identifier (DOI) is a code used to uniquely |
| 4 | +identify content of various types. DOIs enable easy online |
| 5 | +access to research data for discovery, attribution, and reuse, |
| 6 | +and enable accurate data citation and other metrics. DOIs are |
| 7 | +a persistent identifier, and as such carry expectations of |
| 8 | +curation, persistent access, and rich metadata. |
| 9 | + |
| 10 | +There is a system and practices associated with DOI usage, |
| 11 | +for "persistent and actionable identification and interoperable |
| 12 | +exchange of managed information on digital networks" |
| 13 | +(https://support.datacite.org/docs/doi-basics). |
| 14 | + |
| 15 | +DOIs are intended to be "resolvable," usually to information |
| 16 | +about the object to which the DOI refers—including information |
| 17 | +about where the object can be found. For a dataset, that would |
| 18 | +be a dataset landing page providing information about the |
| 19 | +dataset like where it can be accessed. The DOI should not |
| 20 | +point to the dataset itself. The DOI remains fixed over the |
| 21 | +lifetime of the object, whereas its location and metadata may |
| 22 | +change. When the location changes, the publisher of the |
| 23 | +object is responsible for updating the metadata for the DOI |
| 24 | +to the new locations. |
| 25 | + |
| 26 | +The developer and administrator of the DOI system is the |
| 27 | +International DOI Foundation (IDF) which introduced DOIs |
| 28 | +in 2000. Organizations that meet the contractual obligations |
| 29 | +of the DOI system and that are willing to pay to become a |
| 30 | +member (such as DataCite, see below) can assign DOIs. |
| 31 | + |
| 32 | +The DOI system is implemented through a federation of |
| 33 | +registration agencies coordinated by the IDF. |
| 34 | +See https://www.doi.org/, and particularly |
| 35 | +https://www.doi.org/hb.html, the DOI Handbook, for details. |
| 36 | + |
| 37 | +## Purpose of DOIs |
| 38 | + |
| 39 | +Funding agencies and publishers increasingly recognize that |
| 40 | +datasets and scientific software are valuable research outputs |
| 41 | +that should be openly available, identifiable, and citable—often |
| 42 | +through DOIs. |
| 43 | + |
| 44 | +At LASP, digital objects worthy of identification include |
| 45 | +datasets and associated outputs (e.g., documentation, papers, |
| 46 | +workflows, algorithms, software, etc.). |
| 47 | + |
| 48 | +## DOI registries |
| 49 | + |
| 50 | +To enable accessibility, a DOI needs to reside in a registry |
| 51 | +where it can be resolved. The registry collects and provides |
| 52 | +high level information, assigns DOIs, and links to references. |
| 53 | + |
| 54 | +[DataCite](https://datacite.org/) is a not-for-profit, global |
| 55 | +initiative to "help the research community locate, identify, |
| 56 | +and cite research data with confidence," through DOI minting |
| 57 | +and registration. It is the leading global provider of DOIs |
| 58 | +for datasets. From their website: |
| 59 | + |
| 60 | +>By working closely with data centres to assign DOIs to |
| 61 | +> datasets and other research objects, we are developing a |
| 62 | +> robust infrastructure that supports simple and effective |
| 63 | +> methods of data citation, discovery, and access. Citable |
| 64 | +> data become legitimate contributions to scholarly |
| 65 | +> communication, paving the way for new metrics and |
| 66 | +> publication models that recognize and reward data sharing. |
| 67 | +
|
| 68 | +CU Libraries are now a member of DataCite. Through this |
| 69 | +membership, LASP can mint and register DOIs for datasets |
| 70 | +housed in our repositories, enabling data to be persistently |
| 71 | +identified, accessed, and cited. |
| 72 | + |
| 73 | +[Crossref](https://www.crossref.org/) is another registry that |
| 74 | +is often mentioned in Earth and space science contexts. It's |
| 75 | +a not-for-profit association of ~2000 voting member publishers |
| 76 | +who represent 4300 societies and publishers. It exists to |
| 77 | +facilitate the links between distributed content hosted at |
| 78 | +other sites, and uses DOIs to do so. |
| 79 | + |
| 80 | +[Zenodo](https://zenodo.org/) is a free repository developed |
| 81 | +by CERN and operated by OpenAIRE. It is a general-purpose |
| 82 | +repository that allows researchers to deposit datasets, |
| 83 | +research software, reports, and any other research-related |
| 84 | +digital artifacts. Zenodo assigns DOIs to the deposited |
| 85 | +content, making it citable and discoverable. |
| 86 | +See [citing software](../workflows/open_source/citing_software.md) |
| 87 | +for more on using Zenodo to cite software. |
| 88 | + |
| 89 | +[ORCiDs](https://orcid.org/) are like DOIs but provide |
| 90 | +persistent digital object identifiers for people. |
| 91 | + |
| 92 | +## DOI Format |
| 93 | + |
| 94 | +When a LASP researcher needs a DOI, they will provide some information and receive a DOI back. |
| 95 | +They will never actually create a DOI. Nevertheless, it is worth understanding the form of a DOI |
| 96 | +and the goals behind its format. |
| 97 | + |
| 98 | +DataCite goals for DOIs include enabling robots and crawlers to recognize DataCite DOIs as URLs, |
| 99 | +making them easy to cut and paste, and helping users recognize that DOIs are both a persistent link |
| 100 | +and a persistent identifier. |
| 101 | + |
| 102 | +This is a DOI: |
| 103 | + |
| 104 | +https://doi.org/10.5281/ZENODO.31780 |
| 105 | +A DOI name consists of three parts: |
| 106 | + |
| 107 | + |
| 108 | + |
| 109 | +The proxy is an HTTP URL. DataCite recommends that all DOIs are permanent URLs. |
| 110 | +(Using the old DOI protocol, e.g. doi:/10.5281/ZENODO.31780 is NOT recommended.) |
| 111 | + |
| 112 | +A DOI prefix always starts with "10." and continues with a number. This number |
| 113 | +defines a globally unique namespace. (The scope of "global" depends on the organization |
| 114 | +managing multiple repositories.) Prefixes should not have semantic meaning. Adding |
| 115 | +meaning to the identifier is risky because "despite besting intentions, all names can |
| 116 | +change over time" [DataCite DOI Basics](https://support.datacite.org/docs/doi-basics). |
| 117 | + |
| 118 | +The suffix for a DOI can be almost any string. Here is where information provided in an |
| 119 | +input form may be integrated into the DOI. |
| 120 | + |
| 121 | +Note that DOI names are not case-sensitive, while URLs are case-sensitive: |
| 122 | +https://support.datacite.org/docs/datacite-doi-display-guidelines. |
| 123 | + |
| 124 | +## Useful Links |
| 125 | + |
| 126 | +- [DataCite: DOI Basics](https://support.datacite.org/docs/doi-basics) |
| 127 | +- [DataCite: DOI Handbook](https://www.doi.org/the-identifier/resources/handbook/) |
| 128 | +- [DataCite: DOI Display Guidelines](https://support.datacite.org/docs/datacite-doi-display-guidelines) |
| 129 | +- [Creating a DOI via CU Libraries and DataCite](creating_a_doi.md) |
| 130 | + |
| 131 | +## Acronyms |
| 132 | + |
| 133 | +- **DOI** = Digital Object Identifier |
| 134 | +- **IDF** = International DOI Foundation |
| 135 | +- **ORCID** = Open Researcher and Contributor ID |
| 136 | + |
| 137 | +Credit: Content taken from a Confluence guide written by Anne Wilson and Shawn Polson. |
0 commit comments