Data Citation Resources
This web page provides extra information in the form of guides, explanations, templates, aids in decision making and more to help individuals and organisations learn about data citation and Digital Object Identifiers (DOIs).
Overview : http://ands.org.au/cite-data
What is Data Citation
Data Citation Benefits
Data Citation Benefits for Researchers
DOI Data Citation in Action
Using Identifiers to Cite Data
Digital Object Identifiers for Datasets
ANDS and DOIs: Services, Projects and Connections
Implementing DOIs
Presentations and Training Material
Related Links
What is Data Citation?
Data citation refers to the practice of providing a reference to data in the same way as researchers routinely provide a bibliographic reference to printed resources. An important aim of ANDS is to enable more researchers to re-use research data more often. To achieve this aim ANDS is engaged in activities that will make it easier to share data, to recognise the importance of making data available and to make data citation a standard procedure.
More:
- Poster [PDF 1.9MB] - multiple free printed copies of the poster in pamphlet form are available by emailing: guides@ands.org.au
- Data Citation - ANDS Guide (awareness level): This two page guide covers: defining data citation, the ANDS approach to data citation, how to cite data.
- Mons, B., Haagen, H. van, Chichester, C., Hoen, P.-B. 'T, Dunnen, J. T. den, Ommen, G. van, et al. (2011). The value of data. Nature genetics, 43(4), 281-3. Nature Publishing Group. doi: 10.1038/ng0411-281
.
Data Citation Benefits?
|
CrossRef Animation |
The need to cite data is starting to be recognised as one of the key practices underpinning the recognition of data as a primary research output rather than as a by-product of research.
|
More:
- Whitlock, M. C. (2010). Data archiving in ecology and evolution: best practices. Trends in Ecology & Evolution, 1-5. Elsevier Ltd. doi: 10.1016/j.tree.2010.11.006
- Ball, A.& Duke, M. (2011). ‘Data Citation and Linking
'. DCC Briefing Papers. Edinburgh: Digital Curation Centre. - Ball, A. & Duke, M. (2011). ‘How to Cite Datasets and Link to Publications
’. DCC How-to Guides. Edinburgh: Digital Curation Centre. - Watch the video
on the Thomson Reuters DCI Digital Citation Index
- ICPSR (starts at 5.32): data sharing, how ICPSR is encouraging good practice, benefits of the DCI
- California Digital Library: (starts at 18.40): open source tools academic libraries are using to support data rich researchers
- Pangaea (starts at 29.30): data publishing prerequisites, infrastructure and building blocks
- Thomson Reuters (starts at 41.20): about the DCI and Q&A from webinar participants.
- ICPSR (starts at 5.32): data sharing, how ICPSR is encouraging good practice, benefits of the DCI
Data Citation benefits for Researchers:
(Information from Oak Ridge / NASA Poster)![]()
- Makes data publications more acceptable for CVs and creates more recognitions for data sharing
- Provides a means of tracking publications who are using or citing researchers dataset
- Can search for DOIs in search engines to find grey literature
- Journals are more accepting of DOIs
More:
- Callaghan S, Donegan S, Pepler S,2012 Making Data a First Class Scientific Output: Data Citation and Publication by NERC's Environmantal Data Centres. The International Journal of Digital Curation, Vol 7, Issue 1

- Piwowar HA, Day RS, Fridsma DB, 2007 Sharing Detailed Research Data Is Associated with Increased Citation Rate. PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308
DOI Data Citation in Action
Data citation standards vary across disciplines. However, DataCite
has undertaken some work in this area and recommends the following format.
Creator (Publication Year): Title. Publisher. Identifier
Denhard, Michael (2009): dphase_mpeps: MicroPEPS LAF‐Ensemble run by DWD for the MAP D‐PHASE project. World Data Center for Climate. http://dx.doi.org/10.1594/WDCC/dphase_mpeps
They recognise that it may also be desirable to include two optional properties, Version and ResourceType (as appropriate). If so, the recommended form is as follows:
Creator (PublicationYear): Title. Version. Publisher. ResourceType. Identifier
ISSP Research Group: International Social Survey Programme 2007: Leisure Time and Sports (ISSP 2007). GESIS Data Archive, Cologne, Germany, ZA4850 Data file Vers. 2.0.0, doi:10.4232/1.10079
More:
For citation purposes, the Identifier may optionally appear both in its original format and in a linkable, http format.
- Citations form: doi:10.1594/PANGAEA.484677
- Hypertext link form: http://dx.doi.org/10.1594/PANGAEA.484677

Examples of DOIs for PANGAEA datasets in a variety of settings:
Resources:
- DataCite Metadata Schema
[PDF 3.03KB] for the Publication and Citation of Research Data: an 18 page PDF examining the broader issues of data citation and associated metadata - A short guide to data citation
by the Michigan State University Library. - DataCite-International Initiative
to Facilitate Access to Research Data - An example of how Nature journal publisher uses DOIs
in their publication process - An example of how DOIs are used in collaboration endeavours between scientists: Use of DOIs in solving the German ecoli outbreak in July 2011
. - Watch: A Guelph Humber Librarian shows how various academic journals use DOIs in their reference structure (Where's the DOI?)
Using Identifiers to Cite Data
To be cited accurately and to ensure other researchers can find your dataset in the future, datasets need one or more identifiers (preferably a persistent identifier) to be attached to the dataset. Research data outputs don't stay in the one place: they have a lifecycle which involves the data moving around. Persistent identifier links to the data outputs shouldn't break, whether they are raw data or publications.
There are many types of persistent identifiers which can be used to cite data and datasets. Some institutions may use several different types of persistent identifiers. Digital Object Identifiers (DOIs) are a type of persistent identifier which are available through the ANDS Cite My Data Service.
More:
- Persistent Identifiers - ANDS Guide (awareness level). This 2 page guide covers: defining persistent identifiers, reasons for persistent identifiers, how they work, an example of one type of persistent identifier (Handle), and what needs to be done by whom to ensure persistence.
- Persistent Identifiers decision tree [PDF 253KB]:
Is a DOI preferable to another type of persistant identifier for data citation? YES if:
- This data will be exposed
- This data forms part of the scholarly record
- This data can be kept persistent
- The minimum metadata required by DataCite can be supplied.
Digital Object Identifiers (DOIs) for Datasets
A DOI Name is a Persistent Identifier, but also provides extra benefits.
- DOI Names can be used to uniquely identify either digital or non-digital objects, whether or not they have any internet presence.
- DOI Names are supported by the International DOI Foundation (IDF) and Registration Agencies infrastructure, which provides ongoing DOI services and allows for a high level of confidence in the quality and accuracy of DOI Names.
- DOIs are used by CrossRef and other publishers to track citation metrics.
DOIs are intended to be persistent. The DOI name syntax is a NISO standard
but allows the incorporation of any form of existing identifier. The DOI name suffix can be any alphanumeric string that the Registrant chooses. This can simply be a sequential number, or it can make use of an existing (legacy) identifier.
Example DOIs for datasets: doi:10.4232/1.10079 -or- doi:10.1594/WDCC/dphase_mpeps
Note: Currently the Cite My Data service will mint DOI suffixes in the form of random digits. This policy will be revisited in future releases of the service.
More:
- The Digital Object Identifier System and DOI Names: This 2 page ANDS guide covers: defining DOIs, DOI names, difference between DOIs and other persistent identifiers, advantages of DOIs for datasets, ANDS and DOIs.
- International DOI Foundation
has links to the DOI Handbook and FAQ - The DOI Handbook
on how a DOI® name is constructed and assigned. It discusses the use of the DOI name prefix as a naming authority, and the DOI name suffix as a mechanism for assigning individual numbers within that naming authority, incorporating (if required) existing identifiers. The ability of the DOI name to incorporate existing identifiers and the benefits of that approach are discussed in detail. Character sets, case sensitivity, uniqueness, and Internet identifier specifications are also discussed. - CrossRef is a DOI Registration agency: http://www.crossref.org/
- Watch: Scott Edmunds announce BGI's new GigaScience journal which uses DOIs
ANDS and DOIs: Services, Projects, Connections
- ANDS is a member of DataCite
ANDS has joined DataCite, a group of leading research libraries and technical information providers, that aims to make it easier for research datasets to be handled as independent, citable, unique scientific objects via the use of DOIs. ANDS is also participating in the DataCite Metadata Standards Working Group. - Cite My Data service
The ANDS Cite My Data service is offered to Australian research sector organizations who need to manage citations to their datasets. There is no charge for using this service. The Cite My Data service is built on the global DataCite data citation infrastructure . ANDS is currently the only Australian member of the DataCite Consortium - ANDS is keen to provide a service which helps Australian researchers, research organisations and discipline communities to address some of the policy, workflows and scholarly communications issues around data citation. The service would support research organisations to:
- Develop and implement policy
- Build consensus on citation practice
- Trial self- data citation in journal articles
- Track third party data citation
- Provide input into trials with automated data citation indexation services
ANDS is still exploring service provision opportunities in this area - we would appreciate your interest, feedback and input. Please email: contact@ands.org.au
Implementing DOIs
ANDS has a wealth of material and services to support the use of DOI's in promoting reusability and connectedness of Australian research data.
| Institutional policy & procedures |
Issues to consider when developing institutional policy and procedures Persistence into the future is the strength of the DOI concept - institutions need to have policies and procedures in place to define and assure persistence Institutions can incorporate an alpha component into the DOI name (eg doi:10.1234/FACULTYABC.654321) but note that this also implies persistence of the name into the foreseeable future Persistence may incur ongoing direct and indirect costs for the institution. "Persistent Identification" is an accepted concept: what does the DOI System add to this?
|
| IT Tools & Infrastructure (hardware & software) |
ANDS IT Projects Institutions need to meet minimum requirements before DOIs can be minted - Identifiers Decision Tree [PDF 253KB] DOIs can be minted through ANDS using the Cite My Data machine-to-machine serviceManaging Metadata DOIs need to be stored and managed with other metadata about the dataset in local stores or repositories. DOIs should be included in RIF-CS metadata feeds to Research Data Australia in the Identifier element to ensure maximum access for researchers to the data
Guides |
| Support services (people & advice) |
Awareness and Advice DOIs are well known within the scholarly journal world, but promulgation of the benefits and use of DOIs for research datasets may require an awareness campaign across the institution. Guides and articles
Posters and Pamphlets
People and Community Building contacts and networks
Conference presentations
|
Conferences, Presentations and Training Material
Presentation slides, associated materials and webinar recordings (where available) are available through the Data Citation Conferences, Presentations & Training Material page.
Previous events
- Data Citation Best Practice Roundtable: Brisbane, Aug 2012 - Program and register

- Data Citation virtual Roundtables: Webinar, June 2012 - featured speakers from DataCite, Oak Ridge National Laboratories, Thomson Reuters Data Citation Index product.
- Data Citation workshop at eResearch Australasia 2011: Workshop, Nov 2011 - featured DataCite, the experiences of early implementers of DOIs in Australia and ANDS "Cite My Data" service.
Related Links
If you are interested in DOIs, you may also be interested in the following related concepts. Information is available through:
The ANDS website:
- Data management: planning, policies, procedures
- IT: the Data curation continuum
- Licensing to encourage the reuse of data
- IT: the Date curation continuum - ANDS Guide
- Research data policy and the Australian Code for the Responsible Conduct of Research - ANDS Guide
External:
- QUT library blog post - linking your publications to your data
















