ANDS Logo
bannerbannerbannerbanner
 Find Research Data:

Data Citation Resources

This web page provides extra information in the form of guides, explanations, templates, aids in decision making and more to help individuals and organisations learn about data citation and Digital Object Identifiers (DOIs).

 

Overview : http://ands.org.au/cite-data

What is Data Citation
Data Citation Benefits
Data Citation Benefits for Researchers
DOI Data Citation in Action
Using Identifiers to Cite Data
Digital Object Identifiers for Datasets
ANDS and DOIs: Services, Projects and Connections
Implementing DOIs
Presentations and Training Material
Related Links


What is Data Citation?

Data Citation Poster

Data Citation Poster (PDF)

Data citation refers to the practice of providing a reference to data in the same way as researchers routinely provide a bibliographic reference to printed resources.  An important aim of ANDS is to enable more researchers to re-use research data more often.  To achieve this aim ANDS is engaged in activities that will make it easier to share data, to recognise the importance of making data available and to make data citation a standard procedure.

More:

  • Poster [PDF 1.9MB] - multiple free printed copies of the poster in pamphlet form are available by emailing: guides@ands.org.au
  • Data Citation - ANDS Guide  (awareness level): This two page guide covers: defining data citation, the ANDS approach to data citation, how to cite data.
  • Mons, B., Haagen, H. van, Chichester, C., Hoen, P.-B. 'T, Dunnen, J. T. den, Ommen, G. van, et al. (2011). The value of data. Nature genetics, 43(4), 281-3. Nature Publishing Group. doi: 10.1038/ng0411-281Nature Genetics.


Data Citation Benefits?

 

CrossRef Animation

CrossRef Animation

 

The need to cite data is starting to be recognised as one of the key practices underpinning the recognition of data as a primary research output rather than as a by-product of research.

  • If datasets are routinely cited they will start to achieve a validity and significance within the scholarly communications cycle.

  • Citation of data could enable recognition of scholarly effort in disciplines and organisations that want to acknowledge and reward data outputs.

  • Using DOIs will establish easier access to research data on the Internet, and

    • Increase the acceptance of research data as legitimately citable contribution to the scientific record,

    • Support data archiving thus permitting results to be verified and re-purposed for future study.

    • Enable data citation metrics using DOIs to be tracked (as is done with articles) by products such as Thomson Reuters Data Citation Index

More:

  • Whitlock, M. C. (2010). Data archiving in ecology and evolution: best practices. Trends in Ecology & Evolution, 1-5. Elsevier Ltd. doi: 10.1016/j.tree.2010.11.006Elsevier
  • Ball, A.& Duke, M. (2011). ‘Data Citation and LinkingDCC-Data citation and linking'. DCC Briefing Papers. Edinburgh: Digital Curation Centre. 
  • Ball, A. & Duke, M. (2011). ‘How to Cite Datasets and Link to PublicationsDCC guide-how to cite datasets and link to publications’. DCC How-to Guides. Edinburgh: Digital Curation Centre.
  • Watch the videovideo on the Thomson Reuters DCI Digital Citation Indexthomson reuters
    • ICPSR (starts at 5.32): data sharing, how ICPSR is encouraging good practice, benefits of the DCI
    • California Digital Library: (starts at 18.40): open source tools academic libraries are using to support data rich researchers
    • Pangaea (starts at 29.30): data publishing prerequisites, infrastructure and building blocks
    • Thomson Reuters (starts at 41.20): about the DCI and Q&A from webinar participants.

 

Benefits of Adding DOIs to Datasets Poster

Benefits of Adding DOIs to Finalised Datasets - Poster(PDF)

Data Citation benefits for Researchers:

(Information from Oak Ridge / NASA Poster)Oak Ridge / NASA Poster

  • Makes data publications more acceptable for CVs and creates more recognitions for data sharing
  • Provides a means of tracking publications who are using or citing researchers dataset
  • Can search for DOIs in search engines to find grey literature
  • Journals are more accepting of DOIs

More:


DOI Data Citation in Action

World Data Center for Climate Screenshot

Access page for World Data Center for Climate Dataset

Data citation standards vary across disciplines.  However, DataCiteDataCite has undertaken some work in this area and recommends the following format.

Creator (Publication Year): Title. Publisher. Identifier

Denhard, Michael (2009): dphase_mpeps: MicroPEPS LAF‐Ensemble run by DWD for the MAP D‐PHASE project. World Data Center for Climate. http://dx.doi.org/10.1594/WDCC/dphase_mpeps

 

ISSP Screenshot

Access page for ISSP Dataset

They recognise that it may also be desirable to include two optional properties, Version and ResourceType (as appropriate). If so, the recommended form is as follows:

Creator (PublicationYear): Title. Version. Publisher. ResourceType. Identifier

ISSP Research Group: International Social Survey Programme 2007: Leisure Time and Sports (ISSP 2007). GESIS Data Archive, Cologne, Germany, ZA4850 Data file Vers. 2.0.0, doi:10.4232/1.10079

 More:

PANGEA Screenshot

Access page for PANGEA Dataset

For citation purposes, the Identifier may optionally appear both in its original format and in a linkable, http format.

Examples of DOIs for PANGAEA datasets in a variety of settings:

Resources:

 


Using Identifiers to Cite Data

To be cited accurately and to ensure other researchers can find your dataset in the future, datasets need one or more identifiers (preferably a persistent identifier) to be attached to the dataset.  Research data outputs don't stay in the one place: they have a lifecycle which involves the data moving around. Persistent identifier links to the data outputs shouldn't break, whether they are raw data or publications.

There are many types of persistent identifiers which can be used to cite data and datasets.  Some institutions may use several different types of persistent identifiers.  Digital Object Identifiers (DOIs) are a type of persistent identifier which are available through the ANDS Cite My Data Service.

More:

  • Persistent Identifiers - ANDS Guide (awareness level). This 2 page guide covers: defining persistent identifiers, reasons for persistent identifiers, how they work, an example of one type of persistent identifier (Handle), and what needs to be done by whom to ensure persistence.   

Identifier Decision Tree

Identifier Decision Tree (PDF)

  • Persistent Identifiers decision tree [PDF 253KB]:
Is a DOI preferable to another type of persistant identifier for data citation? YES if:
  • This data will be exposed
  • This data forms part of the scholarly record
  • This data can be kept persistent
  • The minimum metadata required by DataCite can be supplied.


Digital Object Identifiers (DOIs) for Datasets

A DOI Name is a Persistent Identifier, but also provides extra benefits.

  • DOI Names can be used to uniquely identify either digital or non-digital objects, whether or not they have any internet presence.
  • DOI Names are supported by the International DOI Foundation (IDF) and Registration Agencies infrastructure, which provides ongoing DOI services and allows for a high level of confidence in the quality and accuracy of DOI Names.
  • DOIs are used by CrossRef and other publishers to track citation metrics.

DOIs are intended to be persistent.  The DOI name syntax is a NISO standardNISO standard but allows the incorporation of any form of existing identifier. The DOI name suffix can be any alphanumeric string that the Registrant chooses. This can simply be a sequential number, or it can make use of an existing (legacy) identifier.

Example DOIs for datasets:  doi:10.4232/1.10079    -or-   doi:10.1594/WDCC/dphase_mpeps 

Note: Currently the Cite My Data service will mint DOI suffixes in the form of random digits. This policy will be revisited in future releases of the service.

More:

  • The Digital Object Identifier System and DOI Names: This 2 page ANDS guide covers: defining DOIs, DOI names, difference between DOIs and other persistent identifiers, advantages of DOIs for datasets, ANDS and DOIs.

  • International DOI FoundationDOI foundation has links to the DOI Handbook and FAQ

  • The DOI HandbookDOI Handbook on how a DOI® name is constructed and assigned. It discusses the use of the DOI name prefix as a naming authority, and the DOI name suffix as a mechanism for assigning individual numbers within that naming authority, incorporating (if required) existing identifiers. The ability of the DOI name to incorporate existing identifiers and the benefits of that approach are discussed in detail. Character sets, case sensitivity, uniqueness, and Internet identifier specifications are also discussed.

  • CrossRef is a DOI Registration agency: http://www.crossref.org/CrossRef 

  • Watch: Scott Edmunds announce BGI's new GigaScience journal which uses DOIs 


    http://www.youtube.com/watch?v=AlYFa83aCWA


ANDS and DOIs: Services, Projects, Connections

  • ANDS is a member of DataCite
    ANDS has joined DataCite, a group of leading research libraries and technical information providers, that aims to make it easier for research datasets to be handled as independent, citable, unique scientific objects via the use of  DOIs.  ANDS is also participating in the DataCite Metadata Standards Working Group.  

  • Cite My Data service
    The ANDS Cite My Data service is offered to Australian research sector organizations who need to manage citations to their datasets. There is no charge for using this service. The Cite My Data service is built on the global DataCite data citation infrastructure . ANDS is currently the only Australian member of the DataCite Consortium

  • ANDS is keen to provide a service which helps Australian researchers, research organisations and discipline communities to address some of the policy, workflows and scholarly communications issues around data citation. The service would support research organisations to:
    • Develop and implement policy
    • Build consensus on citation practice
    • Trial self- data citation in journal articles
    • Track third party data citation
    • Provide input into trials with automated data citation indexation services

ANDS is still exploring service provision opportunities in this area - we would appreciate your interest, feedback and input. Please email: contact@ands.org.au

 

Implementing DOIs

ANDS has a wealth of material and services to support the use of DOI's in promoting reusability and connectedness of Australian research data.

Institutional policy & procedures

Issues to consider when developing institutional policy and procedures

Persistence into the future is the strength of the DOI concept - institutions need to have policies and procedures in place to define and assure persistence

Institutions can incorporate an alpha component into the DOI name (eg doi:10.1234/FACULTYABC.654321) but note that this also implies persistence of the name into the foreseeable future

Persistence may incur ongoing direct and indirect costs for the institution. "Persistent Identification" is an accepted concept: what does the DOI System add to this?DOI foundation FAQs

IT Tools & Infrastructure (hardware & software)


ANDS IT Projects

Institutions need to meet minimum requirements before DOIs can be minted - Identifiers Decision Tree [PDF 253KB]

DOIs can be minted through ANDS using the Cite My Data machine-to-machine service     


Managing Metadata

DOIs need to be stored and managed with other metadata about the dataset in local stores or repositories.

DOIs should be included in RIF-CS metadata feeds to Research Data Australia in the Identifier element to ensure maximum access for researchers to the data


Guides

  • Persistent identifiers (Working level) | [PDF 403KB]
  • Persistent identifiers (Expert level) | [PDF 975KB]
Support services (people & advice)


Awareness and Advice

DOIs are well known within the scholarly journal world, but promulgation of the benefits and use of DOIs for research datasets may require an awareness campaign across the institution.

Guides and articles

Posters and Pamphlets


People and Community

Building contacts and networks

Conference presentations

Conferences, Presentations and Training Material

Presentation slides, associated materials and webinar recordings (where available) are available through the Data Citation Conferences, Presentations & Training Material page.

Previous events

  • Data Citation Best Practice Roundtable: Brisbane, Aug 2012 - Program and registerEventbrite
  • Data Citation virtual Roundtables: Webinar, June 2012 - featured speakers from DataCite, Oak Ridge National Laboratories, Thomson Reuters Data Citation Index product.
  • Data Citation workshop at eResearch Australasia 2011: Workshop, Nov 2011 - featured DataCite, the experiences of early implementers of DOIs in Australia and ANDS "Cite My Data" service.

 

Related Links

If you are interested in DOIs, you may also be interested in the following related concepts. Information is available through:

The ANDS website:

External: