Data Citation

About this topic

Data citation is the practice of providing a formal reference to datasets in scholarly work, ensuring proper attribution, facilitating data discovery, and promoting reproducibility in research. It involves specifying the dataset's authors, title, publication year, and access information, similar to citing traditional academic publications.
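
As an illustration of the elements listed above (authors, title, publication year, and access information), the following minimal Python sketch assembles a dataset reference string. The class name, the field names, the "[Data set]" label, and the example DOI are all hypothetical and do not reproduce any particular citation standard.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DatasetCitation:
    """Hypothetical container for the core elements of a data citation."""
    authors: List[str]
    title: str
    year: int
    publisher: str                  # assumed: the repository or archive holding the data
    identifier: str                 # access information, e.g. a DOI or other persistent identifier
    version: Optional[str] = None   # assumed optional element

    def format(self) -> str:
        # Join the elements in a fixed order; the ordering is an illustrative choice.
        authors = "; ".join(self.authors)
        version = f" (Version {self.version})" if self.version else ""
        return (
            f"{authors} ({self.year}). {self.title}{version} [Data set]. "
            f"{self.publisher}. {self.identifier}"
        )


# Example with made-up values (the DOI below is a placeholder, not a real dataset).
citation = DatasetCitation(
    authors=["Doe, J.", "Roe, R."],
    title="Example Survey Data",
    year=2020,
    publisher="Example Repository",
    identifier="https://doi.org/10.1234/example",
    version="1.0",
)
print(citation.format())
```

Keeping the elements as separate fields, rather than as one free-text string, is what lets the same record be rendered in different citation styles or exported as metadata.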

Key research themes

1. How can data citation infrastructures and persistent identifiers (PIDs) enhance precise data findability and reuse in scholarly research?

This theme examines the development and implementation of data citation infrastructures, especially the role of persistent identifiers at granular levels, to better support data discoverability, reuse, and scholarly credit—core components of FAIR data principles. It focuses on challenges and solutions related to uniquely identifying datasets and their subcomponents, such as variables, to promote precise attribution and reproducibility across disciplines.

The hurdles of current data citation practices and the adding-value of providing PIDs below study level

by Claus-Peter Klas

2023, Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries

Key finding: This paper identifies a significant gap in current Social Sciences data citation practices where Persistent Identifiers (PIDs) are assigned only at the study level but not at the finer attribute or variable level, hindering...

Implementing in the VAMDC the New Paradigms for Data Citation from the Research Data Alliance

by Marie-Lise Dubernet

2023, Data Science Journal

Key finding: This study details VAMDC’s implementation of Research Data Alliance (RDA) recommendations on dynamic data citation, including the use of versioned, time-stamped queries stored in a Query Store to enable reproducible...
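
The idea of storing versioned, time-stamped queries in a Query Store, as mentioned in the key finding above, can be sketched roughly as follows. This is an assumption-laden illustration in Python, not VAMDC's implementation: the class names, the `query:` identifier prefix, the whitespace normalisation, and the use of SHA-256 over sorted result rows are all hypothetical choices made for the example.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Dict, List


@dataclass(frozen=True)
class StoredQuery:
    """Hypothetical record kept for each citable query."""
    pid: str            # identifier assigned to the query (made-up scheme)
    query_text: str     # whitespace-normalised query
    data_version: str   # version of the data the query ran against
    executed_at: str    # UTC timestamp of first registration
    result_hash: str    # checksum of the result set, for later verification


class QueryStore:
    """In-memory sketch of a query store; a real one would be persistent."""

    def __init__(self) -> None:
        self._records: Dict[str, StoredQuery] = {}

    def register(self, query_text: str, data_version: str,
                 result_rows: List[str]) -> StoredQuery:
        # Normalise whitespace so trivially different spellings share one record.
        normalized = " ".join(query_text.split())
        # Sort rows before hashing so the checksum does not depend on row order.
        joined = "\n".join(sorted(result_rows))
        result_hash = hashlib.sha256(joined.encode("utf-8")).hexdigest()
        key = f"{normalized}|{data_version}|{result_hash}"
        pid = "query:" + hashlib.sha256(key.encode("utf-8")).hexdigest()[:12]
        if pid not in self._records:
            self._records[pid] = StoredQuery(
                pid=pid,
                query_text=normalized,
                data_version=data_version,
                executed_at=datetime.now(timezone.utc).isoformat(),
                result_hash=result_hash,
            )
        return self._records[pid]

    def resolve(self, pid: str) -> StoredQuery:
        # Look up the stored query so the cited subset can be re-executed and checked.
        return self._records[pid]


store = QueryStore()
record = store.register(
    "SELECT * FROM lines WHERE species = 'CO'", "2023-01", ["row-b", "row-a"]
)
print(record.pid, record.executed_at)
```

In this sketch, citing a data subset means citing `record.pid`; a reader who resolves that identifier obtains the query, the data version, and a checksum against which a re-run can be compared.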

Implementing in the VAMDC the New Paradigms for Data Citation from the Research Data Alliance

by Marie-Lise Dubernet

2023, Data Science Journal

Key finding: Springer Nature’s development of a standardized, tiered framework of research data policies across journals, ranging from encouragement of data sharing up to mandatory open data and peer review of data, illustrates a...

2. What are the prevalent practices and challenges in formal vs. informal data citation in scholarly publications, particularly in biomedical and social sciences?

This theme investigates the dynamics between formal data citation—references included in bibliographies or reference lists using standardized metadata—and informal citation practices embedded in main texts or acknowledgments. It explores disciplinary differences, barriers to standardized data citation, and the implications of informal citation on researcher recognition and data discoverability.

Informal data citation for data sharing and reuse is more common than formal data citation in biomedical fields

by Hyoungjoo Park et al.

2018, Journal of the Association for Information Science and Technology

Key finding: Through an automated and manual analysis of biomedical literature, this study finds that informal mentions of datasets within article texts far exceed formal data citations indexed in reference lists. This discrepancy leads...

Data Citation Detectives: The Role of a Bibliographer for a Social Science Data Archive

by Sarah Burchart

2023

Key finding: This paper showcases the workflows of ICPSR librarians who monitor formal and informal mentions of their datasets in social science literature, underscoring the complexity of detecting uncited or informally referenced uses....

زبان فارسی در دربار اتابکان موصل [The Persian Language at the Court of the Atabegs of Mosul]

by Ali Safari Aq-qale

2019, Gozaresh-e Miras

Key finding: Highlighting the lag in proper data citation despite advances in data management infrastructure and FAIR principles, this editorial argues data sharing without formal citation undermines researcher credit and open science...

3. What conceptual distinctions between software and data influence their citation and attribution practices in scientific research?

Research outputs encompass both software and data, yet these have distinct natures that affect how each should be cited, credited, and reused. This theme interrogates the epistemological and legal differences between software and data, implications for citation norms, and how these distinctions inform best practices for scholarly attribution.

Software vs. data in the context of citation

by Fernando Rios

2024

Key finding: The paper delineates critical distinctions between software and data: software is executable, creative, and generally copyright-protected, whereas data are empirical observations meant to provide evidence. This difference...

All papers in Data Citation

Assessing Research Support Services in Academic Libraries: A Systematic Literature Review

by Tariq Jan et al.

2025, Annals of Library and Information Studies

The study systematically analyzes Research Support Services (RSS) in academic libraries through a comprehensive review of scholarly literature. The objective is to evaluate publication trends, geographical distribution, sample...

11 - Minipresentations on educating the linguistics community

by Geoffrey Nathan

2025

Considers the promotion of data citation and attribution standards by: 1) establishing an education and training mandate, drawing on existing resources and standards, and engaging researchers early in their careers; and 2) exploring the...

Figshare integration with ORCID

by Johann Van Wyk

2025

Presented at the ORCID Research Visibility Workshop, Merensky Library, University of Pretoria, Pretoria, South Africa, 14 August 2019

The integration of Artificial Intelligence tools in academic libraries within Ghana

by Adjei Silas

2024, Ghana Library Journal

Artificial Intelligence (AI) presents substantial opportunities to advance library operations and user experiences. However, its integration in academic libraries in Ghana has been relatively unexplored. This study investigates the...

TROLLing: Scope and operation of an open repository for linguistic datasets

by Laura Janda

2024

Poster: TROLLing (opendata.uit.no) is an international archive for open linguistic data and statistical code (e.g. R scripts), launched in 2014 at UiT The Arctic University of Norway. With the increasing demand for archiving and sharing...

Citation and Peer Review of Data: Moving Towards Formal Data Publication

by Brian Matthews

2024, IJDC

This paper discusses many of the issues associated with formally publishing data in academia, focusing primarily on the structures that need to be put in place for peer review and formal citation of datasets. Data publication is becoming...

Acceso abierto a los datos de investigación, una vía hacia la colaboración científica [Open access to research data, a path toward scientific collaboration]

by Remedios Melero

2024, Revista Espanola De Documentacion Cientifica

Knowledge transfer should be facilitated by the opportunities offered by information technologies, as they affect the access and distribution of digital objects. However, barriers to the access and reuse of scholarly research, whether economic or copyright-related, inhibit the sharing of this valuable common good. The open access movement promotes the elimination of these barriers and advocates for an open access culture of sharing and reusing materials, while guaranteeing that authors be duly acknowledged and that the data be used responsibly. If scientific papers have historically been essential for the communication of science, in the digital age it is now their building blocks that have gained greater importance, especially the observational, descriptive or experimental data that underpin the articles. Open research data can be reused, transformed by new methods or aggregated to other sources. Open access to research data avoids redundancy, provides transparency on how they have been obtained and allows for their validation. This work provides an overview of some initiatives and recommendations on how to share and manage research data and foster open access to it, as a means of collaboration between groups or individuals working in similar disciplines.

Keywords: Open access; data; data repositories; data citation; open access policies.

Notes from Research Data Alliance Plenary Meeting in Dublin, Ireland

by Jana Porsche

2024

Notes from the Third Plenary for the Research Data Alliance in Dublin, Ireland on March 26 to 28, 2014 with focus on starting an institutional research data repository

Promoting Reusable and Open Methods and Protocols (PRO-MaP): Draft recommendations to improve methodological clarity in life sciences publications

by Sofia B. Leite

2024

Detailed, accessible methods are essential for reproducibility, trust in science and scientific advancement; yet, many studies suggest that the reporting of methodological details in life sciences research publications is often...

Practices, Trends, and Recommendations in Technical Appendix Usage for Selected Data-Intensive Disciplines

by Stephen Abrams

2024

Persistent Identifiers for Open Scholarship

by Niklas C Zimmer

2024

NeDICC - Identifiers for Everything

by Niklas C Zimmer

2024

Zenodo in the Spotlight of Traditional and New Metrics

by Juan Gorraiz

2024, Frontiers in Research Metrics and Analytics

In this case study, we aim to explore the characteristics and the reception of files uploaded to Zenodo, and the role the repository plays itself in generating usage. To this end, we first apply descriptive statistics on Zenodo's full set...

DataCite Best Practice Guide

by Christiane Bayer

2024, Zenodo (CERN European Organization for Nuclear Research)

Research software citation in the Data Citation Index: Current practices and implications for research software sharing and reuse

by Dietmar Wolfram

2024, Journal of Informetrics

The aim of this study is to explore the phenomenon of research software citation and, in particular, to draw attention to the increasing importance of this form of citation in scholarly communication. This research sheds light on the...

... journal articles (Howison & Bullard, 2016; Pan et al., 2015). Weber and Thomer (2014) determined that only 13% of the roughly 1000 publications they examined explicitly mentioned the software package used in generating research outcomes, and that most of these mentions specified only the software used for the data analysis, such as Statistical Package for the Social Sciences (SPSS) and MATrix LABoratory (MATLAB); at the same time, more than half of the articles included personal acknowledgments of individuals for assistance in the development of various pieces of software. Software was noticeably uncited in the PLOS ONE journals published in 2014 (Pan et al., 2015), but the more recent study cited above (Li et al., 2017) found formal software citation to be widespread owing to the establishment of official instructions by journal publishers. The FORCE11 Software Citation Implementation Working Group (2018) is developing guidelines for the implementation of software citation principles.

... repository for sharing research data outputs, including software; the Astrophysics Source Code Library (ASCL; 2.28%) provides access to software to support astronomy and astrophysics research; and ModelDB (2.96%) is a repository for computational neuroscience models. These 6 repositories, of the more than 350 indexed by the DCI, accounted for nearly all (>99%) of the software records in the DCI at the time this research was conducted; 6 other repositories together accounted for the remaining 0.33% of the records.

[Caption: Relative distribution of software sharing records among the top repositories indexed by DCI.]

[Caption: Summary of general metadata field usage for software sharing in the DCI.]

[Caption: Comparison of identifiers used by various repositories included in the DCI.]

[Caption: DCI-based software citations based on the year of software development for the top four cited repositories.]

[Caption: Analysis of software sharing and formal citation by repositories in the DCI.]

Response to RFI: 'Public Access to Digital Data Resulting From Federally Funded Scientific Research' Office of Science and Technology Policy

by George Alter

2023

Preservation, Discoverability, and Access (1) What specific Federal policies would encourage public access to and the preservation of broadly valuable digital data resulting from federally funded scientific research, to grow the U.S....

Label-free detection of mitochondrial activity with Microwave Dielectric Spectroscopy Research Article

by Mary Poupot

2023

Mitochondrial bioenergetics contributes to important biological processes and its dysfunction underlies some diseases, but its assessment requires invasive methods involving intracellular staining and chemical...

Making research data discoverable: an outreach activity of Datacite

by Prof. Rupak Chakravarty

2023, Zenodo (CERN European Organization for Nuclear Research)

The enormous growth in research data generated today has highlighted the value of research data management (RDM) in making research FAIR (Findable, Accessible, Interoperable and Reusable). Appropriate data management guides researchers to use and reuse data with appropriate citations and to attribute it to its author. Data citation refers to the practice of providing a reference to data in the same way that researchers routinely provide bibliographic references to printed resources. In this regard, the objective of this paper is to investigate the activities of the DataCite website in managing research data.

Methodology: The study examined the website of DataCite, a non-profit organization that provides persistent identifiers (DOIs). The research examines the statistics systems and other critical resources. Registrations by the Collective group and most involved repositories are included in the statistical approaches. The basic resources include top executives, OAI-PMH, DataCite Public Roadmap, DataCite Commons, DataCite/ORCID Auto-update and Service Providers. The outcomes were analysed with MS Excel.

Results: It is noted that there were 293 members of the registry from different countries. The USA was at the top of the 137 members according to registration, while at least one was located in India, Finland, Spain, etc. Germany was listed as the top member and most of the repository holding companies. Datafirst is the only server found in an Indian context. DataCite Commons was found to be a discovery tool that allows simple searches by works, individuals and organisations, while providing users with a detailed overview of the relationships between the entities in the research setting. Using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), the DataCite service exposes metadata stored in the DataCite Metadata Store (MDS). DataCite Auto-update unambiguously categorises researchers and provides tools to automate the link between researchers and their creative work.

Scholix Metadata Schema for Exchange of Scholarly Communication Links

by Wouter Haak

2023

This is the published version of the Scholix metadata scheme. The goal of the Scholix initiative is to establish a high level interoperability framework for exchanging information about the links between scholarly literature and data. It...

Implementing in the VAMDC the New Paradigms for Data Citation from the Research Data Alliance

by Marie-Lise Dubernet

2023, Data Science Journal

VAMDC bridged the gap between atomic and molecular (A&M) producers and users by providing an interoperable e-infrastructure connecting A&M databases, as well as tools to extract and manipulate those data. The current paper highlights how...

Training of the Library Professionals in Digital Era: Key Issues

by Dr Shivaputrappa Kattimani

2023

The present paper examines developments in Information and Communication Technology and their application to Library and Information Science. As these have developed the library's functions and services, there is a need for training of...

ARK is in the Air: ARKs Trending in the French-speaking Area and the BnF’s Role in the ARK Story

by Thomas Ledoux

2023, HAL (Le Centre pour la Communication Scientifique Directe)

In recent years we have seen a growing adoption of Archival Resource Key (ARK) identifiers in France and in French-speaking countries, a growing reliance on National Library of France (BnF) ARKs for data dissemination, and a growing demand...

Research Data Repositories – Implications of Organization and Infrastructure on Use and Discovery

by Jeremy McLaughlin

2023

by Prof Vishwas Chavan

2023

DataCite Metadata: Getting Connected!

by Mohamed Yahia

2023

The role of DataCite and other large-scale infrastructures is evolving from identifying things to connecting things and DataCite metadata includes many ways to make connections. We will concentrate on relatedIdentifiers (and citations),...

(Big) Data in Library and Information Science: A Brief Overview of Some Important Problem Areas

by Koraljka Golub

2023, J. Univers. Comput. Sci.

Libraries hold a long history of a multidimensional focus on collecting, storing, organizing, preserving and providing access to information resources for various types of users. Data is nothing ne ...

Symposium & Panel Discussion: Data Citation and Attribution for Reproducible Research in Linguistics

by Shobhana L Chelliah

2023

Slides from the symposium and panel discussion at the event "Data Citation and Attribution for Reproducible Research in Linguistics," Annual Meeting of the Linguistic Society of America, Austin, TX, 5 January 2017.

Out of Cite, Out of Mind: The Current State of Practice, Policy, and Technology for the Citation of Data

by Paul Uhlir

2023, Data Science Journal

Since the 2011 workshop, the Task Group has undertaken a series of activities designed to build upon the international body of knowledge on data citation and attribution practices. The report presented here represents the next step...

Emerging Role of Librarians in Data Publication

by Ed Urban

2023

Automated Attribution and Credit for Data: Connecting Publication to Data – and Data to Data Creators

by Alison Specht

2023

PARSEC Introduction: Advances in science, both today and in the future, depend on the openness, accessibility and reusability of data, software, samples, and data products. As environmental, ecological, and geological data represent...

Data Citation Detectives: The Role of a Bibliographer for a Social Science Data Archive

by Sarah Burchart

2023

The Inter-university Consortium for Political and Social Research (ICPSR)'s Bibliography of Data-Related Literature was established in 2000 to match datasets held in ICPSR topical archives to works resulting from data analyses. As a...

Continuous Curation of a Digital Bibliography Showcasing Criminal Justice Data Use

by Sarah Burchart

2023

Poster presented at the 2019 American Library Association Annual Conference in Washington, D.C., June 20-25, 2019

A Data Citation Roadmap for Scholarly Data Repositories

by Gustavo Durand

2023

This article presents a practical roadmap for scholarly data repositories to implement data citation in accordance with the Joint Declaration of Data Citation Principles, a synopsis and harmonization of the recommendations of major...

Is informal data citation for data sharing and re‐use more common than formal data citation?

by Dietmar Wolfram

2023, Proceedings of the Association for Information Science and Technology

Data citation to reflect instances of data sharing and re-use is becoming more common, although it is not yet widely adopted. We investigate how common formal and informal data citation are in bioscience/biomedical research. We found that...

Informal data citation for data sharing and reuse is more common than formal data citation in biomedical fields

by Dietmar Wolfram

2023, Journal of the Association for Information Science and Technology

Data citation, where products of research such as data sets, software, and tissue cultures are shared and acknowledged, is becoming more common in the era of Open Science. Currently, the practice of formal data citation-where data...

An examination of research data sharing and re-use: implications for data citation practice

by Dietmar Wolfram

2023, Scientometrics

This study examines characteristics of data sharing and data re-use in Genetics and Heredity, where data citation is most common. This study applies an exploratory method because data citation is a relatively new area. The Data Citation...

Symposium & Panel Discussion: Data Citation and Attribution for Reproducible Research in Linguistics

by Shobhana Chelliah

2023

Finding the principles of the commons: a report of the Force11 Scholarly Communications Working Group

by Daniel O'Donnell

2023, Collaborative Librarianship

Recommended Citation: Champieux, Robin; Kramer, Bianca; Bosman, Jeroen; Bruno, Ian; Buckland, Amy; Callaghan, Sarah; Chapman, Chris; Hagstrom, Stephanie; Martone, MaryAnn E.; and O'Donnell, Daniel Paul (2016) "Finding the...

TROLLing: Scope and operation of an open repository for linguistic datasets

by Stein Høydalsvik

2023

Out of Cite, Out of Mind: The Current State of Practice, Policy, and Technology for the Citation of Data

by Elizabeth Arnaud

2022, Data Science Journal

A Data Sharing Story

by Mercè Crosas

2022, Journal of eScience Librarianship

Call for Papers | Data reuse: What new information can we learn from used data?

by Henrique C Martins

2022

Call for papers for the Special Issue "Data reuse: What new information can we learn from used data?" One of the most critical tasks of executing empirical research in Business Administration is to collect reliable data. This is...

11 - Minipresentations on educating the linguistics community

by Geoffrey Nathan

2022

12-15 - Working Group final reports

by Mandana Seyfeddinipur

2022

Working group reports from the four communities represented at the workshop: (1) Archivists, (2) Journal Editors, (3) IT/Big Data, (4) Ordinary Working Linguists. Presented at the first workshop on Developing Standards for Data Citation...

Dataset metadata

by Sai Deng

2022

As stated in NSF's "Information about the Data Management Plan Required for all Proposals" for Biological Sciences, the Federal government defines data (OMB Circular A-110) as: "…the recorded factual material commonly accepted in the...

A FAIR-Based Approach to Enhancing the Discovery and Re-Use of Transcriptomic Data Assets for Nuclear Receptor Signaling Pathways

by apollo mcowiti

2022, Data Science Journal

In an effort to lead our community in following modern data citation practices by formally citing data used in published research and implementing standards to facilitate reproducible research results and data, while also producing...

The Evolution of Data Citation: From Principles to Implementation

by Mercè Crosas

2022, IASSIST Quarterly

Achieving human and machine accessibility of cited data in scholarly publications

by Mercè Crosas

2022

Reproducibility and reusability of research results is an important concern in scientific communication and science policy. A foundational element of reproducibility and reusability is the open and persistently available presentation of...