Academia.eduAcademia.edu

Data Citation

description48 papers
group0 followers
lightbulbAbout this topic
Data citation is the practice of providing a formal reference to datasets in scholarly work, ensuring proper attribution, facilitating data discovery, and promoting reproducibility in research. It involves specifying the dataset's authors, title, publication year, and access information, similar to citing traditional academic publications.
lightbulbAbout this topic
Data citation is the practice of providing a formal reference to datasets in scholarly work, ensuring proper attribution, facilitating data discovery, and promoting reproducibility in research. It involves specifying the dataset's authors, title, publication year, and access information, similar to citing traditional academic publications.

Key research themes

1. How can data citation infrastructures and persistent identifiers (PIDs) enhance precise data findability and reuse in scholarly research?

This theme examines the development and implementation of data citation infrastructures, especially the role of persistent identifiers at granular levels, to better support data discoverability, reuse, and scholarly credit—core components of FAIR data principles. It focuses on challenges and solutions related to uniquely identifying datasets and their subcomponents, such as variables, to promote precise attribution and reproducibility across disciplines.

Key finding: This paper identifies a significant gap in current Social Sciences data citation practices where Persistent Identifiers (PIDs) are assigned only at the study level but not at the finer attribute or variable level, hindering... Read more
Key finding: This study details VAMDC’s implementation of Research Data Alliance (RDA) recommendations on dynamic data citation, including the use of versioned, time-stamped queries stored in a Query Store to enable reproducible... Read more
Key finding: Springer Nature’s development of a standardized, tiered framework of research data policies across journals — ranging from encouragement of data sharing up to mandatory open data and peer review of data — illustrates a... Read more

2. What are the prevalent practices and challenges in formal vs. informal data citation in scholarly publications, particularly in biomedical and social sciences?

This theme investigates the dynamics between formal data citation—references included in bibliographies or reference lists using standardized metadata—and informal citation practices embedded in main texts or acknowledgments. It explores disciplinary differences, barriers to standardized data citation, and the implications of informal citation on researcher recognition and data discoverability.

Key finding: Through an automated and manual analysis of biomedical literature, this study finds that informal mentions of datasets within article texts far exceed formal data citations indexed in reference lists. This discrepancy leads... Read more
Key finding: This paper showcases the workflows of ICPSR librarians who monitor formal and informal mentions of their datasets in social science literature, underscoring the complexity of detecting uncited or informally referenced uses.... Read more
Key finding: Highlighting the lag in proper data citation despite advances in data management infrastructure and FAIR principles, this editorial argues data sharing without formal citation undermines researcher credit and open science... Read more

3. What conceptual distinctions between software and data influence their citation and attribution practices in scientific research?

Research outputs encompass both software and data, yet these have distinct natures that affect how each should be cited, credited, and reused. This theme interrogates the epistemological and legal differences between software and data, implications for citation norms, and how these distinctions inform best practices for scholarly attribution.

Key finding: The paper delineates critical distinctions between software and data: software is executable, creative, and generally copyright-protected, whereas data are empirical observations meant to provide evidence. This difference... Read more

All papers in Data Citation

by Tariq Jan and 
1 more
The study systematically analyzes Research Support Services (RSS) in academic libraries through a comprehensive review of scholarly literature. The objective is to evaluate publication trends, geographical distribution, sample... more
Considers the promotion of data citation and attribution standards by: 1) establishing an education and training mandate, drawing on existing resources and standards, and engaging researchers early in their careers; and 2) exploring the... more
Presented at the ORCID Research Visibility Workshop, Merensky Library, University of Pretoria, Pretoria, South Africa, 14 August 2019
Artificial Intelligence (AI) presents substantial opportunities to advance library operations and user experiences. However, its integration in academic libraries in Ghana has been relatively unexplored. This study investigates the... more
Poster: TROLLing (opendata.uit.no) is an international archive for open linguistic data and statistical code (e.g. R scripts), launched in 2014 at UiT The Arctic University of Norway. With the increasing demand for archiving and sharing... more
This paper discusses many of the issues associated with formally publishing data in academia, focusing primarily on the structures that need to be put in place for peer review and formal citation of datasets. Data publication is becoming... more
La transmisión del conocimiento debería verse favorecida por las oportunidades que brindan las TIC como medio de acceso y distribución de objetos digitales. Sin embargo, las barreras que impiden el acceso y reutilización de los trabajos... more
Notes from the Third Plenary for the Research Data Alliance in Dublin, Ireland on March 26 to 28, 2014 with focus on starting an institutional research data repository
Detailed, accessible methods are essential for reproducibility, trust in science and scientific advancement; yet, many studies suggest that the reporting of methodological details in life sciences research publications is often... more
In this case study, we aim to explore the characteristics and the reception of files uploaded to Zenodo, and the role the repository plays itself in generating usage. To this end, we first apply descriptive statistics on Zenodo's full set... more
The aim of this study is to explore the phenomenon of research software citation and, in particular, to draw attention to the increasing importance of this form of citation in scholarly communication. This research sheds light on the... more
Preservation, Discoverability, and Access (1) What specific Federal policies would encourage public access to and the preservation of broadly valuable digital data resulting from federally funded scientific research, to grow the U.S.... more
International audienceMitochondrial bioenergetics contributes to important biological processes and its dysfunction underlies some diseases, but its assessment requires invasive methods involving intracellular staining and chemical... more
The enormous growth in research data generated today has highlighted the value of data management (RDM) to make research FAIR (Findable, Accessible, Interconnected and Reusable). Appropriate data instructs researchers to use and reuse... more
This is the published version of the Scholix metadata scheme The goal of the Scholix initiative is to establish a high level interoperability framework for exchanging information about the links between scholarly literature and data. It... more
VAMDC bridged the gap between atomic and molecular (A&M) producers and users by providing an interoperable e-infrastructure connecting A&M databases, as well as tools to extract and manipulate those data. The current paper highlights how... more
The present paper examines the developments in the Information and Communication Technology and its application to the Library and Information Science. As it developed the library's functions and services, there is a need for training of... more
In recent years we have seen a growing adoption of Archival Resource Key (ARK) identi ers in France and in French-speaking countries, a growing reliance on National Library of France (BnF) ARKs for data dissemination, and a growing demand... more
The role of DataCite and other large-scale infrastructures is evolving from identifying things to connecting things and DataCite metadata includes many ways to make connections. We will concentrate on relatedIdentifiers (and citations),... more
Libraries hold a long history of a multidimensional focus on collecting, storing, organizing, preserving and providing access to information resources for various types of users. Data is nothing ne ...
The role of DataCite and other large-scale infrastructures is evolving from identifying things to connecting things and DataCite metadata includes many ways to make connections. We will concentrate on relatedIdentifiers (and citations),... more
Slides from the symposium and panel discussion at the event "Data Citation and Attribution for Reproducible Research in Linguistics," Annual Meeting of the Linguistic Society of America, Austin, TX, 5 January 2017.
Since the 2011 workshop, the Task Group has undertaken a series of activities designed to build upon the international body of knowledge on data citation and attribution practices. The report presented here represents the next step... more
PARSEC Introduction: Advances in science, both today and in the future, depend on the openness, accessibility and reusability of data, software, samples, and data products. As environmental, ecological, and geological data represent... more
The Inter-university Consortium for Political and Social Research (ICPSR)'s Bibliography of Data-Related Literature was established in 2000 to match datasets held in ICPSR topical archives to works resulting from data analyses. As a... more
Poster presented at the 2019 American Library Association Annual Conference in Washington, D.C., June 20-25, 2019
This article presents a practical roadmap for scholarly data repositories to implement data citation in accordance with the Joint Declaration of Data Citation Principles, a synopsis and harmonization of the recommendations of major... more
Data citation to reflect instances of data sharing and re-use is becoming more common, although it is not yet widely adopted. We investigate how common formal and informal data citation are in bioscience/biomedical research. We found that... more
Data citation, where products of research such as data sets, software, and tissue cultures are shared and acknowledged, is becoming more common in the era of Open Science. Currently, the practice of formal data citation-where data... more
This study examines characteristics of data sharing and data re-use in Genetics and Heredity, where data citation is most common. This study applies an exploratory method because data citation is a relatively new area. The Data Citation... more
Slides from the symposium and panel discussion at the event "Data Citation and Attribution for Reproducible Research in Linguistics," Annual Meeting of the Linguistic Society of America, Austin, TX, 5 January 2017.
Recommended Citation Champieux, Robin; Kramer, Bianca; Bosman, Jeroen; Bruno, Ian; Buckland, Amy; Callaghan, Sarah; Chapman, Chris; Hagstrom, Stephanie; Martone, MaryAnn E.; and O'Donnell, Daniel Paul (2016) "Finding the... more
Poster: TROLLing (opendata.uit.no) is an international archive for open linguistic data and statistical code (e.g. R scripts), launched in 2014 at UiT The Arctic University of Norway. With the increasing demand for archiving and sharing... more
Slides from the symposium and panel discussion at the event "Data Citation and Attribution for Reproducible Research in Linguistics," Annual Meeting of the Linguistic Society of America, Austin, TX, 5 January 2017.
Since the 2011 workshop, the Task Group has undertaken a series of activities designed to build upon the international body of knowledge on data citation and attribution practices. The report presented here represents the next step... more
Call for papers for the Special Issue "Data reuse: What new information can we learn from used data?" One of the most critical tasks of executing empirical research in Business Administration is to collect reliable data. This is... more
Considers the promotion of data citation and attribution standards by: 1) establishing an education and training mandate, drawing on existing resources and standards, and engaging researchers early in their careers; and 2) exploring the... more
Working group reports from the four communities represented at the workshop: (1) Archivists, (2) Journal Editors, (3) IT/Big Data, (4) Ordinary Working Linguists. Presented at the first workshop on Developing Standards for Data Citation... more
oAs stated in NSF's "Information about the Data Management Plan Required for all Proposals" for Biological Sciences, the Federal government defines data (OMB Circular A-110) as: "…the recorded factual material commonly accepted in the... more
In an effort to lead our community in following modern data citation practices by formally citing data used in published research and implementing standards to facilitate reproducible research results and data, while also producing... more
The Evolution of Data Citation: From Principles to Implementation
Reproducibility and reusability of research results is an important concern in scientific communication and science policy. A foundational element of reproducibility and reusability is the open and persistently available presentation of... more
Download research papers for free!