Published Versions 3 Vol 2 (1) : 192–198 2019
How to (Easily) Extend the FAIRness of Existing Repositories
Abstract & Keywords
Abstract: Data repository infrastructures for academics have appeared in waves since the dawn of Web technology. These waves are driven by changes in societal needs, archiving needs and the development of cloud computing resources. As such, the data repository landscape has many flavors when it comes to sustainability models, target audiences and feature sets. One thing that links all data repositories is a desire to make the content they host reusable, building on the core principles of cataloging content for economical and research speed efficiency. The FAIR principles are a common goal for all repository infrastructures to aim for. No matter what discipline or infrastructure, the goal of reusable content, for both humans and machines, is a common one. As such, this is the first time that repositories can work toward a common goal that ultimately lends itself to interoperability. The idea that research can move further and faster as we un-silo these fantastic resources is an achievable one. This paper investigates the steps that existing repositories need to take in order to remain useful and relevant in a FAIR research world.
Keywords: FAIR data; Metadata; Interoperability; Repositories; Data curation
Article and author information
Cite As
M. Hahnel & D. Valen. How to (easily) extend the FAIRness of existing repositories. Data Intelligence 2(2020), 192–198. doi: 10.1162/dint_a_00041
Mark Hahnel
Both authors, M. Hahnel and D. Valen, contributed equally to the design and writing of the article.
Mark Hahnel is the CEO and founder of Figshare, which he created whilst completing his PhD in stem cell biology at Imperial College London. Figshare currently provides research data infrastructure for institutions, publishers and funders globally. He is passionate about open science and the potential it has to revolutionize the research community. For the last eight years, Mark has been leading the development of research data infrastructure, with the core aim of reusable and interoperable academic data. Mark sits on the board of DataCite and the advisory board for Directory of Open Access Journals (DOAJ). He was on the judging panel for the National Institutes of Health (NIH), Wellcome Trust Open Science prize and acted as an advisor for the Springer Nature master classes.
Dan Valen
Both authors, M. Hahnel and D. Valen, contributed equally to the design and writing of the article. D. Valen created the referenced data set.
Dan Valen joined Figshare as its first US-based employee in early 2014 to help researchers and organizations navigate trends in research data management. In his current role, he focuses on the development of Figshare community through engagement, strategic partnerships and educational outreach. Prior to working in the research data space at Figshare, Dan spent over 6 years at one of the largest scientific, technical, engineering and medical (STEM) publishers holding positions in editorial, trade publishing and electronic content licensing.
Publication records
Data Intelligence