The Scholarly Kitchen

What’s Hot and Cooking In Scholarly Publishing


Building the Social and Technical Infrastructures to Transform Research Data Sharing One Plenary at a Time

  • By Phill Jones
  • Nov 18, 2021
  • Data Publishing
  • Infrastructure

The Research Data Alliance (RDA) is a community-driven, non-profit initiative that was originally set up in 2013 by the European Commission, the US NSF and NIST, and the Australian Department of Innovation. Right from the very start, RDA was an entirely independent, grassroots effort to build the social and technical infrastructure that supports open sharing and re-use of data. The most recent RDA plenary was an entirely virtual event spread across two weeks at the beginning of November, which I attended via Whova, Zoom, and a little bit of Gather, from the comfort and COVID-safety of my home office.

The RDA Plenary is very different from other conferences I have attended. The grassroots community focus visibly underpins everything about both RDA and its twice-yearly Plenaries. Each session is organized by an RDA group, including working groups, interest groups, communities of practice, and birds of a feather groups (convened for a single RDA plenary to gauge interest in a new topic). It can all seem a bit complicated from the outside, so there are web pages of instructions and explainer videos on the RDA website.


The sheer scale of the research data ecosystem

Developing the research data ecosystem from the ground up is a substantial challenge, with action required at many levels — from individual researchers and communities of practice to funder policies. At a subject level, RDA has domain- or subject-specific groups, which are often supported by learned societies, like the Earth, Space, and Environmental Sciences Interest Group (IG), which has chairs from both the European Geosciences and American Geophysical Unions. Some of these groups have very specific focuses and could even be considered to be quite granular.

At a higher level, there are groups that discuss common standards for data management plans, a metadata standards catalogue, and even how to engage researchers with good research management practices. Just about every conceivable angle is covered, at least to some extent.

With so much to participate in, it’s only possible to write about a small part of the action, so I’ve picked a couple of things that I was particularly interested in.

All about persistence

The first session I attended was bright and early (at least in my time zone) on the first day of the Plenary. I was invited to attend the Persistent Identifier Interest Group (PID IG). The discussion was very wide-ranging, with a series of prepared remarks from invited attendees and a vibrant open floor discussion in both the audio and text chat feeds.

One running theme was, as Tom Demeranville of ORCID put it, ‘persistence is a social problem rather than a technical one’, which might sound counterintuitive at first. When we look at how PIDs are defined by organizations like OpenAIRE and even ORCID, they are generally described as pointers that will always link to a particular digital resource, even if the URL changes. PIDs also have associated metadata that describes objects and enables linking to other PIDs to create an emergent knowledge graph. While that’s all true, it’s not a full definition of persistence, because somebody has to maintain all of those links and metadata. Otherwise, a PID system will suffer the same link-rot problems as every other web resource.
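To make the point concrete, here is a minimal sketch of the idea in Python. It is not how ORCID, DataCite, or any other real PID provider works; the `PidRegistry` class, the `pid:` identifiers, and the example.org URLs are all hypothetical. It only illustrates that a PID keeps resolving after a resource moves because somebody updates the record, and that links between PIDs are what form the graph.

```python
from dataclasses import dataclass, field


@dataclass
class PidRecord:
    """One entry in a hypothetical PID registry."""
    url: str                                       # current location of the resource
    metadata: dict = field(default_factory=dict)   # descriptive metadata
    related: set = field(default_factory=set)      # PIDs this record links to


class PidRegistry:
    """Toy in-memory registry; real PID systems are far richer."""

    def __init__(self):
        self._records = {}

    def register(self, pid, url, **metadata):
        self._records[pid] = PidRecord(url=url, metadata=metadata)

    def update_url(self, pid, new_url):
        # The maintenance step: someone has to do this when a resource moves.
        self._records[pid].url = new_url

    def link(self, pid, other_pid):
        # Record the relationship in both directions.
        self._records[pid].related.add(other_pid)
        self._records[other_pid].related.add(pid)

    def resolve(self, pid):
        return self._records[pid].url

    def neighbours(self, pid):
        return sorted(self._records[pid].related)


registry = PidRegistry()
registry.register("pid:dataset-1", "https://old.example.org/data/1", title="Example dataset")
registry.register("pid:person-1", "https://example.org/people/1", name="Example Researcher")
registry.link("pid:dataset-1", "pid:person-1")

# The dataset moves; the PID stays valid only because the record is updated.
registry.update_url("pid:dataset-1", "https://new.example.org/data/1")
print(registry.resolve("pid:dataset-1"))
print(registry.neighbours("pid:person-1"))
```

If nobody calls `update_url`, the identifier still exists but resolves to the old address, which is exactly the link-rot scenario described above.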

Building on the idea of PIDs as an organizational challenge, Natasha Simons of the Australian Research Data Commons (ARDC) talked about the need for investment to create and maintain PIDs. While the community is good at designing and creating the technical and organizational structures needed to create PIDs, she argued, more investment in communications and marketing is needed to make the value of PIDs visible. Only then, with the help of good governance and a sustainability model, can some PIDs transition from small grant-supported projects to sustainable organizations.

In my own short presentation, I focused on the need to continue to improve adoption. A major challenge in driving adoption is misaligned incentives. Simply put, metadata entered into a system that integrates with a PID (whether that be an ORCID iD, a DOI from Crossref or DataCite, a ROR ID, or a RAiD) only has value if other stakeholders are also adding their own metadata and systems integrations. In other words, adding metadata to the PID graph generally helps other stakeholders more than the person who entered it. For example, funders need researchers and institutions to report on the outputs that are funded by their grants, while publishers need to know what new research priorities are getting supported so that they know what content to acquire or products and services to develop.

The burning question that I put to the PID IG was, how do we align those incentives and make the value more obvious? Is the answer funder mandates or improved incentives? Participation reports? Central support for integrations from funders? Better targeting of products and services? Better communications and marketing? Or some combination of all of the above?

Metadata interoperability

Another theme that ran through my plenary experience was metadata interoperability. As Alice Meadows and I wrote in a previous post, metadata is important because: 

Metadata enables connections to be made between published articles, researchers, datasets, computer programs, institutions, grants, funders, and more, eventually including things like shared facilities

Several sessions at RDA touched on this issue, including the Research Data Architectures in Research Institutions IG. James Wilson of University College London described how they have been developing a research data ecosystem out of a collection of technologies including their EPrints repository, Researchfish for grant reporting, and their current research information system (CRIS), Symplectic Elements. Similarly, Kimi Keith of the University of Cape Town described their efforts to build a research data management ecosystem focused around their electronic Research Administration system, integrated into their CRIS (Clarivate Converis) and figshare. The idea is to reduce researcher burden while better meeting various research management use cases, like ensuring compliance with data management commitments.

Despite their very different research environments, both speakers agreed that a lack of interoperability between the systems that hold metadata was a serious impediment to automating research management.
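As a rough illustration of what that interoperability gap looks like in practice, the Python sketch below maps records from two systems onto a shared set of field names. Every field name here is invented for the example and does not reflect the actual schema of EPrints, Symplectic Elements, Converis, figshare, or any other product mentioned above.

```python
# Hypothetical field names for two systems; neither reflects a real product schema.
REPOSITORY_TO_COMMON = {"creator_id": "author_pid", "doi": "output_pid", "grant": "grant_id"}
CRIS_TO_COMMON = {"researcher_orcid": "author_pid", "output_doi": "output_pid", "award_ref": "grant_id"}


def to_common(record, crosswalk):
    """Rename known fields to the shared schema and collect the rest."""
    mapped, unmapped = {}, {}
    for key, value in record.items():
        if key in crosswalk:
            mapped[crosswalk[key]] = value
        else:
            unmapped[key] = value
    return mapped, unmapped


repo_record = {"creator_id": "person-1", "doi": "output-1", "embargo": "none"}
cris_record = {"researcher_orcid": "person-1", "output_doi": "output-1", "award_ref": "grant-7"}

repo_common, repo_unmapped = to_common(repo_record, REPOSITORY_TO_COMMON)
cris_common, _ = to_common(cris_record, CRIS_TO_COMMON)

# The two records describe the same output if their shared identifiers agree.
same_output = repo_common["output_pid"] == cris_common["output_pid"]
merged = {**repo_common, **cris_common}
print(same_output, merged, repo_unmapped)
```

Without an agreed crosswalk like this, someone has to write and maintain a separate mapping for every pair of systems, and fields with no counterpart (here, the made-up `embargo` field) are simply lost in transit.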

Conclusions

There were many other highlights of this year’s RDA plenary, certainly too many to go into here. Overall, from discussions of the most niche aspects of metadata or data schemas for a particular discipline to the highest-level discussions of strategic research data management, the need for better-accepted standards, best practices, workflows, and system interoperability is now clearer than ever.

If I have a concern at all, it’s that the good people at RDA can’t do all of this alone. Publishers, learned societies, institutions, libraries, and funders all have significant parts to play in building a better, more efficient, and more connected research infrastructure. While there are representatives of all of these stakeholders in RDA, more organizational and systemic work is needed. From the immediate benefits that publishers can glean from improved metadata about research and smoother processes, to systemic benefits for the research infrastructure that will accelerate science and save lives, it’s in all our interests to be a part of this transformation.

Phill Jones

@phillbjones

Phill Jones is a co-founder of MoreBrains Consulting Cooperative. MoreBrains works in open science, research infrastructure and publishing. As part of the MoreBrains team, Phill supports a diverse range of clients from publishers and learned societies to institutions and funders, on a broad range of strategic and operational challenges. He's worked in a variety of senior and governance roles in editorial, outreach, scientometrics, product and technology at such places as JoVE, Digital Science, and Emerald. In a former life, he was a cross-disciplinary research scientist at the UK Atomic Energy Authority and Harvard Medical School.


Official Blog of:

Society for Scholarly Publishing (SSP)

The mission of the Society for Scholarly Publishing (SSP) is to advance scholarly publishing and communication, and the professional development of its members through education, collaboration, and networking. SSP established The Scholarly Kitchen blog in February 2008 to keep SSP members and interested parties aware of new developments in publishing.

The Scholarly Kitchen is a moderated and independent blog. Opinions on The Scholarly Kitchen are those of the authors. They are not necessarily those held by the Society for Scholarly Publishing nor by their respective employers.

ISSN 2690-8085