Blog

 3 minute read.

Better preprint metadata through community participation

Preprints have become an important tool for rapidly communicating and iterating on research outputs. There is now a range of preprint servers, some subject-specific, some based on a particular geographical area, and others linked to publishers or individual journals in addition to generalist platforms. In 2016 the Crossref schema started to support preprints and since then the number of metadata records has grown to around 16,000 new preprint DOIs per month.

Preprints aren’t the same as journal articles, books, or conference papers. They have unique features, and how they are viewed and integrated into the publishing process has evolved over the past six years. For this reason, we have been revisiting the preprint metadata schema and decided that the best approach would be to form an advisory group (AG) of preprint practitioners and experts to help us.

The AG has identified a number of areas in which preprint metadata could be improved. Four of these were considered to have the highest priority:

  1. Withdrawal and removal of preprints.
  2. Preprints as an article type (not a subtype of posted content) in the schema.
  3. Relationships between preprints and other outputs.
  4. Versioning of preprints.

The members of the AG set to work with great enthusiasm, sharing perspectives and expertise. This led to a first tranche of recommendations shared for feedback earlier this year, and we’re grateful for engagement and feedback from the community over the last few months.

What did the community say?

Some of the points raised in the feedback were:

  • Could the origin of a withdrawal be included in the metadata, in particular whether it was requested by an author or another party?
  • Can the metadata represent when a preprint has been submitted to a journal and what stage it is in the editorial process?
  • Crossref is not alone in looking at preprint metadata, and several NISO groups are also engaged in related work.
  • Interoperability and the ability to create relationships with identifiers beyond DOIs is important to maintain an accurate and comprehensive record of research outputs.

These will form the basis for ongoing discussions.

What happens next?

There are three next steps that we will be taking.

  1. The recommendations outline only the outcomes of discussions in a relatively brief format. We have been working on a more detailed paper to communicate more about what was discussed and provide some extra justification and alternatives.
  2. The AG will continue to meet and discuss the points raised during consultation on the recommendations, along with topics that were considered a lower priority at an earlier stage.
  3. We will draw up a set of proposals for specific changes to the metadata schema that will reflect the outcomes of the recommendations and discussions.

Although the initial period for feedback on preprint metadata has ended, we welcome feedback at any time. If you would like to get in touch, please contact me or any member of the advisory group.

Related pages and blog posts

Page owner: Martyn Rittman   |   Last updated 2022-November-09