In August 2022, the United States Office of Science and Technology Policy (OSTP) issued a memo (PDF) on ensuring free, immediate, and equitable access to federally funded research (a.k.a. the “Nelson memo”). Crossref is particularly interested in and relevant for the areas of this guidance that cover metadata and persistent identifiers—and the infrastructure and services that make them useful.
Funding bodies worldwide are increasingly involved in research infrastructure for dissemination and discovery.
Preprints have become an important tool for rapidly communicating and iterating on research outputs. There is now a range of preprint servers, some subject-specific, some based on a particular geographical area, and others linked to publishers or individual journals in addition to generalist platforms. In 2016 the Crossref schema started to support preprints and since then the number of metadata records has grown to around 16,000 new preprint DOIs per month.
TL;DR One of the things that makes me glad to work at Crossref is the principles to which we hold ourselves, and the most public and measurable of those must be the Principles of Open Scholarly Infrastructure, or POSI, for short. These ambitions lay out how we want to operate - to be open in our governance, in our membership and also in our source code and data. And it’s that openness of source code that’s the reason for my post today - on 26th September 2022, our first collaboration with the JSON Forms open-source project was released into the wild.
Ans: metadata and services are all underpinned by POSI.
Leading into a blog post with a question always makes my brain jump ahead to answer that question with the simplest answer possible. I was a nightmare English Literature student. ‘Was Macbeth purely a villain?’ ‘No’. *leaves exam*
Just like not giving one-word answers to exam questions, playing our role in the integrity of the scholarly record and helping our members enhance theirs takes thought, explanation, transparency, and work.
The web deposit form is suitable for making small numbers of deposits, and you do not need any knowledge of XML to use it. You can use this form to deposit metadata for journals, books, conference proceedings, reports, and dissertations. You can also upload NLM or JATS-formatted XML using this form.
How to use the web deposit form to register your content
Step one: Select your content type
Start at the web deposit form and select the type of content you want to register: journal, book, conference proceedings, report or dissertation. Different fields will appear depending on what you’ve chosen.
Step two: Add the content you want to register
You can deposit a journal-level DOI only or DOIs for each article within a given issue.
Register DOIs for articles - you can register articles for up to one issue in any deposit. On the first screen, enter information for the relevant journal or issue, then click add articles. After you’ve added information for each article, click on Add another article to register the next article. When you are done, click Finish.
Register a journal-level DOI only - on the first screen, enter the information for the relevant journal or issue and click Submit journal/issue DOI.
Please note: when you register your first item, be really careful about the journal title you enter - this will create a journal title record and any future submissions will have to match this. Your journal title doesn’t have to match the title in the ISSN portal, but if you do want it to match, make sure to check what this is before you register your first item.
You can register both book- and chapter-level information. Add you content and select Submit Book DOI to deposit a title-level DOI, or select Add Chapters to enter metadata for chapters attached to the book being registered. For series and sets, only one volume can be registered at a time.
Enter event and conference paper information. Select Add Papers to enter metadata for conference papers.
Select Submit Report DOI to deposit a single report DOI, or select Add Content Item if the report contains multiple chapters or papers.
Complete the fields and click Submit dissertation.
Step three: Login and submit your content
Login with your Crossref account credentials, and then add the email address that should receive the submission log email. Even if your login username is your email address, you still need to add an email address to receive the submission log email. It can be the same or different from the email you used as your login username. Finally, click Deposit.
Your submission is then added to our submission queue. When we’ve processed your file we’ll send you a log via email (to the address you gave us in step three). You must review this log to make sure your content was registered successfully. Learn more about error and warning messages.
We also send you a copy of the XML that has been generated by the web deposit form. This XML is just for your records - you don’t need to do anything with it. If changes or corrections need to be made to your metadata record, you can edit and submit the XML instead of re-entering your metadata into the form. If you do edit the XML, be sure to increment the value in the <timestamp> field to ensure a successful update.
Web deposit form limitations and how to work around them
There are some metadata elements that you can’t currently register as part of your initial deposit in the web deposit form. However, you can add many of these to an existing deposit later on, using our other tools. Here’s a list of the elements that can’t currently be included in your initial deposit with the web deposit form, and your options for adding them later:
archive locations, and article numbers or IDs can’t be registered using the web deposit form.
Funding and license information, Similarity Check URLs, STM-article sharing framework (stm-asf) license information, and text and data mining URLs can be added to existing DOIs in bulk using a supplemental metadata upload.
How to use the web deposit form to upload a JATS or NLM file
You can use the web deposit form to upload an XML file built according to the NLM or JATS document type definition (DTD) publishing tag set. Find out more.
How to use the web deposit form for supplemental metadata upload using a .csv file
Supplemental metadata upload enables members to add metadata elements to existing DOIs in bulk by uploading a .csv file via the web deposit form. You can use it to add funding metadata, license metadata, funding and license metadata together, Similarity Check URLs, or STM-article sharing framework (stm-asf) license metadata.
Click the choose file button next to csv file information and select your .csv file for upload
Enter the email address that should receive the submission log email
Click upload csv data
Some initial validation relating to formatting is performed upon upload. Incomplete or incorrect files will return an error message, and will not be deposited. If the file passes the initial validation, it will be converted to XML, and registered with us. Additional validation is performed upon deposit.
You will receive a submission log when your deposit is complete. Please review the log to be sure your DOIs have been updated successfully.
Please contact us with questions or comments about your .csv upload. If you are reporting problems with a .csv upload, please attach the .csv file to your support request.
Specifications for all .csv files
Files submitted for supplemental metadata upload using a .csv file must comply with these specifications:
Don’t use commas (,) or quotation marks (") within a column value
Dates must be in the format: YYYY-MM-DD
If metadata is not available for an item, leave the cell blank. If an entire column is not populated, you may omit it. Do not enter placeholders such as n/a or -, as this will cause your deposit to fail, or cause incorrect metadata to be attached to a DOI
Files may be up to 45 MB in size. We automatically split the file into batches of 4,000 DOIs for processing. We send you two emails for each batch: a submission log, and a copy of the submitted XML, so uploading a large file may result in many emails.
<funder_identifier>: funding agency identifier in the form of a DOI
<award_number>: grant number or other fund identifier
If a DOI has multiple funders, the DOI must be repeated for each funder. We recommend that all available metadata is deposited. If a piece of funding metadata is not available (for example, a grant number) the field should be left blank.
Open your file using spreadsheet software (such as MS Excel). Your file should contain two columns with the headings DOI and <item crawler="iParadigms">, where DOI is the DOI being updated and <item crawler="iParadigms"> is the URL being submitted for Similarity Check indexing. Here is an example:
Replace the example DOIs (10.5555/test1) and URLs (https://www.yoururl.com/pdf1) with your DOIs and URLs.
Be sure to save the file as .csv and not as .xlsx (or any other file type)
Some spreadsheet programs add additional quotation marks to the column headers, such as "DOI" or "<item crawler="iParadigms">". If your upload is not successful, please open your file in a text editor, and make sure the top line of the file is simply: DOI or <item crawler="iParadigms">. Edit if necessary, and resubmit.