Metadata is communication; it can tell a story about research and paint a picture for others to respond to and learn from, across the world and throughout the forthcoming generations. Metadata can feel technical with words like ‘infrastructure’ and ‘schema’, and sometimes, like tech in general, it comes with hyperbole. But metadata really is part art (storytelling and pictures) and part science (structured models and standards) with both aspects being equally important, and requiring people as well as systems. That necessary combination of human and machine involvement also makes metadata challenging.
Once a year we release all metadata records for content registered with Crossref in a public data file. This year’s version, containing nearly 180 million records, is now available. It includes metadata associated with all Crossref-registered DOIs in JSON-lines format.
Crossref Ambassadors act as local points of contact, meeting editors, librarians, researchers, and institutions to help them navigate Crossref services and understand how strong metadata supports visibility, integrity, and trust in research. They explain how to participate in our rich network of connections between works, people, and institutions, in ways that make sense in their own contexts. And last year, being our 25th anniversary, Ambassadors also massively contributed to our celebrations!
We have renewed our partnership with DOAJ to focus on a new set of objectives that reflect both organisations’ commitment to improving sustainable and equitable services and infrastructure. This renewed collaboration focuses on improving the quality of scholarly metadata while expanding support for journals in low- and middle income- countries.
We have worked together since 2021, primarily to encourage the dissemination and use of scholarly research using online technologies, and regional and international networks, partners and communities. This partnership has helped to build local institutional capacity and sustainability within the global scholarly communication ecosystem. A continued partnership also reflects that we have a shared community; currently almost 90% of DOAJ journals are represented in Crossref.
We believe in Persistent Identifiers. We believe in defence in depth. Today we’re excited to announce an upgrade to our data resilience strategy.
Defence in depth means layers of security and resilience, and that means layers of backups. For some years now, our last line of defence has been a reliable, tried-and-tested technology. One that’s been around for a while. Yes, I’m talking about the humble 5¼ inch floppy disk.
This may come as surprise to some. When things go well, you’re probably never aware of them. In day to day use, the only time a typical Crossref user sees a floppy disk is when they click ‘save’ (yes, some journals still require submissions in Microsoft Word).
History
But why?
Let me take you back to the early days of Crossref. The technology scene was different. This data was too important to trust to new and unproven technologies like Zip disks, CD-Rs or USB Thumb Drives. So we started with punched cards.
IBM 5081-style punched card.
Punched cards are reliable and durable as long as you don’t fold, spindle or mutilate them. But even in 2001 we knew that punched cards’ days were numbered. The capacity of 80 characters kept DOIs short. Translating DOIs into EBCDIC made ASCII a challenge, let alone SICIs. We kept a close eye on the nascent Unicode.
Breathing Room
In 2017 the change of DOI display guidelines from http://dx.doi.org.turing.library.northwestern.edu to https://doi-org.turing.library.northwestern.edu shortened each DOI by 2 characters, buying us some time. But eventually we knew we had to upgrade to something more modern.
So we migrated to 5¼ inch floppy disks.
5¼ Floppy disk in drive
At 640 KB per disk these were a huge improvement. We could fit around 20,000 DOIs on one floppy. Today we only need around 10,000 floppy disks to store all of our DOIs (not the metadata, just the DOIs). Surprisingly this only takes about 20 metres of shelf space to store.
Typical work from home setup. Getting ready to backup some DOIs!
The move to working-from-home brought an unexpected benefit. Staff mail floppy disks to each other and keep them in constant rotation, which produces a distributed fault tolerant system.
Persistence Means Change
But it can’t last forever. DOIs registration shows no sign of slowing down. It’s clear we need a new, compact storage medium. So, after months of research, we’ve invested in new equipment.
Today we announce our migration to 3½ inch floppies.
If it goes to plan you won’t even notice the change.