Unnatural Keys – Nature doesn’t come with identifiers

openlibrary

409 4,865 9.9 Python

One webpage for every book ever published!

I ran into the same problem while building https://learnawesome.org/. Forget the broad class of "learning resources"; even the "books" category doesn't have a usable unique ID. Not everything gets an ISBN, for example. There's also the ambiguity between a "work" and an "edition" of a work.
This is probably why OpenLibrary supports mapping of books with 40+ identifiers: https://github.com/internetarchive/openlibrary/blob/master/o...
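
To make the work/edition distinction concrete, here is a minimal sketch of how a catalog might model it. This is not OpenLibrary's actual schema: the class names, the scheme names (`isbn_13`, `local_catalog`) and the identifier values are all made-up placeholders.

```python
from dataclasses import dataclass, field


@dataclass
class Edition:
    title: str
    # Scheme name -> values. Any scheme may be absent (not every book has an ISBN).
    identifiers: dict[str, list[str]] = field(default_factory=dict)


@dataclass
class Work:
    title: str
    editions: list[Edition] = field(default_factory=list)

    def find_editions(self, scheme: str, value: str) -> list[Edition]:
        # Return every edition carrying this identifier; there may be zero or several.
        return [e for e in self.editions if value in e.identifiers.get(scheme, [])]


# Placeholder data: the ISBN and catalog number below are invented for illustration.
work = Work("Moby-Dick")
work.editions.append(Edition("Moby-Dick (paperback)", {"isbn_13": ["9780000000002"]}))
work.editions.append(Edition("Moby-Dick (1851 printing)", {"local_catalog": ["hypothetical-001"]}))
```

The second edition has no ISBN at all, so any lookup keyed on ISBN alone would never find it. That is the gap a multi-scheme mapping is meant to cover.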

learndb

4 112 9.2 JavaScript

Curated learning resources with topics, formats, difficulty levels, expert reviews and metadata tags

tag

1 5 6.8 Jupyter Notebook

Technical Architecture Group (by tdwg)

Nature + Identifiers is an issue (e.g. see https://github.com/tdwg/tag/issues/36). I've routinely told others in my field to look at other industries for ways forward, particularly ones like the music industry, so it's interesting, if unsurprising, to see all the same problems arise there. For those who know: in our field, when identifiers come up in conversation at conferences etc., we tiptoe away; somehow people can't learn from past issues.
There are two issues that keep coming up in my mind: 1) people want identifiers to do something (like resolve), rather than just be identifiers, and 2) people think that there are such things as "unique" identifiers (one identifier per "thing"). Neither, in my mind, is the purpose of identifiers. Identifiers should do one thing: localize you to some concept. By localize I mean that if you can find the digital space (or, in physical collections where identifiers are used, the physical "printed" identifier) that "contains" the identifier, then you should have a reasonable probability of finding the thing/concept that identifier is for. That's all. No certainty, no uniqueness. It's very akin to what we do when we cite something in a publication: we are giving the researcher a reasonable chance of finding the origin. This isn't to say that we shouldn't try to keep identifiers unique; it's to say that when it comes down to crunch time we should never assume 1) that they are unique, or 2) that their special properties (e.g. that they resolve) actually work.
I've seen numerous identifier schemes come and go; we've specifically designed a 1-to-many mapping from things to identifiers in our systems (sitting on top of our internal IDs). DOIs? They must be unique, right? Nope. Institutional CODENs? Nope (though the botanists have done it pretty well through community peer pressure).
As others have noted, identifiers really are just labels, though things like UUIDs have the game-changing property of reducing the probability that you're looking at a homonymous label.
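
A rough sketch of what such a 1-to-many design could look like follows. It is an illustration under stated assumptions, not the commenter's actual system: `IdentifierRegistry`, its method names, the scheme names and the sample identifier values are all hypothetical. The key choice is that a lookup returns a set of candidates rather than a single answer.

```python
import uuid
from collections import defaultdict


class IdentifierRegistry:
    """Hypothetical registry: one internal ID per thing, many external labels."""

    def __init__(self) -> None:
        self._labels_by_thing: dict[str, set[tuple[str, str]]] = defaultdict(set)
        self._things_by_label: dict[tuple[str, str], set[str]] = defaultdict(set)

    def new_thing(self) -> str:
        # Internal IDs are random UUIDs, so an accidental clash is very unlikely.
        return str(uuid.uuid4())

    def attach(self, thing_id: str, scheme: str, value: str) -> None:
        # Record the label in both directions; duplicates across things are allowed.
        label = (scheme, value)
        self._labels_by_thing[thing_id].add(label)
        self._things_by_label[label].add(thing_id)

    def labels_for(self, thing_id: str) -> set[tuple[str, str]]:
        return set(self._labels_by_thing.get(thing_id, set()))

    def candidates(self, scheme: str, value: str) -> set[str]:
        # A set, not a single answer: the label only localizes the search.
        return set(self._things_by_label.get((scheme, value), set()))


registry = IdentifierRegistry()
specimen_a = registry.new_thing()
specimen_b = registry.new_thing()

# Made-up values: the same external label ends up attached to two different things.
registry.attach(specimen_a, "doi", "10.0000/example")
registry.attach(specimen_a, "catalog", "HERB-0001")
registry.attach(specimen_b, "doi", "10.0000/example")
```

Here `registry.candidates("doi", "10.0000/example")` yields both internal IDs, so the caller has to disambiguate instead of trusting the external label to be unique.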

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Internet Archive: Open Library
1 project | news.ycombinator.com | 30 Apr 2024

Ask HN: Anyone looking for contributors for their open source projects
13 projects | news.ycombinator.com | 21 Mar 2024

Building an Open Source Decentralized E-Book Search Engine
5 projects | news.ycombinator.com | 11 Mar 2024

MLIS books available digitally?
1 project | /r/librarians | 8 Dec 2023

HMF a “legal” website to download books
1 project | /r/HelpMeFind | 5 Dec 2023

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
internet-archive Education open-source Ipfs Books
Post date: 28 May 2023
