Unnatural Keys – Nature doesn’t come with identifiers

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • openlibrary

    One webpage for every book ever published!

  • I ran into the same problem while building https://learnawesome.org/ . Forget the broad class of "learning resources", even the "books" category doesn't have a usable unique ID. Not everything gets an ISBN for example. There's also the ambiguity between a "work" and an "edition" of a work.

    This is probably why OpenLibrary supports mapping of books with 40+ identifiers: https://github.com/internetarchive/openlibrary/blob/master/o...

  • learndb

    Curated learning resources with topics, formats, difficulty levels, expert reviews and metadata tags

  • I ran into the same problem while building https://learnawesome.org/ . Forget the broad class of "learning resources", even the "books" category doesn't have a usable unique ID. Not everything gets an ISBN for example. There's also the ambiguity between a "work" and an "edition" of a work.

    This is probably why OpenLibrary supports mapping of books with 40+ identifiers: https://github.com/internetarchive/openlibrary/blob/master/o...

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • tag

    Technical Architecture Group (by tdwg)

  • Nature + Identifiers is an issue (e.g. see https://github.com/tdwg/tag/issues/36). I've routinely mentioned to others in my field to look at other industries for ways forward, particularly ones like the music industry, so its interesting, if unsurprising to see all the same problems arise there. For those who know, in our field, when Identifiers comes up in conversation at conferences etc., we tiptoe away, somehow people can't learn from the past issues.

    There are two issues that keep coming up in my mind 1) People want Identifiers to do something (like resolve), rather than just be identifiers and 2) People think that there are such things as "unique" identifiers (one identifier per "thing"). Neither, in my mind, are the purpose of identifiers. Identifiers should do one thing, localize you to some concept. By localize I mean that if you can find the digital space (or in physical collections where identifiers are used the physical "printed" identifier) that "contains" the identifier then you should have a reasonable probability of finding the thing/concept that identifier is for. That's all. No certainty, no uniqueness. It's very akin to what we do when we cite something in a publication, we are giving the research a reasonable chance of finding the origin. This isn't to say that we shouldn't try to keep identifiers unique though, it's to say that when it comes down to crunch time we should never assume 1) they are unique, and 2) that their special properties (e.g. that they resolve) actually work.

    I've seen numerous identifier schemes come and go, we've specifically designed a 1-many for our things-to-identifiers in our systems (sitting on top our internal IDs). DOIs? They must be unique, right? Nope. Institutional CODENs? Nope (though the botanists have done it pretty well through community peer-pressure).

    As others have noted, identifiers really are just labels, though things like UUIDs have the game-changing property ofreducing the probability that you're looking at a homonymous label.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Internet Archive: Open Library

    1 project | news.ycombinator.com | 30 Apr 2024
  • Ask HN: Anyone looking for contributors for their open source projects

    13 projects | news.ycombinator.com | 21 Mar 2024
  • Building an Open Source Decentralized E-Book Search Engine

    5 projects | news.ycombinator.com | 11 Mar 2024
  • MLIS books available digitally?

    1 project | /r/librarians | 8 Dec 2023
  • HMF a “legal” website to download books

    1 project | /r/HelpMeFind | 5 Dec 2023