UTF-8 as told by Rob Pike

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • MySQL

    MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.

  • So now we know who is really responsible for the whole MySQL utf8mb4 fiasco -- these 2 guys sitting in a diner, conjuring up a brilliant scheme to cover 4 billions characters, which turned out to exceed the actual requirement by more than 2000x.

    September 1992: 2 guys scribbling on a placemat.

    January 1998: RFC 2279 defines UTF-8 to be between 1 to 6 bytes.

    March 2001: A bunch of CJK characters were added to Unicode Data 3.1.0, pushing the total to 94,140, exceeding the 16-bit limit of 3 bytes UTF-8.

    March 2002: MySQL added support for UTF-8, initially setting the limit to 6 bytes (https://github.com/mysql/mysql-server/commit/55e0a9c)

    September 2002: MySQL decided to reduce the limit to 3 bytes, probably for storage efficiency reason (https://github.com/mysql/mysql-server/commit/43a506c, https://adamhooper.medium.com/in-mysql-never-use-utf8-use-ut...)

    November 2003: RFC 3629 defines UTF-8 to be between 1 to 4 bytes.

    Arguably, if the placemat was smaller and the guys stopped at 4 bytes after running out of space, perhaps MySQL would have done the right thing? Ah, who am I kidding. The same commit would likely still happen.

  • xee

    Automatically exported from code.google.com/p/xee

  • It's the same with html and css: people shit on it all the time, but this just shows they don't have the imagination to see how much worse it could be.

    Just compare to e.g. Photoshop file format: https://github.com/gco/xee/blob/master/XeePhotoshopLoader.m#...

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • System Design: Databases and DBMS

    12 projects | dev.to | 1 May 2024
  • PresentaciĆ³n del Operador LMS Moodle

    5 projects | dev.to | 29 Apr 2024
  • Introducing LMS Moodle Operator

    5 projects | dev.to | 29 Apr 2024
  • Integrate txtai with Postgres

    2 projects | dev.to | 25 Apr 2024
  • The Scoop on SQL

    1 project | dev.to | 14 Apr 2024