Most similar language to each European language, based purely on letter distribution [OC]


wikiextractor

3 3,637 0.0 Python

A tool for extracting plain text from Wikipedia dumps

Methodology: extracted 100MB of article text from each language's Wikipedia using https://github.com/attardi/wikiextractor, and counted the relative frequency of each character using Python. The similarity measure is the sum of the absolute differences in character frequencies, so a lower score means more similar. For example, if language A has distribution {A: 0.5, B: 0.3, C: 0.2} and language B has distribution {A: 0.8, B: 0.2}, then their score is |0.5-0.8|+|0.3-0.2|+|0.2-0.0|=0.6. The final chart was generated using graphviz and pillar.
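The measure described above can be sketched in a few lines of Python. This is only an illustration, not the original poster's code: the function names `letter_distribution` and `distribution_distance` are made up here, and the choice to lowercase the text and keep only alphabetic characters is an assumption the post does not confirm.

```python
from collections import Counter


def letter_distribution(text):
    """Return the relative frequency of each character in text.

    Assumption (not stated in the post): text is lowercased and
    only alphabetic characters are counted.
    """
    counts = Counter(ch.lower() for ch in text if ch.isalpha())
    total = sum(counts.values())
    if total == 0:
        return {}
    return {ch: n / total for ch, n in counts.items()}


def distribution_distance(p, q):
    """Sum of absolute differences between two distributions.

    A character missing from one distribution counts as 0.0 there.
    Lower values mean the two distributions are more similar.
    """
    return sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in set(p) | set(q))


# The worked example from the methodology paragraph.
lang_a = {"A": 0.5, "B": 0.3, "C": 0.2}
lang_b = {"A": 0.8, "B": 0.2}
print(round(distribution_distance(lang_a, lang_b), 6))  # 0.6
```

To find the most similar language for each language, one would compute this score for every pair and pick the lowest-scoring partner; how ties or non-Latin scripts were handled in the original chart is not stated.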


This page summarizes the projects mentioned and recommended in the original post on /r/dataisbeautiful. Post date: 8 Jun 2022
