Lingua-Go, the most accurate language detection for Go

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • InfluxDB - Build time-series-based applications quickly and at scale.
  • Sonar - Write Clean C++ Code. Always.
  • SaaSHub - Software Alternatives and Reviews
  • lingua-go

    The most accurate natural language detection library for Go, suitable for long and short text alike

    Worth noting that this is on Lingua-Go's issues list for the 1.1.0 version: https://github.com/pemistahl/lingua-go/issues/9

  • lingua-py

    The most accurate natural language detection library for Python, suitable for long and short text alike

    There is also a comparison with CLD 2 in the repo of sister Python library:

    https://github.com/pemistahl/lingua-py#4-how-good-is-it

    CDL 2 seems to be slightly less accurate than CLD 3 on average.

  • InfluxDB

    Build time-series-based applications quickly and at scale.. InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises.

  • cld3

  • cld2

    Compact Language Detector 2

  • LSTM_langid

    Source code for the Apple reproduction

    In general, language detection is surprisingly hard. There is LSTM-based implementation https://github.com/AU-DIS/LSTM_langid which should be better than ngrams.

  • Sonar

    Write Clean C++ Code. Always.. Sonar helps you commit clean C++ code every time. With over 550 unique rules to find C++ bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts