-
CommonCrawl.org has Java bindings for CLD2 here: https://github.com/commoncrawl/language-detection-cld2
-
InfluxDB
Purpose built for real-time analytics at any scale. InfluxDB Platform is powered by columnar analytics, optimized for cost-efficient storage, and built with open data standards.
-
lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
I've used this one previously, and found it pretty easy to use, relatively fast, and accurate: https://github.com/pemistahl/lingua
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
Comparing Language Detection Libraries (& API) Using Java/ColdFusion/CFML
-
Announcing Lingua 1.2.0 - The most accurate natural language detection library for the JVM, suitable for long and short text alike
-
r/argentina es el subreddit de habla hispana mas popular del sitio
-
The most popular languages on Reddit, after analyzing 1M comments: English, German, Spanish, Portuguese, French, Italian, Romanian, Dutch... [OC]
-
Hazelcast + Kibana: best buddies for exploring and visualizing data