lingua
cld3
Our great sponsors
lingua | cld3 | |
---|---|---|
8 | 6 | |
648 | 737 | |
- | 1.5% | |
6.3 | 0.0 | |
17 days ago | 10 months ago | |
Kotlin | C++ | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lingua
-
Hazelcast + Kibana: best buddies for exploring and visualizing data
A linguist can infer the language of the field. It's also possible to use an automated process in the pipeline. A couple of NLP libraries are available in the JVM ecosystem, but I set my eyes on Lingua, one focused on language recognition.
- Language Detection - Pre Trained Models
-
Free and easy to use Java language detection library
I've used this one previously, and found it pretty easy to use, relatively fast, and accurate: https://github.com/pemistahl/lingua
cld3
-
cld3: Rust binding for Compact Language Detector v3 (CLD3), a neural network model for language identification.
the C++ code is from https://github.com/google/cld3
- Lingua-Go, the most accurate language detection for Go
-
Announcing Lingua 1.0.0: The most accurate natural language detection library for Python, suitable for long and short text alike
Python is widely used in natural language processing, so there are a couple of comprehensive open source libraries for this task, such as Google's CLD 2 and CLD 3, langid and langdetect. Unfortunately, except for the last one they have two major drawbacks:
-
Best C# library to detect the language of user input strings without calling external APIs like Google Translate etc?
I was looking for something like that for .net app and ended up using this https://github.com/google/cld3
- Language Detection - Pre Trained Models
What are some alternatives?
language-detection-cld2 - Natural language detection, Java bindings for CLD2
ntextcat
Beagle - Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.
kotlin-logging - Lightweight Multiplatform logging framework for Kotlin. A convenient and performant logging facade.
cld3-kotlin - Bindings to Google's Compact Language Detector 3 to JVM Based Languages
kovenant - Kovenant. Promises for Kotlin.
KtUnits - Simple unit conversion library for Kotlin
CakeParse - Simple parser combinator library for Kotlin
kotlin-futures - A collections of extension functions to make the JVM Future, CompletableFuture, ListenableFuture API more functional and Kotlin like.
langdetect - Port of Google's language-detection library to Python.
khronos - An intuitive Date extensions in Kotlin.
actions-on-google-kotlin - Unofficial Actions on Google SDK for Kotlin and Java