lingua
okhttp-eventsource
lingua | okhttp-eventsource | |
---|---|---|
8 | 1 | |
660 | 130 | |
- | 0.8% | |
6.0 | 3.5 | |
25 days ago | 4 months ago | |
Kotlin | Java | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lingua
- Announcing Lingua 1.2.0 - The most accurate natural language detection library for the JVM, suitable for long and short text alike
-
r/argentina es el subreddit de habla hispana mas popular del sitio
select 'r/'||subreddit sub , initcap(lang) language , count(*) c , ratio_to_report(c) over(partition by sub) ratio , sum(iff(language!='English', c, 0)) over(partition by sub) total_not_english , sum(c) over(partition by sub) total from reddit_sample_languages_udtf group by 1, 2 qualify ratio > .02 order by total_not_english desc, c desc, 1, ratio desc- Jason Baumgartner for collecting and sharing Reddit’s comments. - Peter M. Stahl for the Lingua project to detect languages in Java. - Snowflake for making it easy to run Java code in a UDF.
-
The most popular languages on Reddit, after analyzing 1M comments: English, German, Spanish, Portuguese, French, Italian, Romanian, Dutch... [OC]
I don't speak most of these languages, so I wasn't able to verify -- instead I just used the results of this library: https://github.com/pemistahl/lingua
-
Hazelcast + Kibana: best buddies for exploring and visualizing data
A linguist can infer the language of the field. It's also possible to use an automated process in the pipeline. A couple of NLP libraries are available in the JVM ecosystem, but I set my eyes on Lingua, one focused on language recognition.
- Usando a Biblioteca Lingua para Kotlin
- Language Detection - Pre Trained Models
- Lingua 1.1.0 released - The most accurate natural language detection library for the JVM
-
Free and easy to use Java language detection library
I've used this one previously, and found it pretty easy to use, relatively fast, and accurate: https://github.com/pemistahl/lingua
okhttp-eventsource
-
Hazelcast + Kibana: best buddies for exploring and visualizing data
Wikipedia provides changes through Server-Sent Events. In short, with SSE, you register a client to the endpoint, and every time new data comes in, you are notified and can act accordingly. On the JVM, a couple of SSE-compatible clients are available, including Spring WebClient. Instead, I chose to use OkHttp EventSource because it's lightweight - it only depends on OkHttp, and its usage is relatively straightforward.
What are some alternatives?
language-detection-cld2 - Natural language detection, Java bindings for CLD2
JavaGuide - 「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。准备 Java 面试,首选 JavaGuide!
Beagle - Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.
Design Patterns - Design patterns implemented in Java
kotlin-logging - Lightweight Multiplatform logging framework for Kotlin. A convenient and performant logging facade.
wikipedia-changes - Repository for the post and talk "Hazelcast + Kibana: best buddies for exploring and visualizing data"
cld3-kotlin - Bindings to Google's Compact Language Detector 3 to JVM Based Languages
sentry-java - A Sentry SDK for Java, Android and other JVM languages.
kovenant - Kovenant. Promises for Kotlin.
cronet-transport-for-okhttp - This package allows OkHttp and Retrofit users to use Cronet as their transport layer, benefiting from features like QUIC/HTTP3 support or connection migration.
KtUnits - Simple unit conversion library for Kotlin
Fetch - The best file downloader library for Android