lingua-rs
sonic
Our great sponsors
lingua-rs | sonic | |
---|---|---|
9 | 48 | |
820 | 19,431 | |
- | - | |
8.9 | 7.0 | |
11 days ago | 26 days ago | |
Rust | Rust | |
Apache License 2.0 | Mozilla Public License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lingua-rs
-
I created a program that finds out which anki cards out of 50_000 are in english and deletes them in 2 minutes
Discovery of Lingua: While working on a different project, I discovered the Lingua library.
- Lingua 1.5.0 - The most accurate natural language detection library for Rust, now with support for detecting multiple languages in mixed-language text
-
Opensourcing Whichlang, a fast language detection library for Rust! 🚀 ⚡
It is. Have you tried with this PR though? (Disclaimer: I made that PR) It'll most likely still be slower, but at least it shouldn't be catastrophically slower when using multiple threads.
-
Whatlang 0.15.0 released (lightweight lib for language recognition)
How does it compare to lingua?
- Announcing Lingua 1.4: The most accurate natural language detection library for Rust - now with WASM support
- Announcing Lingua 1.3 - The most accurate natural language detection library for Rust
-
Whatlang strikes back
For those who don't know me: I'm the author of Lingua. I've just made a comparison between the current Lingua version 1.2.0 and the new Whatlang 0.12.0. In fact, the detection accuracy of Whatlang has increased from 65% in version 0.11.1 to 74% in version 0.12.0 on average across all supported languages and detection tasks. You can find the detailed comparison here. In short:
-
Text Rendering
Language detection -> Lingua
sonic
-
What is Hybrid Search?
Sonic - a project written in Rust, uses custom network communication protocol for fast communication between the client and the server.
-
ArchiveBox: Open-source self-hosted web archiving
This is uncanny, I just discovered ArchiveBox earlier today and set up a self-hosted instance on some home hardware for a collection of bookmarks of useful guides, tutorials, and references I've collected over the years.
Setting it up on K8s with sonic [1] as the search backend and importing a few hundred URLs only took ~an hour or so, and the cached pages look great for the most part.
[1] https://github.com/valeriansaliou/sonic
- sonic: Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
- Seeking a free full text search solution for large data with progress display
- Show HN: CozoDB, Hybrid Relational-Graph-Vector DB, the Hippocampus for LLMs
- FLiP Stack Weekly for 15-Jan-2023
-
Building an Internet Scale Meme Search Engine
If you don't need advanced search features, you can use Sonic (https://github.com/valeriansaliou/sonic). It's blazing fast and you can save lot of money on servers.
-
Any Full Text Search library for json data?
What about Sonic? Maybe it requires a bit of integration, but it's simple and blazing fast.
-
10 Trending Github repositories / October, 27 2022
git clone https://github.com/valeriansaliou/sonic.git
- Sonic, An alternative to Elasticsearch that runs on a few MBs of RAM
What are some alternatives?
lingua-py - The most accurate natural language detection library for Python, suitable for short text and mixed-language text
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
whatlang-rs - Natural language detection library for Rust. Try demo online: https://whatlang.org/
fastapi - FastAPI framework, high performance, easy to learn, fast to code, ready for production
crates.io - The Rust package registry
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
allsorts - Font parser, shaping engine, and subsetter implemented in Rust
tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
whatlang-accuracy-benchmark - Accuracy benchmarks for Whatlang
tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust [Moved to: https://github.com/quickwit-oss/tantivy]
rust-harfbuzz - Rust bindings to HarfBuzz
graylog - Free and open log management