sonic
whatlang-rs
Metric | sonic | whatlang-rs
---|---|---
Mentions | 48 | 7
Stars | 19,317 | 945
Growth | - | -
Activity | 7.5 | 5.1
Last commit | 3 months ago | 12 days ago
Language | Rust | Rust
License | Mozilla Public License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sonic
- What is Hybrid Search?
Sonic, a project written in Rust, uses a custom network communication protocol for fast communication between the client and the server.
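To give a feel for what a line-based text protocol of this kind involves, here is a minimal Python sketch that only formats command lines in the style of the Sonic Channel protocol described in the Sonic repository. The helper names (`escape_text`, `start_command`, `push_command`, `query_command`) are hypothetical, the escaping rules are an assumption, and no network connection is made. Check the project's protocol documentation before relying on the exact syntax.

```python
def escape_text(text: str) -> str:
    """Make text safe for a single quoted protocol line.

    Assumed rules: escape backslashes and double quotes, and
    replace newlines with spaces so the command stays on one line.
    """
    text = text.replace("\\", "\\\\").replace('"', '\\"')
    return text.replace("\r", " ").replace("\n", " ")


def start_command(mode: str, password: str) -> str:
    """Build the line that opens a channel in a given mode (e.g. 'search' or 'ingest')."""
    return f"START {mode} {password}"


def push_command(collection: str, bucket: str, object_id: str, text: str) -> str:
    """Build an ingest-mode line that indexes `text` under `object_id`."""
    return f'PUSH {collection} {bucket} {object_id} "{escape_text(text)}"'


def query_command(collection: str, bucket: str, terms: str, limit: int = 10) -> str:
    """Build a search-mode line that asks for up to `limit` matching object ids."""
    return f'QUERY {collection} {bucket} "{escape_text(terms)}" LIMIT({limit})'


if __name__ == "__main__":
    # Illustrative values only; the password is a placeholder.
    print(start_command("ingest", "SecretPassword"))
    print(push_command("messages", "default", "conversation:1", 'He said "hello"'))
    print(query_command("messages", "default", "hello", limit=5))
```

A real client would write each of these lines to a TCP socket, terminated by a newline, and read the server's reply line before sending the next command.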
- ArchiveBox: Open-source self-hosted web archiving
This is uncanny, I just discovered ArchiveBox earlier today and set up a self-hosted instance on some home hardware for a collection of bookmarks of useful guides, tutorials, and references I've collected over the years.
Setting it up on K8s with sonic [1] as the search backend and importing a few hundred URLs only took ~an hour or so, and the cached pages look great for the most part.
- Seeking a free full text search solution for large data with progress display
- Show HN: CozoDB, Hybrid Relational-Graph-Vector DB, the Hippocampus for LLMs
- FLiP Stack Weekly for 15-Jan-2023
- Building an Internet Scale Meme Search Engine
If you don't need advanced search features, you can use Sonic (https://github.com/valeriansaliou/sonic). It's blazing fast and you can save a lot of money on servers.
- Any Full Text Search library for json data?
What about Sonic? Maybe it requires a bit of integration, but it's simple and blazing fast.
- 10 Trending Github repositories / October, 27 2022
git clone https://github.com/valeriansaliou/sonic.git
- An alternative to Elasticsearch that runs on a few MBs of RAM
- Sonic (https://github.com/valeriansaliou/sonic)
There isn't enough out there comparing all these for the simple, typical fuzzy search/search box use case, which I think is 80% of people doing search today.
Like other people are pointing out, most of these engines won't have all the features of ES (or more accurately Lucene) but I am pretty convinced that most of the time it doesn't actually matter and if someone is searching on your site excessively maybe there's a problem with your UX (unless you're a search engine or repository of information).
[0]: https://supabase.com/blog/postgres-full-text-search-vs-the-r...
whatlang-rs
- Lingua 1.5.0 - The most accurate natural language detection library for Rust, now with support for detecting multiple languages in mixed-language text
How does it compare to whatlang?
- Python Binding for WhatLang (Detect languages) - Blazing Fast ⚡
WhatLang is a Python library for detecting the language of a text. It is based on the WhatLang Rust library.
- To people with real Rusty jobs: How did you land it? What exactly do you do at your job? How proficient are you? What skills besides Rust? How long did it take?
I started working on the whatlang project (https://github.com/greyblake/whatlang-rs). In 2017 I started going to Rust interviews. At that moment there were only 3 companies in Berlin offering Rust jobs (as far as I know): Parity, Mozilla, 1aim. I had interviews with all of them and did not pass. I had a classical Ruby/web background, and at that moment Rust was seen as an alternative to C++, so many would expect me to know C++ well (which was not really the case). I continued working on my open source projects and writing blog posts from time to time. The year 2020 was very different. It felt like Rust had turned from an underdog into the mainstream. I felt like Rust job openings tripled. Headhunters started writing to me on LinkedIn, wow! I got contacted by a big crypto exchange, because they wanted to use my library for technical analysis. Sounds like a dream! Eventually, I found a job at Impero.com, thanks to this subreddit. They posted a job description and I sent them my CV. Soon I got hired. It's a remote job, but at that moment it did not make a difference, because of the pandemic.
- Whatlang 0.15.0 released (lightweight lib for language recognition)
CHANGELOG: https://github.com/greyblake/whatlang-rs/blob/master/CHANGELOG.md
- Whatlang strikes back
I am happy to announce a release of a new version (0.12.0) of whatlang.
Regarding Chinese / Japanese: if I got it correctly, Japanese text may include Katakana, Hiragana, and Chinese-derived (Han) characters, while Chinese text includes only Han characters (again, I can be wrong here).
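The distinction described above can be illustrated with a small Python heuristic based on Unicode ranges. This is only a sketch of the idea, not whatlang's actual algorithm. The function names are hypothetical, and the Han check covers only the main CJK Unified Ideographs block.

```python
def has_kana(text: str) -> bool:
    """True if the text contains Hiragana (U+3040-U+309F) or Katakana (U+30A0-U+30FF)."""
    return any("\u3040" <= ch <= "\u30ff" for ch in text)


def has_han(text: str) -> bool:
    """True if the text contains characters from the main CJK Unified Ideographs block."""
    return any("\u4e00" <= ch <= "\u9fff" for ch in text)


def guess_cjk_language(text: str) -> str:
    """Very rough guess: kana implies Japanese; Han characters alone are treated as Chinese."""
    if has_kana(text):
        return "japanese"
    if has_han(text):
        return "chinese"
    return "unknown"


if __name__ == "__main__":
    for sample in ["これは日本語です", "这是中文", "hello"]:
        print(sample, "->", guess_cjk_language(sample))
```

The obvious limitation is a Japanese text written entirely in Han characters (for example a short title), which this heuristic would label as Chinese. A real detector needs more evidence than the script alone.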
What are some alternatives?
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
fastapi - FastAPI framework, high performance, easy to learn, fast to code, ready for production
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
regex - An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust [Moved to: https://github.com/quickwit-oss/tantivy]
graylog - Free and open log management
Fluent - Rust implementation of Project Fluent
zincsearch - ZincSearch. A lightweight alternative to Elasticsearch that requires minimal resources, written in Go.
OpenSearch - 🔎 Open source distributed and RESTful search engine.
textwrap - An efficient and powerful Rust library for word wrapping text.
sea-query - 🔱 A dynamic SQL query builder for MySQL, Postgres and SQLite