awesome-vector-search vs cadence
Metric | awesome-vector-search | cadence
---|---|---
Mentions | 20 | 19
Stars | 1,284 | 7,831
Growth | 2.5% | 1.0%
Activity | 5.7 | 9.7
Last commit | 23 days ago | about 19 hours ago
Language | | Go
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
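The page does not publish the formulas behind these numbers, so the following is only a rough sketch under stated assumptions: growth is taken as the relative change in stars between two monthly snapshots, and activity as a percentile rank (scaled to 0-10) of a recency-weighted commit count. The exponential decay, the 30-day half-life, the function names, and the sample values are all hypothetical.

```python
from bisect import bisect_left


def month_over_month_growth(stars_prev: int, stars_now: int) -> float:
    """Relative change in stars between two monthly snapshots, in percent."""
    if stars_prev <= 0:
        raise ValueError("stars_prev must be positive")
    return 100.0 * (stars_now - stars_prev) / stars_prev


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum of per-commit weights; a weight halves every half_life_days (assumed decay)."""
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_rank(score, all_scores):
    """Map a raw score to 0-10 by the share of tracked projects scoring strictly lower."""
    ordered = sorted(all_scores)
    below = bisect_left(ordered, score)
    return round(10.0 * below / len(ordered), 1)


if __name__ == "__main__":
    # 1,000 stars last month and 1,025 now is 2.5% growth.
    print(month_over_month_growth(1000, 1025))
    # A commit from yesterday counts for more than one from two months ago.
    print(weighted_commit_score([1]) > weighted_commit_score([60]))
    # Scoring higher than 90 of 100 tracked projects maps to 9.0.
    print(activity_rank(90, range(100)))
```

Under this reading, an activity of 9.0 simply means that 90% of the tracked projects have a lower weighted score, which matches the "top 10%" wording above.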
awesome-vector-search
- Show HN: SimSIMD vs. SciPy: How AVX-512 and SVE make SIMD cleaner and ML faster
- Reality check on good embedding model (and this idea in general)
  Probably. But there are a number of free open source ones. For example, I've got a document that I'm doing embedding-keys for that has about 8000 sentences. Here's a list of some: https://github.com/currentslab/awesome-vector-search
- Rye, meet GPT3 ... and vice versa :)
  Note: search for vector databases not written in Go but with Go clients, in case there is anything more local/lightweight: https://github.com/currentslab/awesome-vector-search
- Vector database built for scalable similarity search
  https://github.com/currentslab/awesome-vector-search
  I was surprised to see Elastic actually has OK support for some of this stuff, though it appears slower for most of the tasks.
- [P] My co-founder and I quit our engineering jobs at AWS to build “Tensor Search”. Here is why.
  Supporting sequences of vectors does seem like a breath of fresh air for vector search services. I have added Marqo to the list of awesome vector search (disclosure: I am the maintainer of the list) to increase your exposure.
- What are vector search engines?
  If you want a proper curated list of various libraries and standalone services of vector search engines, refer to this awesome GitHub repository by Currents API.
- List of vector search libraries
- List of curated vector search libraries
- A GitHub repository that collects awesome vector search framework/engine, library, cloud service, and research papers
- Find anything fast with Google's vector search technology
cadence
- Show HN: Hatchet – Open-source distributed task queue
- Ask HN: Who is hiring? (December 2023)
  Uber | Software Engineers | Hybrid (Denmark) | https://www.uber.com/dk/en/careers/locations/aarhus/
  Work with an amazing team responsible for the infrastructure software that makes Uber’s data centers around the world reliable and scalable. If you want to solve the toughest engineering challenges alongside some of the smartest people in the industry, Uber Aarhus is the right place for you.
  The team in Aarhus build and operate the stateless and stateful compute platforms used by nearly every other engineer in the company (Up - https://www.uber.com/en-GB/blog/up-portable-microservices-re... and Odin - https://www.uber.com/en-GB/blog/how-uber-optimized-cassandra...) as well as other related infrastructure projects such as Cadence - https://github.com/uber/cadence.
- Cadence – Fault-Tolerant Stateful Code Platform by Uber
- Best way to schedule events and handle them in the future?
  Maybe this: https://cadenceworkflow.io/
- Mandala: experiment data management as a built-in (Python) language feature
- are you interested in an end to end queue/pubsub & worker platform
  A managed ESB orchestration, for example, is exactly the same as Step Functions and workflow engines like Cadence: https://github.com/uber/cadence
- Why messaging is much better than REST for inter-microservice communications
  Having done a reasonable amount of messaging code in my time, I would say the final form of this sort of thing might look more like Cadence[0] than anything like this.
  [0] https://github.com/uber/cadence
- cadence VS javactrl-kafka - a user suggested alternative
  2 projects | 2 Feb 2023
- Fault-Tolerant Stateful Code Platform
- [P] My co-founder and I quit our engineering jobs at AWS to build “Tensor Search”. Here is why.
  Emit events from your primary DB (Postgres, etc.) to something like Kafka or RabbitMQ and then catch that in your search engine. There are also some end-to-end solutions like Temporal (temporal.io) or Cadence (https://cadenceworkflow.io/)
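
The pattern described in the last comment (the primary database emits change events to a broker, and the search engine consumes them) could look roughly like the sketch below. It is a toy, in-process illustration only: `queue.Queue` stands in for Kafka or RabbitMQ, a plain dict stands in for the search index, and every name (`ChangeEvent`, `emit_change`, `run_indexer`, `demo`) is hypothetical rather than part of any real client library.

```python
import queue
import threading
from dataclasses import dataclass
from typing import Optional


@dataclass
class ChangeEvent:
    op: str                      # "upsert" or "delete"
    doc_id: str
    body: Optional[str] = None   # document text, only set for upserts


def emit_change(broker: queue.Queue, event: ChangeEvent) -> None:
    """Called by the writer after it commits a change to the primary DB."""
    broker.put(event)


def run_indexer(broker: queue.Queue, index: dict) -> None:
    """Consume events in arrival order and apply them to the search index."""
    while True:
        event = broker.get()
        if event is None:        # sentinel value: stop consuming
            break
        if event.op == "upsert":
            index[event.doc_id] = event.body
        elif event.op == "delete":
            index.pop(event.doc_id, None)


def demo() -> dict:
    broker: queue.Queue = queue.Queue()
    index: dict = {}
    worker = threading.Thread(target=run_indexer, args=(broker, index))
    worker.start()
    emit_change(broker, ChangeEvent("upsert", "1", "vector search"))
    emit_change(broker, ChangeEvent("upsert", "2", "workflow engines"))
    emit_change(broker, ChangeEvent("delete", "1"))
    broker.put(None)             # tell the indexer to stop
    worker.join()
    return index


if __name__ == "__main__":
    print(demo())
```

A real deployment would also need durable delivery, retries, and idempotent index updates; the sketch leaves all of that out, and it is the kind of concern that the end-to-end workflow engines mentioned in the comment are meant to address.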
What are some alternatives?
pgvector - Open-source vector similarity search for Postgres
temporal - Temporal service
annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Flowable (V6) - A compact and highly efficient workflow and Business Process Management (BPM) platform for developers, system admins and business users.
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
gocelery - Celery Distributed Task Queue in Go
Milvus - A cloud-native vector database, storage for next generation AI applications
Asynq - Simple, reliable, and efficient distributed task queue in Go
hnswlib - Header-only C++/python library for fast approximate nearest neighbors
docker-compose - Temporal docker-compose files
featureform - The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Faktory - Language-agnostic persistent background job server