-
Milvus
Milvus is a high-performance, cloud-native vector database built for scalable approximate nearest neighbor (ANN) vector search.
1. Milvus by Zilliz | GitHub
-
quivr
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any files. Any way you want.
3. Quivr | GitHub | tutorial
-
haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
4. Haystack by Deepset | GitHub | tutorial
-
proton
High-performance, low-footprint SQL database written in C++. Process millions of rows per second from Kafka/Pulsar, Iceberg, or ClickHouse, and seamlessly write results back. Supports powerful features like JOIN, CDC, UPSERT, and LOOKUP, enabling real-time analytics and ETL at scale. (by timeplus-io)
5. Proton by Timeplus | GitHub | tutorial
-
ydata-profiling
One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
6. Ydata-synthetic and Ydata-profiling by YData | GitHub | tutorial
-
7. Apache Flink | GitHub | tutorial
-
8. LangChain RB | GitHub | tutorial
-
flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
9. Flyte by Union AI | GitHub | tutorial
-
10. DVC by Iterative | GitHub | tutorial
-
11. Phoenix by Arize AI | GitHub | tutorial
-
12. TruLens by TruEra | GitHub | tutorial
-
OpenLLM
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
13. OpenLLM by BentoML | GitHub | tutorial
-
label-studio
Label Studio is a multi-type data labeling and annotation tool with a standardized output format.
14. LabelStudio by Human Signal | GitHub | tutorial
-
15. LlamaIndex | GitHub | tutorial