First 15 Open Source Advent projects

This page summarizes the projects mentioned and recommended in the original post on dev.to.

  1. Milvus

    Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

    1. Milvus by Zilliz | GitHub

  3. quivr

    Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.

    3. Quivr | GitHub | tutorial

  4. haystack

    AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

    4. Haystack by Deepset | GitHub | tutorial

  5. proton

    High-performance, low-footprint SQL database written in C++. Process millions of rows per second from Kafka/Pulsar, Iceberg, or ClickHouse, and seamlessly write results back. Supports powerful features like JOIN, CDC, UPSERT, and LOOKUP, enabling real-time analytics and ETL at scale. (by timeplus-io)

    5. Proton by Timeplus | GitHub | tutorial

  6. ydata-profiling

    One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.

    6. Ydata-synthetic and Ydata-profiling by YData | GitHub | tutorial

  8. langchainrb

    Build LLM-powered applications in Ruby

    8. LangChain RB | GitHub | tutorial

  9. flyte

    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

    9. Flyte by Union AI | GitHub | tutorial

  10. dvc

    🦉 Data Versioning and ML Experiments

    10. DVC by Iterative | GitHub | tutorial

  11. dvclive

    📈 Log and track ML metrics, parameters, models with Git and/or DVC

    10. DVC by Iterative | GitHub | tutorial

  12. phoenix

    AI Observability & Evaluation (by Arize-ai)

    11. Phoenix by Arize AI | GitHub | tutorial

  13. trulens

    Evaluation and Tracking for LLM Experiments

    12. TruLens by TruEra | GitHub | tutorial

  14. OpenLLM

    Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.

    13. OpenLLM by BentoML | GitHub | tutorial

  15. label-studio

    Label Studio is a multi-type data labeling and annotation tool with a standardized output format

    14. LabelStudio by Human Signal | GitHub | tutorial

  16. llama_index

    LlamaIndex is the leading framework for building LLM-powered agents over your data.

    15. LlamaIndex | GitHub | tutorial

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
