First 15 Open Source Advent projects

This page summarizes the projects mentioned and recommended in the original post on dev.to

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • Milvus

    A cloud-native vector database, storage for next generation AI applications

  • 1. Milvus by Zilliz | Github

  • quivr

    Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.

  • 3. Quivr | GitHub | tutorial

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • haystack

    :mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

  • 4. Haystack by Deepset | Github | tutorial

  • proton

    A streaming SQL engine, a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse. (by timeplus-io)

  • 5. Proton by Timeplus | Github | tutorial

  • ydata-profiling

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

  • 6. Ydata-synthetic and Ydata-profiling by YData | Github | tutorial

    7. Apache Flink | Github | tutorial

    7. Apache Flink | Github | tutorial

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • langchainrb

    Build LLM-powered applications in Ruby

  • 8. LangChain RB | Github | tutorial

  • flyte

    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

  • 9. Flyte by Union AI | Github | tutorial

  • dvc

    🦉 ML Experiments and Data Management with Git

  • 10. DVC by Iterative | Github | tutorial

  • dvclive

    📈 Log and track ML metrics, parameters, models with Git and/or DVC

  • 10. DVC by Iterative | Github | tutorial

  • phoenix

    AI Observability & Evaluation (by Arize-ai)

  • 11. Phoenix by Arize AI | Github | tutorial

  • trulens

    Evaluation and Tracking for LLM Experiments

  • 12. TruLens by TruEra | Github | tutorial

  • OpenLLM

    Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint, locally and in the cloud.

  • 13. OpenLLM by BentoML | Github | tutorial

  • label-studio

    Label Studio is a multi-type data labeling and annotation tool with standardized output format

  • 14. LabelStudio by Human Signal | Github | tutorial

  • llama_index

    LlamaIndex is a data framework for your LLM applications

  • 15. LlamaIndex | Github | tutorial

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts