Show HN: Neum AI – Open-source large-scale RAG framework

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
Sevalla - Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com
featured
  1. NeumAI

    Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

    Interesting to see that the semantic chunking in the tools library is a wrapper around GPT-4. Asks GPT for the python code and executes it: https://github.com/NeumTry/NeumAI/blob/main/neumai-tools/neu...

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. fast_vector_similarity

    The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.

    Got it. I'd encourage you to expose more of that functionality at the level of your application if possible. I think there is a lot of potential in using more than just cosine similarity, especially when there are lots of candidates and you really want to sharpen up the top few recommendations to the best ones. You might find this open-source library I made recently useful for that:

    https://github.com/Dicklesworthstone/fast_vector_similarity

    I've had good results from starting with cosine similarity (using FAISS) and then "enriching" the top results from that with more sophisticated measures of similarity from my library to get the final ranking.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • From Prompt to AI-Powered Finance App in Minutes

    1 project | dev.to | 26 Aug 2025
  • Embedding Text Documents with Qwen3

    1 project | news.ycombinator.com | 23 Aug 2025
  • Multi-Dimensional Vector Support in CocoIndex – Underneath Explained

    1 project | news.ycombinator.com | 13 Aug 2025
  • CocoIndex – open-source ETL saves you >90% compute for AI workloads

    3 projects | news.ycombinator.com | 12 Aug 2025
  • Multimodal Face Recognition Pipeline with CocoIndex: Real-Time Image and Vector Search

    1 project | dev.to | 29 Jul 2025