Show HN: Neum AI – Open-source large-scale RAG framework

NeumAI

Mentions: 2 | Stars: 779 | Activity: 8.7 | Language: Python

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

Interesting to see that the semantic chunking in the tools library is a wrapper around GPT-4. It asks GPT-4 for the Python code and executes it: https://github.com/NeumTry/NeumAI/blob/main/neumai-tools/neu...
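To make the pattern in that comment concrete, here is a minimal sketch of "ask a model for chunking code, then execute it". It is not NeumAI's implementation (the linked file is truncated above and was not consulted). `fake_llm`, `build_chunker`, and the `split_text` contract are all hypothetical names invented for illustration, and the model call is replaced by a stub that returns a fixed string.

```python
def fake_llm(prompt):
    """Hypothetical stand-in for a GPT-4 call: returns Python source as text."""
    return (
        "def split_text(text):\n"
        "    # Split on blank lines so each paragraph becomes one chunk.\n"
        "    return [p.strip() for p in text.split('\\n\\n') if p.strip()]\n"
    )


def build_chunker(sample_text, llm=fake_llm):
    """Ask the model for a split_text(text) function and load it with exec()."""
    prompt = (
        "Write a Python function split_text(text) that splits documents "
        "like the following sample into meaningful chunks:\n"
        + sample_text[:200]
    )
    source = llm(prompt)

    namespace = {}
    # exec() runs whatever the model wrote. Outside of a demo this should
    # only happen inside a sandbox, since the code is untrusted.
    exec(source, namespace)

    chunker = namespace.get("split_text")
    if not callable(chunker):
        raise ValueError("generated code did not define split_text(text)")
    return chunker


if __name__ == "__main__":
    document = "First paragraph.\n\nSecond paragraph.\n\n"
    chunker = build_chunker(document)
    print(chunker(document))
```

The main trade-off of this approach is that the chunking logic adapts to each document type, at the cost of executing generated code, which is why the sketch validates that the expected function exists before returning it.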

fast_vector_similarity

Mentions: 7 | Stars: 324 | Activity: 7.2 | Language: Rust

The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.

Got it. I'd encourage you to expose more of that functionality at the level of your application if possible. I think there is a lot of potential in using more than just cosine similarity, especially when there are lots of candidates and you really want to sharpen up the top few recommendations to the best ones. You might find this open-source library I made recently useful for that:
https://github.com/Dicklesworthstone/fast_vector_similarity
I've had good results from starting with cosine similarity (using FAISS) and then "enriching" the top results from that with more sophisticated measures of similarity from my library to get the final ranking.
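The two-stage ranking described in that comment can be sketched roughly as follows. This is an illustration under stated assumptions, not the commenter's code: it uses brute-force cosine similarity in place of FAISS, and a pure-Python Spearman rank correlation as a stand-in for the "more sophisticated measures" (it does not call fast_vector_similarity). The function names and the `shortlist_size` parameter are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(a, b):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0.0 or var_b == 0.0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def spearman_rho(a, b):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(a), average_ranks(b))


def two_stage_search(query, candidates, shortlist_size=3):
    """Shortlist by cosine similarity, then re-rank the shortlist by Spearman.

    candidates maps a name to its embedding vector.
    Returns (name, spearman score) pairs, best first.
    """
    by_cosine = sorted(
        candidates,
        key=lambda name: cosine_similarity(query, candidates[name]),
        reverse=True,
    )
    shortlist = by_cosine[:shortlist_size]
    rescored = [(name, spearman_rho(query, candidates[name])) for name in shortlist]
    rescored.sort(key=lambda item: item[1], reverse=True)
    return rescored


if __name__ == "__main__":
    query = [1.0, 2.0, 3.0, 4.0]
    candidates = {
        "a": [2.0, 4.0, 6.0, 8.0],
        "b": [4.0, 3.0, 2.0, 1.0],
        "c": [1.0, 2.0, 4.0, 3.0],
    }
    print(two_stage_search(query, candidates, shortlist_size=2))
```

The cheap first stage keeps the expensive measure off the full candidate set; only the shortlist pays the extra cost, which is the point of the "enrich the top results" idea.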

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

NPi – An Open Source project for enhancing AI Agents in taking action
5 projects | news.ycombinator.com | 2 May 2024

Ask HN: LLM workflows to avoid copying and pasting from the web interfaces?
1 project | news.ycombinator.com | 3 May 2024

Ask HN: What's with the Gatekeeping in Open Source?
1 project | news.ycombinator.com | 2 May 2024

Fixing a real-world bug with AI using Claude Opus 3 with Plandex [video]
1 project | news.ycombinator.com | 2 May 2024

Show HN: FileKitty – Combine and label text files for LLM prompt contexts
5 projects | news.ycombinator.com | 1 May 2024

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.
Tags: AI, Data, Embeddings, ETL, llm
Post date: 21 Nov 2023
