raglite

🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL (by superlinear-ai)

Raglite Alternatives

Similar projects and alternatives to raglite

  1. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  2. rag-postgres-openai-python

    A RAG app to ask questions about rows in a database table. Deployable on Azure Container Apps with PostgreSQL Flexible Server.

  3. simple-pgvector-python

    An Abstraction Using a similar API to Pinecone but implemented with pgvector python

  4. rag-with-amazon-postgresql-using-pgvector-and-sagemaker

    Question Answering application with Large Language Models (LLMs) and Amazon Postgresql using pgvector

  5. pgserver

    Pip-installable, embedded-like postgres server for your python app

  6. tech-trend-tracker

    An application that tracks technology trends from news articles.

  7. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  8. ragflow

    14 raglite VS ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

  9. vector_sqlite

    Faiss with sqlite

  10. txtai

    385 raglite VS txtai

    💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

  11. trieve

    20 raglite VS trieve

    All-in-one platform for search, recommendations, RAG, and analytics offered via API

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better raglite alternative or higher similarity.

raglite discussion

Log in or Post with

raglite reviews and mentions

Posts with mentions or reviews of raglite. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-11-23.
  • Show HN: RAGLite – A Python package for the unhobbling of RAG
    1 project | news.ycombinator.com | 19 Dec 2024
  • 32k context length text embedding models
    4 projects | news.ycombinator.com | 23 Nov 2024
    The name ‘late chunking’ is indeed somewhat of a misnomer in the sense that the technique does not partition documents into document chunks. What it actually does is to pool token embeddings (of a large context) into say sentence embeddings. The result is that your document is now represented as a sequence of sentence embeddings, each of which is informed by the other sentences in the document.

    Then, you want to parition the document into chunks. Late chunking pairs really well with semantic chunking because it can use late chunking's improved sentence embeddings to find semantically more cohesive chunks. In fact, you can cast this as a binary integer programming problem and find the ‘best’ chunks this way. See RAGLite [1] for an implementation of both techniques including the formulation of semantic chunking as an optimization problem.

    Finally, you have a sequence of document chunks, each represented as a multi-vector sequence of sentence embeddings. You could choose to pool these sentence embeddings into a single embedding vector per chunk. Or, you could leave the multi-vector chunk embeddings as-is and apply a more advanced querying technique like ColBERT's MaxSim [2].

    [1] https://github.com/superlinear-ai/raglite

    [2] https://huggingface.co/blog/fsommers/document-similarity-col...

Stats

Basic raglite repo stats
3
1,023
8.9
27 days ago

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com