Exploring Methods to Improve Text Chunking in RAG Models (and other things...)

This page summarizes the projects mentioned and recommended in the original post on /r/GPT3

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • vectorboard

    Open Source Embeddings Optimisation and Eval Framework for RAG/LLM Applications. Documentations at https://docs.vectorboard.ai/introduction

  • Hi, about chunking, if the text is structured (markdown or html), you can take headding and paragraph as a chunking unit, but the result is also affected by the applied embeddings, which can be evaluated separately first, for standard chunking methods with different chunk lengths, for example with this tool https://github.com/VectorBoard/vectorboard.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts