SaaSHub helps you find the best software and product alternatives Learn more β
Txtai Alternatives
Similar projects and alternatives to txtai
-
-
InfluxDB
InfluxDB β Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
-
Milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
-
-
-
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
-
annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
-
-
-
-
EdgeChains
EdgeChains.js is Full-Stack GenAI library. Front-end, backend, apis, prompt management, distributed computing. All core prompts & chains are managed declaratively in jsonnet (and not hidden in classes)
-
-
-
-
-
-
tika-python
Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
txtai discussion
txtai reviews and mentions
- Chunking your data for RAG
-
Analyzing LinkedIn Company Posts with Graphs and Agents
txtai is an all-in-one embeddings database for semantic search, LLM orchestration and language model workflows.
- Getting started with LLM APIs
-
Lists of open-source frameworks for building RAG applications
Ideal For: Projects requiring quick setup and robust search capabilities. GitHub Repository
-
Show HN: I made a website to semantically search ArXiv papers
Excellent project.
As mentioned in another comment, I've put together an embeddings database using the arxiv dataset (https://huggingface.co/NeuML/txtai-arxiv) recently.
For those interested in the literature search space, a couple other projects I've worked on that may be of interest.
annotateai (https://github.com/neuml/annotateai) - Annotates papers with LLMs. Supports searching the arxiv database mentioned above.
paperai (https://github.com/neuml/paperai) - Semantic search and workflows for medical/scientific papers. Built on txtai (https://github.com/neuml/txtai)
paperetl (https://github.com/neuml/paperetl) - ETL processes for medical and scientific papers. Supports full PDF docs.
-
Building Effective "Agents"
If you're looking for a lightweight open-source framework designed to handle the patterns mentioned in this article: https://github.com/neuml/txtai
Disclaimer: I'm the author of the framework.
-
Postgres for Everything (E/Postgres)
I fully agree. Postgres has solved many of the problems that many are re-solving with GenAI related databases.
With txtai (https://github.com/neuml/txtai), I've went all in with Postgres + pgvector. Projects can start small with a SQLite backend then switch the persistence to Postgres. With this, you get all the years of battle-tested production experience from Postgres built-in for free.
-
Voice Activity Detection in Elixir with Membran
VAD is certainly a complex but underappreciated topic. If you like signal processing, FFTs and want to see a similar concept implemented in Python, then check out this code. It's has a fairly well-tuned VAD component built-in.
https://github.com/neuml/txtai/blob/master/src/python/txtai/...
-
Pinecone integrates AI inferencing with vector database
txtai (https://github.com/neuml/txtai) has had inline vectorization since 2020. It supports Transformers, llama.cpp and LLM API services. It also has inline integration with LLM models and a built-in RAG pipeline.
-
Show HN: Open-Source Colab Notebooks to Implement Advanced RAG Techniques
An alternative you can try is txtai (https://github.com/neuml/txtai).
Disclaimer: I'm the primary developer
-
A note from our sponsor - SaaSHub
www.saashub.com | 23 May 2025
Stats
neuml/txtai is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of txtai is Python.