Top 23 Python search-engine Projects
- PaddleNLP: 👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂 Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis, etc.
- txtai: 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
- swirl-search: Swirl is an open-source search platform that uses AI to search multiple content and data sources simultaneously and return AI-ranked results. It also uses LLMs to summarize the answers from your searches, making it a one-click, easy-to-use Retrieval Augmented Generation (RAG) solution.
- Search Engine Parser: Lightweight package to query popular search engines and scrape for result titles, links and descriptions
- mindflow: 🧠 AI-powered CLI git wrapper, boilerplate code generator, chat history manager, and code search engine to streamline your dev workflow 🌊
- HyperTag: Intuitive Knowledge Management WebApp & CLI for Humans using Deep Learning & Tags
- PatZilla: PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Project mention: [Self Hosted] Self-hosted mail servers: mailcow, mailinabox, mailU... have you tested them (in depth)? Your opinions and advice here, thanks! | /r/aufdeutsch | 2023-04-27
txtai is an all-in-one embeddings database for semantic search, LLM orchestration and language model workflows.
We (Marqo) are doing a lot on 1 and 2. There is a huge amount to be done on the ML side of vector search and we are investing heavily in it. I think it has not quite sunk in that vector search systems are ML systems and everything that comes with that. I would love to chat about 1 and 2 so feel free to email me (email is in my profile). What we have done so far is here -> https://github.com/marqo-ai/marqo
Project mention: GitHub - swirlai/swirl-search: Swirl is an open-source search platform that uses AI to search multiple content and data sources simultaneously, finds the best results using a reader LLM, then prompts Generative AI, enabling you to get answers based on your data. | /r/programming | 2023-12-05
My personal knowledge base is hosted on GitHub at https://raphaelsty.github.io/knowledge/. Every day it scans the documents I like using GitHub Actions, Zotero, Hacker News upvotes and GitHub likes. It's not yet optimized for smartphones. It costs me $5 a year to host.
Project mention: Show HN: GPT-Powered Video Retrieval and Streaming | news.ycombinator.com | 2024-02-08
This is really cool. I have a pretty fast BM25 search engine in Pandas I've been working on for local testing.
https://github.com/softwaredoug/searcharray
Why Pandas? Because BM25 is one thing, but you also want to combine with other factors (recency, popularity, etc) easily computed in pandas / numpy...
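To make the point in the comment above concrete, here is a minimal sketch of blending a BM25 text score with a recency factor. It uses only the Python standard library, not pandas and not searcharray's API. The toy corpus, the function names, the half-life decay, and the `recency_weight` parameter are all assumptions chosen for illustration.

```python
import math
from collections import Counter

# Hypothetical toy corpus of (text, age_in_days); not taken from the quoted post.
DOCS = [
    ("python search engine tutorial", 400),
    ("search engine in python with bm25 ranking", 5),
    ("cooking pasta at home", 1),
]


def bm25_scores(query, texts, k1=1.2, b=0.75):
    """Score each text against the query with a plain Okapi BM25 formula."""
    tokenized = [text.split() for text in texts]
    n_docs = len(tokenized)
    avg_len = sum(len(toks) for toks in tokenized) / n_docs
    # Document frequency: how many documents contain each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query.split():
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def blended_scores(query, docs_with_age, half_life_days=30.0, recency_weight=0.5):
    """Boost BM25 by a recency factor that halves every `half_life_days`."""
    texts = [text for text, _ in docs_with_age]
    blended = []
    for score, (_, age) in zip(bm25_scores(query, texts), docs_with_age):
        recency = 0.5 ** (age / half_life_days)  # 1.0 when new, decays toward 0
        # Multiplicative boost: documents with no text match stay at zero.
        blended.append(score * (1 + recency_weight * recency))
    return blended


if __name__ == "__main__":
    texts = [text for text, _ in DOCS]
    for text, raw, mixed in zip(texts, bm25_scores("python search", texts),
                                blended_scores("python search", DOCS)):
        print(f"{raw:.3f} -> {mixed:.3f}  {text}")
```

On this toy data, BM25 alone favors the shorter, older document, while the blended score favors the recent one. In pandas or numpy the same blend would be a single vectorized expression over score and age columns, which is the convenience the comment describes.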
Python search-engine related posts
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
- Perform Image-Driven Reverse Image Search on E-Commerce Sites with ImageBind and Qdrant
- Show HN: GPT-Powered Video Retrieval and Streaming
- A search engine in 80 lines of Python
- FLaNK Stack 29 Jan 2024
- Is YouTube starting to protect channel RSS feeds?
- Are we at peak vector database?
Index
What are some of the best open-source search-engine projects in Python? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | PaddleNLP | 11,335 |
2 | Mailpile | 8,782 |
3 | whoogle-search | 8,713 |
4 | txtai | 6,910 |
5 | marqo | 4,086 |
6 | search-plugins | 3,426 |
7 | gerev | 2,596 |
8 | swirl-search | 1,501 |
9 | mwmbl | 1,353 |
10 | Maryam | 929 |
11 | bertsearch | 887 |
12 | knowledge | 522 |
13 | Search Engine Parser | 429 |
14 | StreamRAG | 392 |
15 | Yuno | 373 |
16 | addok | 315 |
17 | mindflow | 215 |
18 | HyperTag | 180 |
19 | searcharray | 141 |
20 | houndsploit | 113 |
21 | relevanceai | 98 |
22 | PatZilla | 93 |
23 | achoz | 76 |