| | doctor | pydoxtools |
|---|---|---|
| Mentions | 1 | 2 |
| Stars | 50 | 56 |
| Growth | - | - |
| Activity | 6.2 | 9.5 |
| Last commit | 2 days ago | 3 months ago |
| Language | Python | Python |
| License | BSD 2-clause "Simplified" License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
doctor
- WordPerfect for Unix Character Terminals
WordPerfect is still used a little by lawyers.
Here's how the free law project parses it: wpd2html
https://github.com/freelawproject/doctor/blob/main/doctor/ta...
pydoxtools
- What is the most cost-efficient way to have an embedding generator endpoint that is using an open-source embedding model? [D]
- File acquisition to S3
Also, I saw this repo in a r/python post yesterday https://github.com/Xyntopia/pydoxtools
What are some alternatives?
pexicdb - Pexicdb is a simple model based file database
RankGPT - [EMNLP 2023 Outstanding Paper Award] Is ChatGPT Good at Search? LLMs as Re-Ranking Agent
AutoLearn-GPT - ChatGPT learns automatically.
tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
pandora - Pandora is an analysis framework to discover if a file is suspicious and conveniently show the results
gensim - Topic Modelling for Humans
rakun2 - RaKUn 2.0 - A fast keyword detection algorithm
haystack - LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
megabots - 🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵
searchGPT - Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.