Top 23 Python Text Projects
- aeneas: a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
- pytorch-widedeep: a flexible package for multimodal deep learning that combines tabular data with text and images using Wide and Deep models in PyTorch
- 1filellm: specify a GitHub or local repo, GitHub pull request, arXiv or Sci-Hub paper, YouTube transcript, or documentation URL on the web, and scrape it into a text file and the clipboard for easier LLM ingestion
- py_midicsv: a Python port and library-fication of the midicsv tool by John Walker. If you need to convert MIDI files to human-readable text files and back, this is the library for you.
- semchunk: a fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
- pytextcodifier: turn your text files into codified images or your codified images into text files.
Project mention: Exploring Open-Source Alternatives to Landing AI for Robust MLOps | dev.to | 2023-12-13
Numerous tools exist for detecting anomalies in time series data, but Alibi Detect stood out to me, particularly for its capabilities and its compatibility with both TensorFlow and PyTorch backends.
Project mention: ART 6.0 released: ASCII and Non-ASCII art library for Python (+ Space support) | /r/coolgithubprojects | 2023-06-14
Evennia - MUD server (text-based MMORPG). Python
I created a CLI tool that copies a GitHub or local repo into a text file for LLM ingestion. It only pulls the file types you specify.
https://github.com/jimmc414/1filellm
Project mention: Ask HN: What Underrated Open Source Project Deserves More Recognition? | news.ycombinator.com | 2024-03-07
See https://github.com/pszemraj/textsum. He's the guy who trained most of the popular finetuned long models. He created a pip package to make life easier (it uses Hugging Face under the hood, pre-selects good models, and hides the boilerplate).
Project mention: semchunk alternatives - text-splitter and langchain | libhunt.com/r/semchunk | 2023-11-09
Python Text related posts
- Building a Multi-Tenant App with FastAPI, SQLModel, and PropelAuth
- On why Markdown is not a good, or even a half-decent, markup language
- ART 6.0 released: ASCII and Non-ASCII art library for Python (+ Space support)
- Show HN: I turned my microeconomics textbook into a chatbot with GPT-3
- Modern Polars: an extensive side-by-side comparison of Polars and Pandas
- Show HN: Pygame's Text Input Module
- How to create diagrams via code?
Index
What are some of the best open-source Text projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | TextRecognitionDataGenerator | 3,038 |
2 | aeneas | 2,379 |
3 | alibi-detect | 2,082 |
4 | art | 1,988 |
5 | evennia | 1,715 |
6 | pytorch-widedeep | 1,234 |
7 | pygame-menu | 502 |
8 | pangu.py | 233 |
9 | 1filellm | 214 |
10 | pygame-text-input | 138 |
11 | orange3-text | 124 |
12 | textsum | 110 |
13 | py_midicsv | 72 |
14 | zeroshot_topics | 60 |
15 | Quote2Image | 58 |
16 | To-ASCII | 57 |
17 | namekrea | 49 |
18 | semchunk | 22 |
19 | Oz-Engine | 14 |
20 | pytextcodifier | 14 |
21 | litemark | 13 |
22 | pythontextnow | 10 |
23 | linesieve | 7 |