Top 23 Python Text processing Projects
Data validation using Python type hints
Project mention: popularity behind pydantic | reddit.com/r/Python | 2023-03-24
I did read this ... Pydantic Docs.
Fuzzy String Matching in Python
Project mention: Need help solving a subtitles problem. The logic seems complex | reddit.com/r/learnpython | 2023-01-19
Do fuzzy matching (something like fuzzywuzzy maybe) to see if the words line up (allowing for wrong words). You'll need to work out how to use scoring to measure how well aligned the two lists are.
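The scoring idea in that reply can be sketched with the standard library alone. This is a minimal illustration, not fuzzywuzzy's API: it uses difflib.SequenceMatcher as a stand-in, and the names word_similarity, alignment_score, and the 0.7 threshold are assumptions chosen for the example.

```python
from difflib import SequenceMatcher


def word_similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio between two words, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def alignment_score(expected: list, heard: list, threshold: float = 0.7) -> float:
    """Fraction of position-wise word pairs that are 'close enough'.

    Words are compared pairwise by position; a pair counts as a match
    when its similarity meets the (assumed) threshold. Dividing by the
    longer list penalises missing or extra words.
    """
    if not expected or not heard:
        return 0.0
    matches = sum(
        1 for a, b in zip(expected, heard) if word_similarity(a, b) >= threshold
    )
    return matches / max(len(expected), len(heard))


if __name__ == "__main__":
    script = "the quick brown fox".split()
    subtitle = "the quik brown box".split()
    print(alignment_score(script, subtitle))
```

A position-wise comparison only works when both lists are already roughly in step; for lists with inserted or dropped words, running SequenceMatcher over the word lists themselves may be a better starting point.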
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Project mention: Form editing, changelogs, and progressive diffing - am I reinventing the wheel? | reddit.com/r/AskProgramming | 2022-08-06
Outside of that, to get the diffs there is a library called diff-match-patch that has implementations in most languages. Your data model / state tracking sounds like it matches the internal constraints.
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
Project mention: can you create your own program language in python, if yes how? | reddit.com/r/Python | 2023-03-12
Lark is a good library to assist with this.
Fixes mojibake and other glitches in Unicode text, after the fact.
Project mention: 7 Useful Python Libraries You Should Use in Your Next Project | reddit.com/r/Python | 2022-11-23
Python port of Google's libphonenumber
Project mention: Python: Data validation | dev.to | 2023-01-20
Validating a phone number using phonenumbers
A non-validating SQL parser module for Python
Project mention: Data Load Diagram | reddit.com/r/dataengineering | 2023-03-06
Gotcha. Since we haven't actually written all of this yet, I don't have any useful code snippets to share, but we've discussed tackling the problem internally using something like sqlparse. You'd need to identify the relevant SQL chunks, parse them for table dependency information, and then create the relevant entities in whichever data lineage tool you were using.
Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
Project mention: textdistance: Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. | reddit.com/r/coding | 2022-08-04
Python Lex-Yacc
Project mention: Why is the grammar that I defined does not use tokens? (LEX/YACC/python) | reddit.com/r/AskProgramming | 2022-05-20
https://github.com/dabeaz/ply/tree/master/ply you can find yacc and lex files here
Python character encoding detector
Project mention: After almost a year, Ben Eater is back | news.ycombinator.com | 2022-11-05
A generator library for concise, unambiguous and URL-safe UUIDs.
Project mention: Short, friendly base32 slugs from timestamps | news.ycombinator.com | 2023-01-18
I use shortuuid for that stuff, which also omits the capital letter I, and has some other niceties (I wrote the library). It works really well, and I like how small the IDs are.
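To illustrate why IDs built this way come out short, here is a rough sketch of re-encoding a UUID in a larger alphabet that skips look-alike characters. It is not shortuuid's actual implementation or API: the ALPHABET constant and the encode and decode helpers are assumptions made up for this example.

```python
import uuid

# Assumed alphabet: digits and letters minus look-alikes (0, 1, I, O, l).
ALPHABET = "23456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"


def encode(u: uuid.UUID) -> str:
    """Write the UUID's 128-bit integer value in base len(ALPHABET)."""
    n = u.int
    base = len(ALPHABET)
    chars = []
    while n:
        n, rem = divmod(n, base)
        chars.append(ALPHABET[rem])
    # Digits were collected least-significant first, so reverse them.
    return "".join(reversed(chars)) or ALPHABET[0]


def decode(s: str) -> uuid.UUID:
    """Inverse of encode: rebuild the integer, then the UUID."""
    base = len(ALPHABET)
    n = 0
    for ch in s:
        n = n * base + ALPHABET.index(ch)
    return uuid.UUID(int=n)


if __name__ == "__main__":
    original = uuid.uuid4()
    short = encode(original)
    print(short, decode(short) == original)
```

With 57 symbols, any 128-bit value fits in at most 22 characters, compared with 36 for the usual hyphenated hex form.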
A Python library for doing approximate and phonetic matching of strings.
Python library for creating PEG parsers
Project mention: Need help developing an interpreter | reddit.com/r/learnpython | 2023-03-07
Look into "parser combinators" for building an interpreter. There's a few ones out there, but PyParsing is one I've seen around that looks pretty nifty.
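For readers unfamiliar with the term, the parser-combinator idea can be sketched in plain Python. This is not PyParsing's API; every name below (char, many1, seq, fmap, number, addition) is a hypothetical helper written for this example. Each parser is a function from (text, position) to (value, new position), or None on failure, and combinators build bigger parsers out of smaller ones.

```python
from typing import Callable, Optional, Tuple

# A parser takes (text, pos) and returns (value, new_pos), or None on failure.
Parser = Callable[[str, int], Optional[Tuple[object, int]]]


def char(pred: Callable[[str], bool]) -> Parser:
    """Match a single character that satisfies pred."""
    def parse(text, pos):
        if pos < len(text) and pred(text[pos]):
            return text[pos], pos + 1
        return None
    return parse


def many1(p: Parser) -> Parser:
    """Apply p one or more times and collect the values in a list."""
    def parse(text, pos):
        values = []
        while (result := p(text, pos)) is not None:
            value, pos = result
            values.append(value)
        return (values, pos) if values else None
    return parse


def seq(*parsers: Parser) -> Parser:
    """Run parsers one after another; fail if any of them fails."""
    def parse(text, pos):
        values = []
        for p in parsers:
            result = p(text, pos)
            if result is None:
                return None
            value, pos = result
            values.append(value)
        return values, pos
    return parse


def fmap(p: Parser, fn: Callable) -> Parser:
    """Transform the value produced by p with fn."""
    def parse(text, pos):
        result = p(text, pos)
        if result is None:
            return None
        value, new_pos = result
        return fn(value), new_pos
    return parse


# A tiny grammar: addition := number "+" number
number = fmap(many1(char(lambda c: c in "0123456789")), lambda ds: int("".join(ds)))
plus = char(lambda c: c == "+")
addition = fmap(seq(number, plus, number), lambda parts: parts[0] + parts[2])

if __name__ == "__main__":
    print(addition("12+30", 0))
```

An interpreter built this way evaluates while it parses, as addition does here; a larger language would more likely have the parsers return a syntax tree that is evaluated in a separate step.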
A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.
Returns unicode slugs
An implementation of figlet written in Python
Project mention: CS50P WEEK 4 FRANK, IAN AND GLENS LETTERS PSET. | reddit.com/r/cs50 | 2022-11-17
Sorry, just noticed you said you went to GitHub. The way I found it was by going to the intro page (I don't really consider what you're quoting to be docs), then going to GitHub and exploring that. You slightly missed it on GitHub yourself: it is in the __init__.py file: https://github.com/pwaller/pyfiglet/blob/master/pyfiglet/__init__.py . There's also another docs file in that project, but that is mostly for developers who are working on the library itself, not users.
Construct: Declarative data structures for Python that allow symmetric parsing and building
Project mention: MPK mini MK3 not working | reddit.com/r/akaiMPC | 2023-01-18
Translate Chinese hanzi to pinyin (拼音) with Python
A simple Python module for parsing human names into their individual components
Project mention: Least expensive way to find a partial match in database query | reddit.com/r/django | 2022-08-08
Something like https://github.com/derek73/python-nameparser ?
Python flexible slugify function
Truly universal encoding detector in pure Python
A slugifier that works in unicode
Python Text processing related posts
data structures & algorithms resources available with python ?
1 project | reddit.com/r/Python | 15 Mar 2023
can you create your own program language in python, if yes how?
1 project | reddit.com/r/Python | 12 Mar 2023
what is colon (:) operator?
1 project | reddit.com/r/learnpython | 8 Mar 2023
Data Load Diagram
2 projects | reddit.com/r/dataengineering | 6 Mar 2023
Lark a Python lexer/parser library
1 project | news.ycombinator.com | 27 Feb 2023
Ask HN: Will we see a TypeScript for Python?
2 projects | news.ycombinator.com | 26 Feb 2023
Create your own scripting language in Python with Sly
1 project | reddit.com/r/Python | 28 Jan 2023
What are some of the best open-source Text processing projects in Python? This list will help you: