Top 23 Python Text processing Projects
Fuzzy String Matching in PythonLatest mention: Comparing Strings Is Easy With FuzzyWuzzy | dev.to | 2021-01-13
The code implemented by each of the functions described above, as well as other useful FuzzyWuzzy functions, can be found here.
Data parsing and validation using Python type hints
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.Latest mention: Get Diff and Patch Html | dev.to | 2021-01-24
Photo by Markus Spiske on Diff.Match.Patch based on Google library.
Fixes mojibake and other glitches in Unicode text, after the fact.
Python port of Google's libphonenumber
A non-validating SQL parser module for Python
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.Latest mention: JSON parser | reddit.com/r/Python | 2021-01-14
Writing parsers by hand is fun, but it's much easier to use a parser: https://github.com/lark-parser/lark/blob/master/examples/json_parser.py
Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
Python Lex-YaccLatest mention: Good Resources for creating a programming language | dev.to | 2021-01-02
dabeaz / ply
Python character encoding detector
A generator library for concise, unambiguous and URL-safe UUIDs.
🎐 a python library for doing approximate and phonetic matching of strings.
A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.
Python library for creating PEG parsers
Python library for creating PEG parsersLatest mention: Perform mathematical operations based on a string from user - Best way? Any existing library? | reddit.com/r/learnpython | 2021-01-01
Returns unicode slugsLatest mention: Simple Cli Tool To View Trending Repositories And | reddit.com/r/Python | 2020-12-28
An implementation of figlet written in Python
Translate Chinese hanzi to pinyin (拼音) by Python, 汉字转拼音
Construct: Declarative data structures for python that allow symmetric parsing and building
Python flexible slugify function
A simple Python module for parsing human names into their individual components
A slugifier that works in unicode
What are some of the best open-source Text processing projects in Python? This list will help you: