xpinyin
Levenshtein
| | xpinyin | Levenshtein |
|---|---|---|
| Mentions | - | 2 |
| Stars | 809 | 1,239 |
| Growth | - | - |
| Activity | 2.6 | 0.0 |
| Last commit | over 3 years ago | over 2 years ago |
| Language | Python | C |
| License | BSD 3-clause "New" or "Revised" License | GNU General Public License v2.0 or later |
Stars is the number of stars that a project has on GitHub. Growth is the month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
xpinyin
We haven't tracked posts mentioning xpinyin yet.
Tracking mentions began in Dec 2020.
Levenshtein
- Is it possible on Python?
  Yeah my hunch is that a combination of nltk, python-Levenshtein, numpy for language processing, pandas for gathering results and scrapy for web scraping should make it possible. Sadly such a project probably requires at least a month or two worth of training in Python to prototype. Good luck OP.
- Four Useful Python Libraries You Don't Know About
  I've used fuzzy-wuzzy and it is pretty slow if you can't install python-Levenshtein (which I couldn't, though I don't remember why). I ended up uninstalling it and using a custom matching algorithm for search in my app.
What are some alternatives?
pyfiglet - An implementation of figlet written in Python
fuzzywuzzy - Fuzzy String Matching in Python
Chinese character to pinyin conversion tool (Python version) - Converts Chinese characters to pinyin (pypinyin)
jellyfish - 🪼 a python library for doing approximate and phonetic matching of strings.
TextDistance - 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
ftfy - Fixes mojibake and other glitches in Unicode text, after the fact.
chardet - Python character encoding detector
Charset Normalizer - Truly universal encoding detector in pure Python
uniout - Never see escaped bytes in output.
shortuuid - A generator library for concise, unambiguous and URL-safe UUIDs.