-
pyWhat
🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙‍♀️
-
In the same vague theme of "I don't know what I'm dealing with": https://github.com/ajalt/fuckitpy
-
Another one sort of related is hachoir, and specifically the hachoir-metadata script: https://github.com/vstinner/hachoir
-
usaddress
:us: a python library for parsing unstructured United States address strings into address components
Some great probabilistic python libraries:
https://github.com/datamade/usaddress - "usaddress is a Python library for parsing unstructured address strings into address components, using advanced NLP methods."
https://github.com/datamade/probablepeople - "probablepeople is a python library for parsing unstructured romanized name or company strings into components, using advanced NLP methods."
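For anyone curious what the parsed output looks like in practice, here is a minimal sketch, not an official recipe. It assumes `usaddress` is installed and relies on `usaddress.parse`, which returns a list of `(token, label)` tuples. The `group_labels` and `parse_address` helpers are hypothetical names made up for this illustration; they simply merge tokens that share a label.

```python
def group_labels(pairs):
    """Merge tokens that share a label into one space-joined string per label.

    `pairs` is a list of (token, label) tuples, the shape usaddress.parse returns.
    Labels keep the order in which they first appear.
    """
    grouped = {}
    for token, label in pairs:
        grouped.setdefault(label, []).append(token)
    return {label: " ".join(tokens) for label, tokens in grouped.items()}


def parse_address(text):
    # Hypothetical wrapper: imported lazily so group_labels works
    # even when the package is not installed.
    import usaddress  # pip install usaddress

    return group_labels(usaddress.parse(text))


if __name__ == "__main__":
    # The exact labels depend on the trained model, so none are asserted here.
    print(parse_address("123 Main St. Suite 100 Chicago, IL"))
```

The library also ships its own `usaddress.tag`, which does a similar merge and raises an error on repeated labels, so the helper above is mostly useful for seeing the raw token/label pairs. probablepeople exposes the same `parse` and `tag` pair for names.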
-
probablepeople
:family: a python library for parsing unstructured western names into name components.
-
We built a similar tool, utilizing a CNN. It works on structured (and unstructured) data and provides additional info.
https://github.com/capitalone/DataProfiler
The cool part is you can "extend" the internal named-entity recognition model by refitting it with new data.
Out of the box, the DataProfiler detects something like 18 entities, including most of the PII data.
-
Didn't know there was a python version, but as the README says, this is based on the classic fuckitjs: https://github.com/mattdiamond/fuckitjs