PyWhat: Identify Anything

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. pyWhat

    🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙‍♀️

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. FuckIt.py

    The Python error steamroller.

    In the same vague theme of "I don't know what I'm dealing with" : https://github.com/ajalt/fuckitpy

  4. hachoir

    Hachoir is a Python library to view and edit a binary stream field by field

    Another one sort of related is hachoir, and specifically the hachoir-metadata script: https://github.com/vstinner/hachoir

  5. usaddress

    :us: a python library for parsing unstructured United States address strings into address components

    Some great probabilistic python libraries:

    https://github.com/datamade/usaddress - "usaddress is a Python library for parsing unstructured address strings into address components, using advanced NLP methods."

    https://github.com/datamade/probablepeople - "probablepeople is a python library for parsing unstructured romanized name or company strings into components, using advanced NLP methods."

  6. probablepeople

    :family: a python library for parsing unstructured western names into name components.

    Some great probabilistic python libraries:

    https://github.com/datamade/usaddress - "usaddress is a Python library for parsing unstructured address strings into address components, using advanced NLP methods."

    https://github.com/datamade/probablepeople - "probablepeople is a python library for parsing unstructured romanized name or company strings into components, using advanced NLP methods."

  7. DataProfiler

    What's in your data? Extract schema, statistics and entities from datasets

    We built a similar tool, utilizing a CNN. It works on structured (and unstructured) data and provides additional info.

    https://github.com/capitalone/DataProfiler

    Cool part, is you can “extend” the intern name-entity recognition model by refitting with the new data.

    Out if the box, the DataProfiler does something like 18 entities including most of the PII dada.

  8. chardet

    Python character encoding detector

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. fuckitjs

    The Original Javascript Error Steamroller

    Didn't know there was a python version, but as the README says, this is based on the classic fuckitjs: https://github.com/mattdiamond/fuckitjs

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Python Malware Starting to Employ Anti-Debug Techniques

    7 projects | news.ycombinator.com | 23 Dec 2022
  • A ranked list of machine learning Python libraries. Updated weekly

    1 project | news.ycombinator.com | 31 Jan 2025
  • Analyzing LinkedIn Company Posts with Graphs and Agents

    1 project | dev.to | 12 Jan 2025
  • Property-Based Testing for the People

    5 projects | news.ycombinator.com | 6 Jan 2025
  • Proptest: property testing in Rust

    6 projects | dev.to | 27 Dec 2024