Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work. Learn more →
Top 23 Python Parsing Projects
Data validation using Python type hintsProject mention: [DISCUSSION] What's your favorite Python library, and how has it helped you in your projects? | /r/pythonhelp | 2023-04-22
As for the most utilized and still loved library, that would probably be pydantic, it helps declaring types so convenient - be it dto's, models or just complex arguments - and plays nice with bunch of other libraries from it's own ecosystem.
🕵️♂️ Collect a dossier on a person by username from thousands of sitesProject mention: IWTL how to find and delete old online accounts that I've forgotten about | /r/IWantToLearn | 2023-04-17
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
Datetimes for Humans™
Useful extensions to the standard Python datetime featuresProject mention: Using Openpyxl - keep min date, handle line breaks, handle duplicates | /r/learnpython | 2023-05-01
Here is an example for a single cell (I'm using the dateutil package to parse the strings):
Python library for creating PEG parsersProject mention: Need help developing an interpreter | /r/learnpython | 2023-03-07
Look into "parser combinators" for building an interpreter. There's a few ones out there, but PyParsing is one I've seen around that looks pretty nifty.
Super timeline all the thingsProject mention: Custom DFIR | /r/computerforensics | 2023-02-09
However, what you are trying to do has already been done. For collections look at velociraptor's offline collector https://github.com/Velocidex/velociraptor. For processing check out Log2Timeline (plaso) https://github.com/log2timeline/plaso.
Core validation logic for pydantic written in rustProject mention: Investigating Pydantic v2's Bold Performance Claims | dev.to | 2023-05-17
I encourage you to checkout the official benchmarks for more realistic and detailed examples, and, as always, YMMV.
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
FaceXlib aims at providing ready-to-use face-related functions based on current STOA open-source methods.Project mention: local Windows installation of GFP-GAN | /r/MLQuestions | 2022-07-16
# Install facexlib - https://github.com/xinntao/facexlib
⛏️ Extract accounts info from personal pages on various sites for OSINT purposeProject mention: Looking for a good open source web scraping tool | /r/webscraping | 2023-01-14
Check this for profiles: https://github.com/soxoj/socid-extractor
FormatFuzzer is a framework for high-efficiency, high-quality generation and parsing of binary inputs.
A source-to-source transpiler for Python to Go translationProject mention: Learning Go as a Python Developer: The Good and the Bad | news.ycombinator.com | 2022-07-18
Similarly helpful, pytago is a source to source transpired for python to go
Recursive descent parsing library for Python based on functional combinators
A Python tool to help extracting information from structured PDFs.Project mention: Need free/low-cost software that allows me to view the tags in a PDF. | /r/pdf | 2023-01-31
Maybe look at this?
A Python library to parse MediaWiki WikiText
A customizable Android and iPhone WhatsApp database parser that will give you the history of your WhatsApp conversations in HTML and JSON. Android Backup Crypt12, Crypt14, Crypt15, and new schema supported.Project mention: I am willing to pay hundreds of dollar to have my conversation with my parents on Whatsapp preserved, but there is no solution. No body other than me cares? | /r/DataHoarder | 2023-06-02
Since you have the backup, this should be an option: https://github.com/KnugiHK/Whatsapp-Chat-Exporter
SIEM Logstash parsing for more than hundred technologies
Yet Another Compiler Visualizer
A light-weight, extendable, high level, universal code parser built on top of tree-sitterProject mention: Tree-Hugger: Mine / Query source code | news.ycombinator.com | 2022-10-02
arxiv_miner is a toolkit for mining research papers on CS ArXiv.
Fast and robust date extraction from web pages, with Python or on the command-line
Simple dataclasses configuration management for Python with hocon/json/yaml/properties/env-vars/dict/cli support.
Parse Robinhood 1099 Tax Document from PDF into CSV
Python module to parse Hearthstone Power.log files
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Parsing related posts
[DISCUSSION] What's your favorite Python library, and how has it helped you in your projects?
2 projects | /r/pythonhelp | 22 Apr 2023
Ingesting, parsing and making sense of device log data
2 projects | /r/dataengineering | 19 Apr 2023
data structures & algorithms resources available with python ?
1 project | /r/Python | 15 Mar 2023
Legality of Gits in OSINT - is Sherlock / Maigret legal?
1 project | /r/OSINT | 13 Mar 2023
what is colon (:) operator?
1 project | /r/learnpython | 8 Mar 2023
The Privacy, Security, & OSINT Show: 290-Extreme Privacy: Mobile Devices
1 project | /r/PrivacySecurityOSINT | 21 Feb 2023
Show HN: Replbuilder, quickly build a Python REPL CLI prompt
5 projects | news.ycombinator.com | 19 Feb 2023
A note from our sponsor - Sonar
www.sonarsource.com | 4 Jun 2023
What are some of the best open-source Parsing projects in Python? This list will help you: