Python Parsing

Open-source Python projects categorized as Parsing

Top 23 Python Parsing Projects

  • pydantic

    Data validation using Python type hints

    Project mention: utype VS pydantic - a user suggested alternative | | 2024-02-15

    utype is a concise alternative of pydantic with simplified parameters and usages, supporting both sync/async functions and generators parsing, and capable of using native logic operators to define logical types like AND/OR/NOT, also provides custom type parsing by register mechanism that supports libraries like pydantic, attrs and dataclasses

  • maigret

    🕵️‍♂️ Collect a dossier on a person by username from thousands of sites

    Project mention: IWTL how to find and delete old online accounts that I've forgotten about | /r/IWantToLearn | 2023-04-17


  • WorkOS

    The modern API for authentication & user identity. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • Maya

    Datetimes for Humans™

  • llmware

    Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.

    Project mention: FLaNK Stack Weekly 19 Feb 2024 | | 2024-02-19
  • dateutil

    Useful extensions to the standard Python datetime features

    Project mention: Using Openpyxl - keep min date, handle line breaks, handle duplicates | /r/learnpython | 2023-05-01

    Here is an example for a single cell (I'm using the dateutil package to parse the strings):

  • pyparsing

    Python library for creating PEG parsers

    Project mention: Pyparsing 3.1.0 released | /r/pyparsing | 2023-06-19

    After over a year since the last release of pyparsing, I've bundled up all the bug-fixes and changes, and they are now released as pyparsing 3.1.0. Visit this link for the details.

  • plaso

    Super timeline all the things

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • pydantic-core

    Core validation logic for pydantic written in rust

    Project mention: Is there a pydantic.BaseSettings equivalent in rust? | /r/rust | 2023-06-05

    Funny that you ask... Unfortunately it seems that the functionality you ask for is not (yet) part of this ...

  • facexlib

    FaceXlib aims at providing ready-to-use face-related functions based on current STOA open-source methods.

    Project mention: stable diffusion downloads something from github when making a image | /r/StableDiffusion | 2023-07-22


  • socid-extractor

    ⛏️ Extract accounts info from personal pages on various sites for OSINT purpose

  • WhatsApp-Chat-Exporter

    A customizable Android and iOS/iPadOS WhatsApp database parser that will give you the history of your WhatsApp conversations in HTML and JSON. Android Backup Crypt12, Crypt14, Crypt15, and new schema supported.

    Project mention: I am willing to pay hundreds of dollar to have my conversation with my parents on Whatsapp preserved, but there is no solution. No body other than me cares? | /r/DataHoarder | 2023-06-02

    Since you have the backup, this should be an option:

  • FormatFuzzer

    FormatFuzzer is a framework for high-efficiency, high-quality generation and parsing of binary inputs.

  • pytago

    A source-to-source transpiler for Python to Go translation

  • funcparserlib

    Recursive descent parsing library for Python based on functional combinators

  • py-pdf-parser

    A Python tool to help extracting information from structured PDFs.

  • wikitextparser

    A Python library to parse MediaWiki WikiText

  • OpenSIEM-Logstash-Parsing

    SIEM Logstash parsing for more than hundred technologies

  • yacv

    Yet Another Compiler Visualizer

  • parglare

    A pure Python LR/GLR parser -

    Project mention: Parsing: The Solved Problem That Isn't (2011) | | 2024-02-21

    These are not new, but my takeaways from and are to embrace various forms of LR parsing. is a very capable GLR parser, and I've been keeping a close eye on it for use in my projects.

  • tree-hugger

    A light-weight, extendable, high level, universal code parser built on top of tree-sitter

  • arxiv-miner

    arxiv_miner is a toolkit for mining research papers on CS ArXiv.

  • htmldate

    Fast and robust date extraction from web pages, with Python or on the command-line

  • dataconf

    Simple dataclasses configuration management for Python with hocon/json/yaml/properties/env-vars/dict/cli support.


    Learn 300+ open source libraries for free using AI. LearnThisRepo lets you learn 300+ open source repos including Postgres, Langchain, VS Code, and more by chatting with them using AI!

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-21.

Python Parsing related posts


What are some of the best open-source Parsing projects in Python? This list will help you:

Project Stars
1 pydantic 17,820
2 maigret 9,454
3 Maya 3,399
4 llmware 2,645
5 dateutil 2,209
6 pyparsing 2,050
7 plaso 1,591
8 pydantic-core 1,228
9 facexlib 700
10 socid-extractor 554
11 WhatsApp-Chat-Exporter 403
12 FormatFuzzer 378
13 pytago 367
14 funcparserlib 333
15 py-pdf-parser 323
16 wikitextparser 260
17 OpenSIEM-Logstash-Parsing 172
18 yacv 132
19 parglare 132
20 tree-hugger 115
21 arxiv-miner 106
22 htmldate 104
23 dataconf 76
Learn 300+ open source libraries for free using AI.
LearnThisRepo lets you learn 300+ open source repos including Postgres, Langchain, VS Code, and more by chatting with them using AI!