Python Parser

Open-source Python projects categorized as Parser

Top 23 Python Parser Projects

  • pydantic

    Data validation using Python type hints

    Project mention: utype VS pydantic - a user suggested alternative | libhunt.com/r/utype | 2024-02-15

    utype is a concise alternative of pydantic with simplified parameters and usages, supporting both sync/async functions and generators parsing, and capable of using native logic operators to define logical types like AND/OR/NOT, also provides custom type parsing by register mechanism that supports libraries like pydantic, attrs and dataclasses

  • pdfminer.six

    Community maintained fork of pdfminer - we fathom PDF

    Project mention: Code to extract text from pdf to excel | /r/Python | 2023-06-02

    I love to use PDFMiner and PDFQuery for this https://github.com/pdfminer/pdfminer.six https://towardsdatascience.com/scrape-data-from-pdf-files-using-python-and-pdfquery-d033721c3b28

  • WorkOS

    The modern API for authentication & user identity. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • sqlglot

    Python SQL Parser and Transpiler

    Project mention: Build the dependency graph of your BigQuery pipelines at no cost: a Python implementation | dev.to | 2024-01-11

    In the project we used Python lib networkx and a DiGraph object (Direct Graph). To detect a table reference in a Query, we use sqlglot, a SQL parser (among other things) that works well with Bigquery.

  • Lark

    Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

    Project mention: Show HN: I wrote a RDBMS (SQLite clone) from scratch in pure Python | news.ycombinator.com | 2023-08-13

    Lark supports, and recommends, writing and storing the grammar in a .lark file. We have syntax highlighting support in all major IDEs, and even in github itself. For example, here is Lark's built-in grammar for Python: https://github.com/lark-parser/lark/blob/master/lark/grammar...

    You can also test grammars "live" in our online IDE: https://www.lark-parser.org/ide/

    The rationale is that it's more terse and has less visual clutter than a DSL over Python, which makes it easier to read and write.

  • sqlparse

    A non-validating SQL parser module for Python

    Project mention: Show HN: Databasediagram.com – Private, Text to Entity-Relationship Diagram Tool | news.ycombinator.com | 2023-06-08

    Suggest checking out the sqlparse library for a way to do the different flavours without needing to address each case directly: https://github.com/andialbrecht/sqlparse

  • phonenumbers

    Python port of Google's libphonenumber

    Project mention: What are approaches for extracting phone numbers with different format from many sites? | /r/webscraping | 2023-04-02

    Did you try https://github.com/daviddrysdale/python-phonenumbers? You'll still need country code to parse local formats though. How many sites do you have?

  • oletools

    oletools - python tools to analyze MS OLE2 files (Structured Storage, Compound File Binary Format) and MS Office documents, for malware analysis, forensics and debugging.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • PLY

    Python Lex-Yacc

  • snoop

    Snoop — инструмент разведки на основе открытых данных (OSINT world)

    Project mention: Osint update of the Snoop Project tool search for user by nickname | news.ycombinator.com | 2024-01-02
  • rdflib

    RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.

  • m3u8

    Python m3u8 Parser for HTTP Live Streaming (HLS) Transmissions

  • python-user-agents

    A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

  • typeguard

    Run-time type checker for Python

  • strictyaml

    Type-safe YAML parser and validator.

    Project mention: StrictYAML | news.ycombinator.com | 2023-09-29
  • cinemagoer

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb (to which we are not affiliated in any way) movie database about movies, people, characters and companies

  • ViperMonkey

    A VBA parser and emulation engine to analyze malicious macros.

  • Construct

    Construct: Declarative data structures for python that allow symmetric parsing and building

  • guessit

    GuessIt is a python library that extracts as much information as possible from a video filename.

  • textX

    Domain-Specific Languages and parsers in Python made easy http://textx.github.io/textX/

  • mwparserfromhell

    A Python parser for MediaWiki wikicode

    Project mention: Processing Wikipedia Dumps With Python | /r/programming | 2023-05-18

    There's also https://github.com/earwig/mwparserfromhell, if you don't want to roll your own.

  • wiktextract

    Wiktionary dump file parser and multilingual data extractor

    Project mention: Wiktionary dump file parser and multilingual data extractor | news.ycombinator.com | 2023-04-02
  • crossplane

    Quick and reliable way to convert NGINX configurations into JSON and back. (by nginxinc)

  • python-nameparser

    A simple Python module for parsing human names into their individual components

  • Onboard AI

    ChatGPT with full context of any GitHub repo. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at app.getonboardai.com.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-15.

Python Parser related posts

Index

What are some of the best open-source Parser projects in Python? This list will help you:

Project Stars
1 pydantic 17,704
2 pdfminer.six 5,182
3 sqlglot 4,867
4 Lark 4,342
5 sqlparse 3,525
6 phonenumbers 3,375
7 oletools 2,687
8 PLY 2,667
9 snoop 2,587
10 rdflib 2,046
11 m3u8 1,869
12 python-user-agents 1,404
13 typeguard 1,403
14 strictyaml 1,385
15 cinemagoer 1,175
16 ViperMonkey 1,000
17 Construct 866
18 guessit 791
19 textX 746
20 mwparserfromhell 684
21 wiktextract 673
22 crossplane 664
23 python-nameparser 626
ChatGPT with full context of any GitHub repo.
Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at app.getonboardai.com.
app.getonboardai.com