SaaSHub helps you find the best software and product alternatives Learn more β
Top 11 Python Parse Projects
-
Lark
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
-
InfluxDB
InfluxDB β Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
tika-python
Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.
-
probablepeople
:family: a python library for parsing unstructured western names into name components.
-
Project mention: Show HN: I made a website to semantically search ArXiv papers | news.ycombinator.com | 2024-12-24
Excellent project.
As mentioned in another comment, I've put together an embeddings database using the arxiv dataset (https://huggingface.co/NeuML/txtai-arxiv) recently.
For those interested in the literature search space, a couple other projects I've worked on that may be of interest.
annotateai (https://github.com/neuml/annotateai) - Annotates papers with LLMs. Supports searching the arxiv database mentioned above.
paperai (https://github.com/neuml/paperai) - Semantic search and workflows for medical/scientific papers. Built on txtai (https://github.com/neuml/txtai)
paperetl (https://github.com/neuml/paperetl) - ETL processes for medical and scientific papers. Supports full PDF docs.
-
-
-
ODBParser
OSINT tool to search, parse and dump only the open Elasticsearch and MongoDB directories that have the data you care about exposing
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
-
booze-tools
Booze Tools will become the complete programming-language development workbench, all written in Python 3.9 (for now).
-
pygame_aseprite_animator
A python package to allow .ase and .aseprite files to be loaded into pygame
Python Parse discussion
Python Parse related posts
-
Flattening ASTs (and Other Compiler Data Structures)
-
Is it possible to propagate higher level constructs (+, *) to the generated parse tree in an LR-style parser?
-
Parse research papers into a structured dataset
-
ETL for medical and scientific papers
-
ETL for medical and scientific papers
-
ETL for medical and scientific papers
-
Show HN: ETL for Medical and Scientific Papers
-
A note from our sponsor - SaaSHub
www.saashub.com | 21 Jun 2025
Index
What are some of the best open-source Parse projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | Lark | 5,340 |
2 | tika-python | 1,590 |
3 | probablepeople | 606 |
4 | paperetl | 385 |
5 | themer | 305 |
6 | Path-of-Accounting | 103 |
7 | ODBParser | 46 |
8 | sd-parsers | 35 |
9 | hivetools | 20 |
10 | booze-tools | 16 |
11 | pygame_aseprite_animator | 11 |