Parser Library

Top 23 Parser Library Open-Source Projects

  • parser

    ๐Ÿ“œ Extract meaningful content from the chaos of a web page

  • Project mention: Show HN: I made a tool to clean and convert any webpage to Markdown | news.ycombinator.com | 2024-04-14

    Thoroughly scraping is challenging, especially in an environment where you donโ€™t have (or want) a JavaScript runtime.

    For content extraction, I found the approach the Postlight library takes quite neat. It scores individual html nodes based on some heuristics (text length, link density, css classes). It the selects the nodes with the highest score. [1] I ported it to Swift for a personal read later app.

    [1] https://github.com/postlight/parser

  • Lark

    Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

  • Project mention: Show HN: I wrote a RDBMS (SQLite clone) from scratch in pure Python | news.ycombinator.com | 2023-08-13

    Lark supports, and recommends, writing and storing the grammar in a .lark file. We have syntax highlighting support in all major IDEs, and even in github itself. For example, here is Lark's built-in grammar for Python: https://github.com/lark-parser/lark/blob/master/lark/grammar...

    You can also test grammars "live" in our online IDE: https://www.lark-parser.org/ide/

    The rationale is that it's more terse and has less visual clutter than a DSL over Python, which makes it easier to read and write.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • participle

    A parser library for Go

  • Project mention: Parser generator | /r/golang | 2023-09-14

    I know of tools like Kaitai that do essentially what I want, just curious about my options. I guess something like https://github.com/alecthomas/participle could work to parse the format, then I'd have to generate code based on the parsed data or something

  • Chevrotain

    Parser Building Toolkit for JavaScript

  • Project mention: Ohm: A library and language for building parsers, interpreters, compilers, etc. | news.ycombinator.com | 2023-10-31

    How does this compare with Chevrotain[1]?

    More specifically, can I build lexers with Ohm? Can it generate a syntax diagram from a grammar?

    [1]: https://github.com/chevrotain/chevrotain

  • Sprache

    A tiny, friendly, C# parser construction library

  • Recognizers-Text

    Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV). Packages available at: https://www.nuget.org/profiles/Recognizers.Text, https://www.npmjs.com/~recognizers.text

  • cppast

    Library to parse and work with the C++ AST

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • Superpower

    A C# parser construction toolkit with high-quality error reporting

  • Project mention: I made a new filtering library for .NET projects | /r/dotnet | 2023-05-01

    Awesome little project you got going on there :) , I've done something similar with Ooze and extracted it to Ooze.Query later. I need to push them to nuget also when I catch some time (I jump back to it from time to time when I have some free space). Cool to see another person using parser construction libs. I see you used Sprache, I went for SuperPower for this situation.

  • megaparsec

    Industrial-strength monadic parser combinator library

  • Pidgin

    A lightweight and fast parsing library for C#.

  • Project mention: Library to parse slash commands with validation? | /r/dotnet | 2023-06-02

    pigeon https://github.com/benjamin-hodgson/Pidgin

  • BioJava

    :book::microscope::coffee: BioJava is an open-source project dedicated to providing a Java library for processing biological data.

  • Project mention: Outperforming Rust DNA sequence parsing benchmarks by 50% with Mojo | news.ycombinator.com | 2024-02-08

    Here is a cache of valid and invalid FASTQ files for unit tests

    https://github.com/biojava/biojava/tree/master/biojava-genom...

  • FParsec

    A parser combinator library for F#

  • dart-petitparser

    Dynamic parser combinators in Dart.

  • scpi-parser

    Open Source SCPI device library

  • TatSu

    ็ซœ TatSu generates Python parsers from grammars in a variation of EBNF

  • mecha

    A parser combinator library for Zig (by Hejsil)

  • gronx

    Lightweight, fast and dependency-free Cron expression parser (due checker, next/prev due date finder), task runner, job scheduler and/or daemon for Golang (tested on v1.13+) and standalone usage. If you are bold, use it to replace crontab entirely.

  • CSLY

    a C# embeddable lexer and parser generator (.Net core)

  • uriparser

    :hocho: Strictly RFC 3986 compliant URI parsing and handling library written in C89; moved from SourceForge to GitHub

  • Ksoup

    Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.

  • Project mention: Compose Rich Text Editor 0.2.0 released, with a lot of new features | /r/Kotlin | 2023-05-22

    In this version 0.2.0 I added HTML support. Markdown support is coming to 0.3.0. Since the library is multiplatform and there's no Multiplatform HTML or Markdown parsers available, I built my own multiplatform parsing library which is Ksoup and for now it only supports HTML https://github.com/MohamedRejeb/Ksoup

  • autumn

    A Java parser combinator library written with an unmatched feature set.

  • parse_it

    A python library for parsing multiple types of config files, envvars & command line arguments that takes the headache out of setting app configurations.

  • booleval

    Header-only C++17 library for evaluating logical expressions.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Parser Library related posts

Index

What are some of the best open-source Parser Library projects? This list will help you:

Project Stars
1 parser 5,245
2 Lark 4,481
3 participle 3,312
4 Chevrotain 2,397
5 Sprache 2,271
6 Recognizers-Text 1,645
7 cppast 1,636
8 Superpower 968
9 megaparsec 892
10 Pidgin 828
11 BioJava 575
12 FParsec 508
13 dart-petitparser 444
14 scpi-parser 433
15 TatSu 391
16 mecha 392
17 gronx 373
18 CSLY 334
19 uriparser 311
20 Ksoup 312
21 autumn 119
22 parse_it 107
23 booleval 81

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com