Parser

Open-source projects categorized as Parser

Top 23 Parser Open-Source Projects

  • marked

    A markdown parser and compiler. Built for speed.

  • Project mention: Eleventy vs. Next.js for static site generation | dev.to | 2023-12-14

    Next, install gray-matter to extract metadata from the front matter of markdown files, and marked to convert the markdown files to HTML:

  • swc

    Rust-based platform for the Web

  • Project mention: Rustify your JavaScript tooling | dev.to | 2024-05-21

    A big part of my work revolves around JavaScript tooling, and as such it's important to keep an eye on the ecosystem and see where things are going. It's no secret that recently lots of projects are native-ying (??) parts of their codebase, or even rewriting them to native languages altogether. Esbuild is one of the first popular and successful examples of this, which was written in Go. Other examples are Rspack and Turbopack, which are both Rust-based alternatives to Webpack, powered by SWC ("Speedy Web Compiler"). There's also Rolldown, a Rust-based alternative to Rollup powered by OXC ("The JavaScript Oxidation Compiler"), but Rollup itself is also native-ying (??) parts of their codebase and recently started using SWC for parts of their codebase. And finally, there are Oxlint (powered by OXC) and Biome as Rust-based alternatives for Eslint and Prettier respectively.

  • PostCSS

    Transforming styles with JS plugins

  • Project mention: PostCSS - my initial experience | dev.to | 2024-01-11

    the plugins on the official PostCSS website were as old as IE6 or the marquee tag, and ...

  • cheerio

    The fast, flexible, and elegant library for parsing and manipulating HTML and XML.

  • Project mention: 8 NPM Packages for JavaScript Beginners [2024][+tutorials] | dev.to | 2024-04-02

    Cheerio is your ticket to the world of server-side magic, allowing you to manipulate HTML and XML documents with jQuery-like syntax. It's perfect for web scraping, data extraction, or just making sense of the mess that is web content. With Cheerio, you get to play around with the DOM, use CSS selectors, and basically do all the cool things you'd do in the browser, but server-side.

  • pydantic

    Data validation using Python type hints

  • Project mention: Advanced RAG with guided generation | dev.to | 2024-04-18

    First, note the method prefix_allowed_tokens_fn. This method applies a Pydantic model to constrain/guide how the LLM generates tokens. Next, see how that constraint can be applied to txtai's LLM pipeline.

  • PHP Parser

    A PHP parser written in PHP

  • Project mention: PHP-Parser: A PHP parser written in PHP | news.ycombinator.com | 2024-03-06
  • tree-sitter

    An incremental parsing system for programming tools

  • Project mention: Lezer: A Parsing System for CodeMirror, Inspired by Tree-Sitter | news.ycombinator.com | 2024-03-24

    I learned from a Google search that these days upstream tree-sitter provides WebAssembly bindings.

    Source: https://github.com/tree-sitter/tree-sitter/tree/master/lib/b...

    NPM: https://www.npmjs.com/package/web-tree-sitter

    Download from the latest GitHub release: js file (https://github.com/tree-sitter/tree-sitter/releases/download...) and wasm file (https://github.com/tree-sitter/tree-sitter/releases/download...)

  • vector

    A high-performance observability data pipeline.

  • Project mention: What is a low/reasonable cost solution for service log storage and querying? | news.ycombinator.com | 2024-05-05

    I am thinking about using https://vector.dev/ but would also love opinions on the best deal for lower or reasonable cost storage/querying of logs. Thanks!

  • Parsedown

    Better Markdown Parser in PHP

  • Project mention: Parsedown: Better Markdown Parser in PHP | news.ycombinator.com | 2024-01-05
  • jsoniter

    A high-performance 100% compatible drop-in replacement of "encoding/json" (by json-iterator)

  • Project mention: Handling high-traffic HTTP requests with JSON payloads | /r/golang | 2023-12-07

    Since most of the time would be spent decoding JSON, you could try to cut this time using https://github.com/bytedance/sonic or https://github.com/json-iterator/go. Both are drop-in replacements for the stdlib; sonic is faster.

  • jsoup

    jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.

  • Project mention: FLaNK Stack Weekly for 20 June 2023 | dev.to | 2023-06-20
  • nom

    Rust parser combinator framework

  • Project mention: Planespotting with Rust: using nom to parse ADS-B messages | dev.to | 2023-10-28

    Just in case you are not familiar with nom, it is a parser combinator library written in Rust. The most basic thing you can do with it is import one of its parsing functions, give it some byte or string input, and get back a Result containing either the parsed value together with the rest of the input, or an error if the parser failed. tag, for example, is used to recognize literal character/byte sequences.
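
    The idea in the excerpt above can be sketched without nom itself. The following is a minimal, illustrative Python analogue, not nom's API: the names ParseError, tag, and pair are hypothetical helpers written for this sketch. Each parser takes a string and either returns the rest of the input together with the parsed value, or raises an error when it does not match.

```python
from typing import Any, Callable, Tuple

# A parser maps input text to (remaining_input, parsed_value).
# This shape is an assumption chosen to echo the excerpt; it is not nom's API.
Parser = Callable[[str], Tuple[str, Any]]


class ParseError(Exception):
    """Raised when a parser does not match the start of its input."""


def tag(literal: str) -> Parser:
    """Build a parser that recognizes an exact literal at the start of the input."""
    def parse(text: str) -> Tuple[str, Any]:
        if text.startswith(literal):
            # Success: hand back the unconsumed input and the matched literal.
            return text[len(literal):], literal
        raise ParseError(f"expected {literal!r}, found {text[:len(literal)]!r}")
    return parse


def pair(first: Parser, second: Parser) -> Parser:
    """Combine two parsers: run `first`, then run `second` on what is left."""
    def parse(text: str) -> Tuple[str, Any]:
        rest, a = first(text)
        rest, b = second(rest)
        return rest, (a, b)
    return parse


if __name__ == "__main__":
    hello = tag("Hello")
    print(hello("Hello, world"))                  # (', world', 'Hello')
    print(pair(hello, tag(", "))("Hello, world"))  # ('world', ('Hello', ', '))
    try:
        hello("Goodbye")
    except ParseError as err:
        print("parse failed:", err)
```

    Returning the remaining input alongside the value is what lets small parsers be chained into larger ones, as pair does here.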

  • oxc

    ⚓ A collection of JavaScript tools written in Rust.

  • Project mention: Rustify your JavaScript tooling | dev.to | 2024-05-21


  • terser

    🗜 JavaScript parser, mangler and compressor toolkit for ES6+

  • Project mention: How I use Devbox in my Elm projects | dev.to | 2024-05-02

    These projects use Caddy as my local development server, Dart Sass for converting my Sass files to CSS, elm, elm-format, elm-optimize-level-2, elm-review, elm-test (only in Calculator), ShellCheck to find bugs in my shell scripts, and Terser to mangle and compress JavaScript code.

  • Crafting Interpreters

    Repository for the book "Crafting Interpreters"

  • Project mention: Ask HN: Creating a language/runtime for fun, is this idea dumb or not | news.ycombinator.com | 2024-05-23

    Dumb idea? No way. Actually, I'm currently reading Crafting Interpreters which is exactly this. It's free to read online, if you wanted to check it out: https://www.craftinginterpreters.com/

    Super fun book.

  • esprima

    ECMAScript parsing infrastructure for multipurpose analysis

  • Project mention: ESLint: under the hood | dev.to | 2023-11-07

    Focusing again on ESLint, the parser used by the linter is called Espree. This is an in-house parser built by the ESLint folks to fully support ECMAScript 6 and JSX on top of the already existing Esprima. The Espree module provides APIs for both tokenization and parsing that you can easily test out.

  • sh

    A shell parser, formatter, and interpreter with bash support; includes shfmt (by mvdan)

  • Project mention: Show HN: Hucksh – A Shell with a Good Memory | news.ycombinator.com | 2023-12-21

    * The shell itself is https://github.com/mvdan/sh, a bash-like command interpreter

  • lightningcss

    An extremely fast CSS parser, transformer, bundler, and minifier written in Rust.

  • Project mention: I'm fed up with it, so I'm writing a browser | news.ycombinator.com | 2023-09-22

    Would you consider using some libraries in your project? There are lots of good ones in the Rust ecosystem, and many of them are not part of any existing browsers.

    For example:

    - https://github.com/servo/html5ever (HTML parsing - note: this is used in Servo)

    - https://github.com/parcel-bundler/lightningcss (CSS parsing)

    - https://github.com/DioxusLabs/taffy (web layout)

    - https://github.com/pop-os/cosmic-text (text layout and rendering)

    Obviously you should be free to work on whatever you like, but just as a benchmark on the scope of your project: I spent ~6 months implementing just the CSS Grid algorithm in Taffy last year. An entire browser from literal scratch is probably a 10 year project for one person.

  • astexplorer

    A web tool to explore the ASTs generated by various parsers.

  • Project mention: ESLint Plugin. What was missed in the doc? | dev.to | 2024-05-09

    - https://astexplorer.net - a helpful tool to see the AST version of your code

    - ESLint Selectors - the list of selectors

    - typescript parser playground - astexplorer analogue for TypeScript. It has an essential feature: ESQuery filter. It allows you to debug your selectors in real time.

  • remarkable

    Markdown parser, done right. Commonmark support, extensions, syntax plugins, high speed - all in one. Gulp and metalsmith plugins available. Used by Facebook, Docusaurus and many others! Use https://github.com/breakdance/breakdance for HTML-to-markdown conversion. Use https://github.com/jonschlinkert/markdown-toc to generate a table of contents.

  • sqlglot

    Python SQL Parser and Transpiler

  • Project mention: The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol | news.ycombinator.com | 2024-04-26

    This is probably referring to "zero changes to your driver code" and not "zero changes to the SQL you send over this driver".

    Translating between SQL dialects is notoriously hard, and attempts to translate [1] work in 95% of cases. But the last 5% would require 5x the amount of work. That's because "SQL dialect" also includes weird edge cases of type inference of things like COALESCE(5, FALSE) and emulation of system catalogs (pg_catalog, information_schema).

    [1] https://github.com/tobymao/sqlglot

  • pdfminer.six

    Community maintained fork of pdfminer - we fathom PDF

  • Project mention: Code to extract text from pdf to excel | /r/Python | 2023-06-02

    I love to use PDFMiner and PDFQuery for this https://github.com/pdfminer/pdfminer.six https://towardsdatascience.com/scrape-data-from-pdf-files-using-python-and-pdfquery-d033721c3b28

  • body-parser

    Node.js body parsing middleware

  • Project mention: NodeJS Security Best Practices | dev.to | 2024-02-19

    Using body-parser you can set the limit on the size of the payload

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Parser related posts

  • Ask HN: Creating a language/runtime for fun, is this idea dumb or not

    1 project | news.ycombinator.com | 23 May 2024
  • Mastering JSX Editing in Emacs with Tree-sitter

    1 project | dev.to | 15 May 2024
  • Trip C++Now 2024 – think-cell

    4 projects | news.ycombinator.com | 10 May 2024
  • ESLint Plugin. What was missed in the doc?

    2 projects | dev.to | 9 May 2024
  • The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol

    1 project | news.ycombinator.com | 26 Apr 2024
  • Advanced RAG with guided generation

    2 projects | dev.to | 18 Apr 2024
  • Understanding Code Structure: A Beginner's Guide to Tree-sitter

    2 projects | dev.to | 6 Apr 2024

Index

What are some of the best open-source Parser projects? This list will help you:

 #  Project                 Stars
 1  marked                 32,042
 2  swc                    30,156
 3  PostCSS                28,238
 4  cheerio                27,879
 5  pydantic               19,049
 6  PHP Parser             16,849
 7  tree-sitter            16,754
 8  vector                 16,710
 9  Parsedown              14,665
10  jsoniter               13,108
11  jsoup                  10,709
12  nom                     9,071
13  oxc                     9,172
14  terser                  8,449
15  Crafting Interpreters   8,248
16  esprima                 6,962
17  sh                      6,859
18  lightningcss            6,017
19  astexplorer             5,995
20  remarkable              5,676
21  sqlglot                 5,734
22  pdfminer.six            5,514
23  body-parser             5,383
