simdjson

Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks (by simdjson)

Simdjson Alternatives

Similar projects and alternatives to simdjson

  1. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  2. llvm-project

    The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

  3. docs

    Hardware and software docs / wiki (by AsahiLinux)

  4. coreutils

    upstream mirror (by coreutils)

  5. json

    93 simdjson VS json

    JSON for Modern C++

  6. Apache Arrow

    Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

  7. Cap'n Proto

    Cap'n Proto serialization/RPC system - core tools and C++ library

  8. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  9. q

    46 simdjson VS q

    q - Run SQL directly on delimited files and multi-file sqlite databases (by harelba)

  10. webui

    Use any web browser or WebView as GUI, with your preferred language in the backend and modern web technologies in the frontend, all in a lightweight portable library.

  11. sonic

    24 simdjson VS sonic

    A blazingly fast JSON serializing & deserializing library (by bytedance)

  12. RapidJSON

    A fast JSON parser/generator for C++ with both SAX/DOM style API

  13. ojg

    Optimized JSON for Go

  14. simdutf8

    SIMD-accelerated UTF-8 validation for Rust.

  15. pocorgtfo

    a "Proof of Concept or GTFO" mirror with an extensive index with also whole issues or individual articles as clean PDFs.

  16. nativejson-benchmark

    C/C++ JSON parser/generator benchmark

  17. encoding_rs

    A Gecko-oriented implementation of the Encoding Standard in Rust

  18. jsoniter

    3 simdjson VS jsoniter

    jsoniter (json-iterator) is fast and flexible JSON parser available in Java and Go (by json-iterator)

  19. simdjson-go

    Golang port of simdjson: parsing gigabytes of JSON per second

  20. pysimdjson

    Python bindings for the simdjson project.

  21. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better simdjson alternative or higher similarity.

simdjson discussion

Log in or Post with

simdjson reviews and mentions

Posts with mentions or reviews of simdjson. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-09-17.
  • User-Space Interrupts (2021)
    2 projects | news.ycombinator.com | 17 Sep 2024
    AVX-512 is on Zen5. Ultra-popular libraries like simdjson can use it. https://github.com/simdjson/simdjson/issues/10

    In general my hope is that runtimes pick up the good stuff & roll with it. Io_uring hasn't exactly been a stunning success on nidejs/libuv but the promise is so real that runtimes can take sweet io capabilities like io_uring or usersoace interrupts & boom, now everyone's ok is faster.

  • Wc2: Investigates optimizing 'wc', the Unix word count program
    15 projects | news.ycombinator.com | 20 Jun 2024
    State machines are great for complex situations, but when it comes to performance, it's not at all clear to me that they're the most scalable approach with modern systems.

    The data dependency between a loop iteration for each character might be pipelined really well when executed, and we can assume large enough L1/L2 cache for our lookup tables. But we're still using at least one lookup per character.

    Projects like https://github.com/simdjson/simdjson?tab=readme-ov-file#abou... are truly fascinating, because they're based on SIMD instructions that can process 64 or more bytes with a single instruction. Very much worth checking out the papers at the link above.

  • Scan HTML faster with SIMD instructions – Chrome edition
    7 projects | news.ycombinator.com | 13 Jun 2024
    Can you point to some of these benchmarks? https://news.ycombinator.com/item?id=26934854 suggests that in at least one synthetic benchmark (with a 7.5KB protobuf message which expands to a 17KB JSON payload), protobuf parsing at 2GB/s would be comparable to JSON parsing at 5GB/s.

    Meanwhile, simdjson's numbers (https://github.com/simdjson/simdjson/blob/master/doc/gbps.pn...) show a peak parsing speed of about 3GB/s depending on the workload. Of course, it's not clear you can compare these directly, since they were probably not run on systems with comparable specs. But it's not clear to me that there's a 5x difference.

    Perhaps my experience differs because I'm used to seeing very large messages being passed around, but I'd be happy to reconsider. (Or maybe I should go all-in on Cap'n Proto.)

  • SIMD < SIMT < SMT: Parallelism in Nvidia GPUs (2011)
    1 project | news.ycombinator.com | 11 Jun 2024
    I intentionally said "more towards embarrassingly parallel" rather than "only embarrassingly parallel". I don't think there's a hard cutoff, but there is a qualitative difference. One example that springs to mind is https://github.com/simdjson/simdjson - afaik there's no similarly mature GPU-based JSON parsing.
  • The Simdjson Library
    4 projects | news.ycombinator.com | 3 Jun 2024
  • Tips on adding JSON output to your command line utility. (2021)
    2 projects | news.ycombinator.com | 20 Apr 2024
    It's also supported by simdjson [0] (which has a lot of language bindings [1]):

    > Multithreaded processing of gigantic Newline-Delimited JSON (ndjson) and related formats at 3.5 GB/s

    [0] https://simdjson.org/

    [0] https://github.com/simdjson/simdjson?tab=readme-ov-file#bind...

  • 1BRC Merykitty's Magic SWAR: 8 Lines of Code Explained in 3k Words
    4 projects | news.ycombinator.com | 9 Mar 2024
  • Training great LLMs from ground zero in the wilderness as a startup
    3 projects | news.ycombinator.com | 6 Mar 2024
  • simdjson: Parsing Gigabytes of JSON per Second
    1 project | news.ycombinator.com | 23 Jan 2024
  • Use any web browser as GUI, with Zig in the back end and HTML5 in the front end
    17 projects | news.ycombinator.com | 1 Jan 2024
    String parsing is negligible compared to the speed of the DOM which is glacially slow: https://news.ycombinator.com/item?id=38835920

    Come on, people, make an effort to learn how insanely fast computers are, and how insanely inefficient our software is.

    String parsing can be done at gigabytes per second: https://github.com/simdjson/simdjson If you think that is the slowest operation in the browser, please find some resources that talk about what is actually happening in the browser?

  • A note from our sponsor - SaaSHub
    www.saashub.com | 11 Feb 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic simdjson repo stats
70
19,734
8.9
2 days ago

simdjson/simdjson is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of simdjson is C++.


Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that C++ is
the 7th most popular programming language
based on number of references?