Rust Data

Open-source Rust projects categorized as Data

Top 18 Rust Data Projects

  • prql

    PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

  • Project mention: Prolog language for PostgreSQL proof of concept | news.ycombinator.com | 2024-03-30
  • paradedb

    Postgres for Search and Analytics

  • Project mention: Using ClickHouse to scale an events engine | news.ycombinator.com | 2024-04-11
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • arroyo

    Distributed stream processing engine in Rust

  • Project mention: FLaNK AI Weekly 18 March 2024 | dev.to | 2024-03-18
  • dozer

    Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks. (by getdozer)

  • Project mention: Show HN: Find simple open source bounties to solve and get paid | news.ycombinator.com | 2023-08-19

    https://github.com/getdozer/dozer/issues/1631#issuecomment-1...

    and then something has gone off the rails about the accounting process since

      Trigger.dev

  • tensorbase

    TensorBase is a new big data warehousing with modern efforts.

  • nutype

    Rust newtype with guarantees 🇺🇦 🦀

  • Project mention: Nutype 0.4.0 released | /r/rust | 2023-11-23
  • orz

    a high performance, general purpose data compressor written in the crab-lang

  • Project mention: Compressor in Rust; compresses as good as “ZSTD -19”, while being 10 times fast | news.ycombinator.com | 2023-06-18
  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • hypergraph

    Hypergraph is data structure library to create a directed hypergraph in which a hyperedge can join any number of vertices.

  • TablaM

    The practical relational programing language for data-oriented applications

  • Project mention: YC's Latest Request for Startups | news.ycombinator.com | 2024-02-14

    > Very curious if anyone knows how to pull this off.

    I work in this space (small/mid-size).

    The good news is that there are several "obvious" ways to pull this off because an ERP is the culmination of everything a company needs and does. So almost anything you can imagine on the software is part of it.

    The bad news, and the reason everyone wants a solution, is that is truly a big space, and then you need E.V.E.R.Y.T.H.I.N.G.

    ---

    My take is to start from the bottom, and build a much better version of Access/FoxPro (https://tablam.org).

    Any medium/big ERP end being a specialized computing platform that needs:

    - A programming language

    - A database engine

    - An orchestration engine

    - ELT engine

    - Auth

    - UI/Report builders

    And to be clear: NONE of the "programming language", "database engine", etc are a good fit today.

    NONE.

    This is the big thing, This is the reason (from a tech POW only) that most attempts fail.

    This is the secret of why Cobol rule(d): Is all of this! but is too old! (also, this is why SQL still is best: Is almost this).

    ---

    So, to pull this off, you need a team that knows what is "missing" from our current tools, makes a well-integrated package, and adds a "user-friendly" interface in a way that is palatable for the kind of user that uses excel (powerfully).

    Is not that impossible. FoxPro was the best example of this kind of integrated solution.

    P.D: This is my life's dream, to make this truth!

  • rust-pgdatadiff

    Sequence & table data comparison between 2 PostgresQL databases

  • Project mention: Rust-pgdatadiff: A re-write of pgdatadiff in Rust | news.ycombinator.com | 2024-03-15
  • transparency-data

    U.S. Healthcare Transparency Data. Supplemental data for the CMS/HHS price transparency rules.

  • xvc

    A robust (🐢) and fast (🐇) MLOps tool for managing data and pipelines in Rust (🦀)

  • system-info-collector

    App to collect ram/cpu usage from OS and show it in pretty graphs

  • Project mention: [media] System Info Collector - Fast and easy-to-use cli application for collecting RAM and CPU usage information over time | /r/rust | 2023-07-09

    Repository - https://github.com/qarmin/system-info-collector Binaries - https://github.com/qarmin/system-info-collector/releases

  • csvsource

    Converts a CSV file to SQL Insert Statements.

  • rusqttbom

    RusQTTbom takes weather data from the Bureau of Meteorology (BOM) and publishes that data via MQTT messages.

  • raven

    RavenCol, Tabular data manipulation in Rust (by irvingfisica)

  • server

    REST API for Gico application. It's part of Database class project (by gico-net)

  • xorsum

    Get XOR hash/digest with this command-line tool

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-04-11.

Rust Data related posts

Index

What are some of the best open-source Data projects in Rust? This list will help you:

Project Stars
1 prql 9,414
2 paradedb 3,756
3 arroyo 3,257
4 dozer 1,440
5 tensorbase 1,423
6 nutype 1,189
7 orz 794
8 hypergraph 267
9 TablaM 183
10 rust-pgdatadiff 54
11 transparency-data 27
12 xvc 22
13 system-info-collector 9
14 csvsource 8
15 rusqttbom 5
16 raven 2
17 server 1
18 xorsum 0
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com