data-transformation

Open-source projects categorized as data-transformation

Top 21 data-transformation Open-Source Projects

  • prql

    PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

  • Project mention: Prolog language for PostgreSQL proof of concept | news.ycombinator.com | 2024-03-30
  • Mage

    🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai

  • Project mention: FLaNK AI-April 22, 2024 | dev.to | 2024-04-22
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • glom

    ☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

  • Project mention: Ask HN: How can I get better at writing production-level Python? | news.ycombinator.com | 2023-07-18
  • Optimus

    :truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark (by ironmussa)

  • pglogical

    Logical Replication extension for PostgreSQL 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.

  • zingg

    Scalable identity resolution, entity resolution, data mastering and deduplication using ML

  • optimus

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management. (by raystack)

  • Project mention: Data Engineering Tools in Go | /r/dataengineering | 2023-05-18

    You can check odpf github, they created some dataops tools using go, one of the example is optimus (https://github.com/odpf/optimus) which is a data pipeline orchestrator

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • prose

    Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK. (by microsoft)

  • Porter

    :lipstick: Durable and asynchronous data imports for consuming data at scale and publishing testable SDKs. (by ScriptFUSION)

  • collapse

    Advanced and Fast Data Transformation in R (by SebKrantz)

  • sqawk

    Like awk but with SQL and table joins

  • fastverse

    An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R

  • clojure-dsl-resources

    A curated list of Clojure resources for dealing with domain-specific languages.

  • Project mention: Let's write a simple microservice in Clojure | dev.to | 2024-04-26

    Compojure's DSL for web applications makes it easy to set up REST API routes with corresponding HTTP methods. Adding a Swagger API descriptor through libraries like ring-swagger provides a visual interface for interacting with the API and enables client code generation. You can use the Prismatic schema library for HTTP request validation and data coercing to ensure the API consumes and produces data that conforms to predefined schemas. Compojure's middleware approach allows for modular and reusable components that can handle cross-cutting concerns like authentication, logging, and request/response transformations, enhancing the API's scalability and maintainability.

  • cq

    Clojure Command-line Data Processor for JSON, YAML, EDN, XML and more (by markus-wa)

  • daany

    Daany - .NET DAta ANalYtics .NET library with the implementation of DataFrame, Time series decompositions and Linear Algebra routines BLASS and LAPACK.

  • data-lens

    Functional utilities for Common Lisp

  • Unquery

    Command line query tool for JSON files

  • fragments

    Transform and compose data for HTTP transactions.

  • jsonata-playbook

    practical examples of jsonata [go-jsonata 1.5.4]

  • cuphic

    Transform or scrape Hiccup with a declarative DSL.

  • sqlite-wf

    Simple visual ETL tool

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

data-transformation related posts

Index

What are some of the best open-source data-transformation projects? This list will help you:

Project Stars
1 prql 9,427
2 Mage 7,001
3 glom 1,825
4 Optimus 1,446
5 pglogical 935
6 zingg 877
7 optimus 737
8 prose 612
9 Porter 608
10 collapse 599
11 sqawk 308
12 fastverse 213
13 clojure-dsl-resources 170
14 cq 152
15 daany 54
16 data-lens 30
17 Unquery 16
18 fragments 14
19 jsonata-playbook 5
20 cuphic 4
21 sqlite-wf 2

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com