dbd VS data-toolset

Compare dbd vs data-toolset and see what are their differences.

dbd

dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. (by zsvoboda)

data-toolset

Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package. (by luminousmen)
Sevalla - Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
dbd data-toolset
4 1
57 3
- -
0.0 6.8
over 3 years ago over 1 year ago
Python Python
BSD 3-clause "New" or "Revised" License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

dbd

Posts with mentions or reviews of dbd. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-01-23.
  • Easy loading Kaggle dataset to a database
    2 projects | /r/datascience | 23 Jan 2022
    I've created two examples of how to use the dbd tool to load Kaggle dataset data files (csv, json, xls, parquet) to your Postgres, MySQL, or SQLite database.Basically, you don't have to create any tables, nor run any SQL INSERT or COPY statements. Everything is automated. You just reference the datasets and files with a URL and execute a 'dbd run' command.The examples are here. Perhaps you find it useful. Let me know, what you think!
  • Easy loading dataset files to a database
    2 projects | /r/kaggle | 23 Jan 2022
    I've created two examples of how to use the [dbd](https://github.com/zsvoboda/dbd) tool to load Kaggle dataset data files (csv, json, xls, parquet) to your Postgres, MySQL, or SQLite database.
  • dbd: create your database from data files on your directory
    1 project | /r/SQL | 15 Jan 2022
    I work on the new open-sourced tool called dbd that enables you to load data from your local data files to your database and transform it using insert-from-select statements. The tool supports templating (Jinja2). It works with Postgres, MySQL, SQLite, Snowflake, Redshift, and BigQuery.
  • New opensource ELT tool
    1 project | /r/dataengineering | 9 Jan 2022
    I was looking for some declarative ELT tool for creating my analytics solutions, and DBT was the closest I've found. I liked its concept, but I came across quite a few limitations when I wanted to use it. I couldn't specify and create basic things like data types, indexes, primary/foreign keys, etc. In the end, I decided to implement my own - more straightforward and more flexible. I've published the result - dbd on GitHub. Perhaps, you can find it helpful. Your feedback is greatly appreciated!

data-toolset

Posts with mentions or reviews of data-toolset. We have used some of these posts to build our list of alternatives and similar projects.

What are some alternatives?

When comparing dbd and data-toolset you can also consider the following projects:

sqlmesh - Scalable and efficient data transformation framework - backwards compatible with dbt.

rill - Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.

ethereum-etl - Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ

pystore - Fast data store for Pandas time-series data

SnowDDL - Declarative-style object management tool for Snowflake.

prql-query - Query and transform data with PRQL

Sevalla - Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured

Did you know that Python is
the 2nd most popular programming language
based on number of references?