duckdf vs ibis

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

duckdf		ibis
	Project
3	Mentions	23
41	Stars	4,241
-	Growth	6.5%
0.0	Activity	10.0
4 months ago	Latest Commit	3 days ago
R	Language	Python
GNU General Public License v3.0 only	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

duckdf

Posts with mentions or reviews of duckdf. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-10.

DuckDB – in-process SQL OLAP database management system
4 projects | news.ycombinator.com | 10 Feb 2023

Quite a while ago, when duckdb was just a duckling, I wrote an R package that supported direct manipulation of R dataframes using SQL.[1] duckdb was the engine for this.
The approach was never as fast as data.table but did approach the speed of dplyr for more complex queries.
Life had other things in store for me and I haven’t touched this library for a while now.
At the time there was no Julia connector for duckdb, but now that there is, I’d like to try this approach in that language.
[1] https://github.com/phillc73/duckdf
ClickHouse as an alternative to Elasticsearch for log storage and analysis
13 projects | news.ycombinator.com | 2 Mar 2021

Yeah, I agree sqldf is quite slow. Fair point.
As you've seen, duckdb registers an "R data frame as a virtual table." I'm not sure what they mean by "yet" either.
Of course it is possible to write an R dataframe to an on-disk duckdb table, if that's what you want to do.
There are some simple benchmarks on the bottom of the duckdf README[1]. Essentially I found for basic SQL SELECT queries, dplyr is quicker, but for much more complex queries, the duckdf/duckdb combination performs better.
If you really want speed of course, just use data.table.
[1] https://github.com/phillc73/duckdf
Julia 1.6: what has changed since Julia 1.0?
9 projects | news.ycombinator.com | 14 Feb 2021

That's a really good point that I'd not really thought about. I'd never really considered the difference between calling just functions versus macros.
Thinking about Query.jl and DataFramesMeta.jl, and I am for sure not an expert in either, I can't specifically speak to your `head` example, but other base functions can be combined with macros. For example, see the LINQ examples from DataFramesMeta.jl[1] where `mean` is being used. Or again the LINQ style examples in Query.jl[2], where `descending` is used in the first example, or `length` later in the Grouping examples.
Is that the kind of thing you meant?
For whatever reason, with the way my brain is wired, the LINQ style of query just works for me. I have never directly used LINQ, but do have some SQL experience. In fact, I wrote some dinky little wrapper functions[3] around duckdb[4] so I could directly query R dataframes and datatables with SQL using that backend, rather than sqldf[5].
[1] https://juliadata.github.io/DataFramesMeta.jl/stable/#@linq-...
[2] https://www.queryverse.org/Query.jl/stable/linqquerycommands...
[3] https://github.com/phillc73/duckdf
[4] https://duckdb.org/
[5] https://cran.r-project.org/web/packages/sqldf/index.html

ibis

Posts with mentions or reviews of ibis. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-17.

Show HN: Hashquery, a Python library for defining reusable analysis
1 project | news.ycombinator.com | 23 Apr 2024

I really don't understand the appeal of dbt vs a proper programming language. The templating approach leads to massive spaghetti. I look forward to trying out something like Ibis [0]
0: https://ibis-project.org/
This Week In Python
5 projects | dev.to | 17 Mar 2024

ibis – portable Python dataframe library
Ibis: The portable Python dataframe library
1 project | news.ycombinator.com | 13 Mar 2024

1 project | news.ycombinator.com | 22 Feb 2024
FLaNK Stack 26 February 2024
50 projects | dev.to | 26 Feb 2024
Quarto
5 projects | news.ycombinator.com | 14 Feb 2024

The main benefit is that you get a Python (or R, Julia or Rust) interpreter. So you can evaluate code. A good example of the value of this is the Ibis docs which use Quarto: https://ibis-project.org/
Polars – A bird's eye view of Polars
4 projects | news.ycombinator.com | 13 Feb 2024

Ive found polars quite intuitive, though for python, I lean more towards [ibis](https://ibis-project.org/). The interface is nearly identical, but ibis has the benefit if building sql queries before pulling any actual data (like dbplyr) — whereas polars requires the data to be in-memory (at least for rdb’s, though correct me if Im wrong)
this to me seems like a good argument for only using ibis, but Im happy to be convinced otherwise
Ibis – Universal Interface for Data Wrangling
1 project | news.ycombinator.com | 13 Feb 2024
Vanna.ai: Chat with your SQL database
13 projects | news.ycombinator.com | 14 Jan 2024

Please add Ibis Birdbrain https://ibis-project.github.io/ibis-birdbrain/ to the list. Birdbrain is an AI-powered data bot, built on Ibis and Marvin, supporting more than 18 database backends.
See https://github.com/ibis-project/ibis and https://ibis-project.org for more details.
Ibis
1 project | news.ycombinator.com | 10 Jan 2024

What are some alternatives?

When comparing duckdf and ibis you can also consider the following projects:

tidyquery - Query R data frames with SQL

snowflake-connector-python - Snowflake Connector for Python

Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

PySpark-Boilerplate - A boilerplate for writing PySpark Jobs

julia - The Julia Programming Language

Apache Impala - Apache Impala

loki - Like Prometheus, but for logs.

pangres - SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features

Makie.jl - Interactive data visualizations and plotting in Julia

sqlite_scanner - DuckDB extension to read and write to SQLite databases

MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

katacoda

duckdf vs tidyquery ibis vs snowflake-connector-python duckdf vs Typesense ibis vs PySpark-Boilerplate duckdf vs julia ibis vs Apache Impala duckdf vs loki ibis vs pangres duckdf vs Makie.jl ibis vs sqlite_scanner duckdf vs MeiliSearch ibis vs katacoda

Compare duckdf vs ibis and see what are their differences.

duckdf

ibis

duckdf

ibis

What are some alternatives?