clickhousedb_fdw
tidyquery
clickhousedb_fdw | tidyquery | |
---|---|---|
2 | 2 | |
197 | 167 | |
2.5% | - | |
0.0 | 0.0 | |
over 3 years ago | over 1 year ago | |
C | R | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
clickhousedb_fdw
-
ClickHouse: An open-source column-oriented database management system
Maybe you could do something with postgres foreign data wrappers?
For example this exists, not sure about maturity though https://github.com/Percona-Lab/clickhousedb_fdw
-
ClickHouse as an alternative to Elasticsearch for log storage and analysis
* you can go the other way too: read Clickhouse from PostgreSQL (see https://github.com/Percona-Lab/clickhousedb_fdw, although we didn't try this)
tidyquery
-
Can "dplyr" code automatically be converted to SQL code?
tidyquery
-
ClickHouse as an alternative to Elasticsearch for log storage and analysis
> SQL is a perfect language for analytics.
Slightly off topic, but I strongly agree with this statement and wonder why the languages used for a lot of data science work (R, Python) don't have such a strong focus on SQL.
It might just be my brain, but SQL makes so much logical sense as a query language and, with small variances, is used to directly query so many databases.
In R, why learn the data.tables (OK, speed) or dplyr paradigms, when SQL can be easily applied directly to dataframes? There are libraries to support this like sqldf[1], tidyquery[2] and duckdf[3] (author). And I'm sure the situation is similar in Python.
This is not a post against great libraries like data.table and dplyr, which I do use from time to time. It's more of a question about why SQL is not more popular as the query language de jour for data science.
[1] https://cran.r-project.org/web/packages/sqldf/index.html
[2] https://github.com/ianmcook/tidyquery
[3] https://github.com/phillc73/duckdf
What are some alternatives?
Trino - Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
duckdf - 🦆 SQL for R dataframes, with ducks
pirsch - Pirsch is a drop-in, server-side, no-cookie, and privacy-focused analytics solution for Go.
meilisearch-js-plugins - The search client to use Meilisearch with InstantSearch.
tidyquant - Bringing financial analysis to the tidyverse
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
tidyverse - Easily install and load packages from the tidyverse
ClickHouse - ClickHouse® is a free analytics DBMS for big data
tidylog - Tidylog provides feedback about dplyr and tidyr operations. It provides wrapper functions for the most common functions, such as filter, mutate, select, and group_by, and provides detailed output for joins.
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow