cockroach
Trino
Our great sponsors
cockroach | Trino | |
---|---|---|
87 | 36 | |
26,800 | 7,458 | |
1.2% | 4.7% | |
10.0 | 10.0 | |
2 days ago | 6 days ago | |
Go | Java | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cockroach
- Good database solution
-
Does Go work well as a systems language?
You absolutely can write very high performance software in Go, that's kind of the point. You can efficiently interface with C libraries. You can create the sort of software everyone says should be done in Rust, like databases and web servers and system orchestration and games and every other goddamn thing that people will say isn't the right choice for Go.
-
Embed hard-coded SQL into binaries for a cleaner look!
PostgreSQL Parser separated from CockroachDB, a distributed DB.
- Any self hostable postgres, clustering, replication and fail over system?
-
Analysing Github Stars - Extracting and analyzing data from Github using Apache NiFi®, Apache Kafka® and Apache Druid®
Spencer Kimball (now CEO at CockroachDB) wrote an interesting article on this topic in 2021 where they created spencerkimball/stargazers based on a Python script. So I started thinking: could I create a data pipeline using Nifi and Kafka (two OSS tools often used with Druid) to get the API data into Druid - and then use SQL to do the analytics? The answer was yes! And I have documented the outcome below. Here’s my analytical pipeline for Github stars data using Nifi, Kafka and Druid.
- Ask HN: What is your distributed and fault-tolerant PostgreSQL setup?
-
Anyone had a success story of replacing C++ with Go?
One of the most popular distributed DB is built in Go: https://www.cockroachlabs.com/
-
Display CockroachDB metrics in Splunk Dashboards
Recently, I worked on such an integration with Splunk. The Splunk dashboard files that emulate the DB Console are now available in our repo for everyone's benefit.
-
How do I implement a HA PostgreSQL setup in k8s/k3s ?
Technically not postgres, but could be worth checking out cockroach db, it can use any postgres driver from programming languages and is build for distribution. I recently moved from postgres to cockroach (with golang postgres driver) and didnt need to change a single query
Trino
-
Apache Iceberg as storage for on-premise data store (cluster)
Trino or Hive for SQL querying. Get Trino/Hive to talk to Nessie.
-
Uber Interview Experience/Asking Suggestions
One place to look are the projects repo's and docs, once you have a good idea of how the system is architected poking around pieces of the codebase can be helpful in letting you really understand their internals. I personally enjoy going through spark repo and trino repo and the documentation for both projects is decent and can answer many of your questions.
- Java OSS with best code quality you’ve ever seen?
-
What is the separation of storage and compute in data platforms and why does it matter?
However, once your data reaches a certain size or you reach the limits of vertical scaling, it may be necessary to distribute your queries across a cluster, or scale horizontally. This is where distributed query engines like Trino and Spark come in. Distributed query engines make use of a coordinator to plan the query and multiple worker nodes to execute them in parallel.
- Sparkless is born
- Why use Spark at all?
-
Data Engineer Github Profile?
So, there are a lot of people involved which is why you try to find some common ground regarding coding style, documentation, programming language version, … Most larger project have some guidelines you can read to see what is necessary for a PR to be approved: Trino’s Guidelines or dbt.
-
what is the most famous java application that you've worked on?
Trino, a distributed SQL engine (https://trino.io)
-
Join us online on October 31st for Backblaze Tech Day '22!
Storing and Querying Analytical Data With Trino
-
Aerospike Through SQL
For analytics, you can access Aerospike data on Trino with the Aerospike Trino Connector.
What are some alternatives?
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
dremio-oss - Dremio - the missing link in modern data
Presto - The official home of the Presto distributed SQL query engine for big data
Apache Drill - Apache Drill is a distributed MPP query layer for self describing data
ClickHouse - ClickHouse® is a free analytics DBMS for big data
Apache Calcite - Apache Calcite
vitess - Vitess is a database clustering system for horizontal scaling of MySQL.
tidb - TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://tidbcloud.com/free-trial
spring-data-jpa-mongodb-expressions - Use the MongoDB query language to query your relational database, typically from frontend.
yugabyte-db - YugabyteDB - the cloud native distributed SQL database for mission-critical applications.
rqlite - The lightweight, distributed relational database built on SQLite
InfluxDB - Scalable datastore for metrics, events, and real-time analytics