Trino
Apache Cassandra
Our great sponsors
Trino | Apache Cassandra | |
---|---|---|
44 | 35 | |
9,552 | 8,510 | |
3.1% | 0.9% | |
10.0 | 9.9 | |
5 days ago | 1 day ago | |
Java | Java | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Trino
- Trino: Fast distributed SQL query engine for big data analytics
-
Game analytic power: how we process more than 1 billion events per day
We decided not to waste time reinventing the wheel and simply installed Trino on our servers. It’s a full featured SQL query engine that works on your data. Now our analysts can use it to work with data from AppMetr and execute queries at different levels of complexity.
-
Your Thoughts on OLAPs Clickhouse vs Apache Druid vs Starrocks in 2023/2024
DevRel for StarRocks. Trino doesn't have a great caching layer (https://github.com/trinodb/trino/pull/16375) and performance (https://github.com/trinodb/trino/issues/14237) and https://github.com/oap-project/Gluten-Trino. In benchmarks and community user testing, StarRocks has outperformed.
-
Making Hard Things Easy
What if my SQL engine is Presto, Trino [1], or a similar query engine? If it's federating multiple source databases we peel the SQL back and get... SQL? Or you peel the SQL back and get... S3 + Mongo + Hadoop? Junior analysts would work at 1/10th the speed if they had to use those raw.
[1] https://trino.io/
- Trino, a open query engine that runs at ludicrous speed
-
Questions about Athena, Trino and Iceberg
The good thing is that the concepts in terms to the SQL supported by Trino transfers between them all. So its completely reasonable to start with one and move to another. In fact that is something that happens regularly. I invite to you check out the talks from the Trino Fest event that is just wrapping up today. There are presentations about all these aspects and different scenarios users encounter. All videos and slides will go live on the Trino website soon. Also feel free to join the Trino slack to chat about about all this with other users.
-
Multi-Databases across Multiple Servers - MySQL
There are distributed query engines like Trino that help with this sort of problem https://trino.io/
-
Iceberg on Cloudtrail Logs with Athena
This issue in particular is a killer for me: https://github.com/trinodb/trino/issues/10974
-
Data Lake, Real-time Analytics, or Both? Exploring Presto and ClickHouse
AFAIK Presto was forked and Trino https://trino.io/ is now the leading SQL Query engine .
-
Apache Iceberg as storage for on-premise data store (cluster)
Trino or Hive for SQL querying. Get Trino/Hive to talk to Nessie.
Apache Cassandra
-
How to Choose the Right MQTT Data Storage for Your Next Project
Apache Cassandra{:target="_blank"} is a highly scalable and fault-tolerant database that can handle large volumes of data across multiple nodes or clusters. It provides fast read and write operations, making it suitable for real-time analytics or applications with high throughput requirements.
- 10+ Open-Source Projects For Web Developers In 2023
-
Database 101: Data Consistency for Beginners
Wide Column: Apache Cassandra, ScyllaDB and DynamoDB
-
In One Minute : Hadoop
Cassandra, a replicated, fault-tolerant, decentralized and scalable database system.
-
Build Your First App with JavaScript, Node.js, and DataStax Astra DB
A popular database you might already be familiar with is Apache Cassandra®, which powers high-performing applications for thousands of companies including Hulu, Netflix, Spotify, and Apple. While this free, open-source database is known for its high availability, scalability, and resilience; the downside is that it’s also notoriously complex to set up and manage.
- Reducing logging cost by two orders of magnitude using CLP
-
Baeldung Series Part 2: Build a Dashboard With Cassandra, Astra and CQL – Mapping Event Data
In our previous article, we looked at augmenting our dashboard to store and display individual events from the Avengers using DataStax Astra, a serverless DBaaS powered by Apache Cassandra using Stargate to offer additional APIs for working with it.
-
System Design: CAP theorem
Example: Apache Cassandra, CouchDB.
-
Deploy a TikTok Clone with Node.js, Netlify, and DataStax Astra DB
For our TikTok database, we’re using DataStax Astra DB: a cloud-based database that fully manages Apache Cassandra®, one of the most robust and scalable NoSQL databases around.
-
System Design: The complete course
Data partitioning in Apache Cassandra.
What are some alternatives?
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
Druid - Apache Druid: a high performance real-time analytics database.
dremio-oss - Dremio - the missing link in modern data
LevelDB - LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
Presto - The official home of the Presto distributed SQL query engine for big data
Scylla - NoSQL data store using the seastar framework, compatible with Apache Cassandra
Apache Drill - Apache Drill is a distributed MPP query layer for self describing data
delta - An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Apache Calcite - Apache Calcite
Apache HBase - Apache HBase
ClickHouse - ClickHouse® is a free analytics DBMS for big data
Event Store - EventStoreDB, the event-native database. Designed for Event Sourcing, Event-Driven, and Microservices architectures