Apache Accumulo
Apache Hive
Apache Accumulo | Apache Hive | |
---|---|---|
2 | 14 | |
1,046 | 5,335 | |
-0.1% | 0.7% | |
9.7 | 9.6 | |
about 15 hours ago | 2 days ago | |
Java | Java | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Apache Accumulo
-
In One Minute : Hadoop
Accumulo, a sorted, distributed key/value store that provides robust, scalable data storage and retrieval.
- Apache Accumulo – sorted, distributed, robust, scalable key/value store
Apache Hive
-
Apache Iceberg as storage for on-premise data store (cluster)
Trino or Hive for SQL querying. Get Trino/Hive to talk to Nessie.
-
In One Minute : Hadoop
Hive, A data warehouse infrastructure that provides data summarization and ad hoc querying.
- Visionary French entrepreneur, David Gurle, launches new venture – Hive
-
DeWitt Clause, or Can You Benchmark %DATABASE% and Get Away With It
Apache Drill, Druid, Flink, Hive, Kafka, Spark
-
Apache Spark, Hive, and Spring Boot — Testing Guide
In this article, I'm showing you how to create a Spring Boot app that loads data from Apache Hive via Apache Spark to the Aerospike Database. More than that, I'm giving you a recipe for writing integration tests for such scenarios that can be run either locally or during the CI pipeline execution. The code examples are taken from this repository.
- Apache Hive in the vein!
-
Jinja2 not formatting my text correctly. Any advice?
ListItem(name='Apache Hive', website='https://hive.apache.org/', category='Interactive Query', short_description='Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives an SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop.'),
-
Understanding SQL Dialects
Apache Hive takes in a specific SQL dialect and converts it to map-reduce.
-
The Data Engineer Roadmap 🗺
Apache Hive
-
Open Source SQL Parsers
Apache Calcite is a popular parser/optimizer that is used in popular databases and query engines like Apache Hive, BlazingSQL and many others.
What are some alternatives?
Presto - The official home of the Presto distributed SQL query engine for big data
superset - Apache Superset is a Data Visualization and Data Exploration Platform
beam - Apache Beam is a unified programming model for Batch and Streaming data processing.
ObjectBox Java (Kotlin, Android) - Java and Android Database - fast and lightweight without any ORM
Zeppelin - Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
HikariCP - 光 HikariCP・A solid, high-performance, JDBC connection pool at last.
Hazelcast - Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
Apache Phoenix - Apache Phoenix
Apache Flink - Apache Flink
Flyway - Flyway by Redgate • Database Migrations Made Easy.
Hazelcast Jet - Distributed Stream and Batch Processing