Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge. Learn more →
Top 12 Java Hive Projects
-
APIJSON
🏆 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构。 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any code.
-
Note that glibc has a similar problem in multithreaded contexts. It strands unused memory in thread-local pools, which grows your memory usage over time like a memory leak. We got lower memory usage that didn't grow over time by switching to jemalloc.
Example of this: https://github.com/prestodb/presto/issues/8993
-
Onboard AI
Learn any GitHub repo in 59 seconds. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev.
-
Learn more about Apache Doris or find the Doris makers on Slack.
-
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Project mention: Game analytic power: how we process more than 1 billion events per day | dev.to | 2023-11-24We decided not to waste time reinventing the wheel and simply installed Trino on our servers. It’s a full featured SQL query engine that works on your data. Now our analysts can use it to work with data from AppMetr and execute queries at different levels of complexity.
-
Project mention: Apache Iceberg as storage for on-premise data store (cluster) | /r/dataengineering | 2023-03-16
Trino or Hive for SQL querying. Get Trino/Hive to talk to Nessie.
-
linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
-
Project mention: Git Query Language (GQL) Aggregation Functions, Groups, Alias | /r/ProgrammingLanguages | 2023-06-30
Also are you familiar with apache drill . The idea is to put an SQL interpreter in front of any kind of database just like you are doing for git here.
-
InfluxDB
Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.
-
-
helicalinsight
Helical Insight software is world’s first Open Source Business Intelligence framework which helps you to make sense out of your data and make well informed decisions.
-
waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
-
dataCompare
big data comparison and data profiling platform: low code,data comparison and data profiling
Project mention: Design and practice of open source big data comparison platform | /r/bigdata | 2022-12-14 -
hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Java Hive related posts
- Game analytic power: how we process more than 1 billion events per day
- Your Thoughts on OLAPs Clickhouse vs Apache Druid vs Starrocks in 2023/2024
- Log Analysis: Elasticsearch VS Apache Doris
- Ask HN: What are some SQL transpilers?
- Trino, a open query engine that runs at ludicrous speed
- Questions about Athena, Trino and Iceberg
- Multi-Databases across Multiple Servers - MySQL
-
A note from our sponsor - InfluxDB
www.influxdata.com | 1 Dec 2023
Index
What are some of the best open-source Hive projects in Java? This list will help you:
Project | Stars | |
---|---|---|
1 | APIJSON | 16,075 |
2 | Presto | 15,247 |
3 | doris | 10,120 |
4 | Trino | 8,864 |
5 | Apache Hive | 5,160 |
6 | linkis | 3,167 |
7 | Apache Drill | 1,850 |
8 | yauaa | 693 |
9 | helicalinsight | 276 |
10 | waggle-dance | 244 |
11 | dataCompare | 219 |
12 | hadoopcryptoledger | 138 |