SaaSHub helps you find the best software and product alternatives Learn more →
Top 21 Java Analytic Projects
-
Project mention: QuestDB is an open source time-series database for fast ingest and SQL queries | news.ycombinator.com | 2024-08-31
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Project mention: Trino: A fast distributed SQL query engine for big data analytics | news.ycombinator.com | 2024-07-09
-
Project mention: OpenSearch vs. Elasticsearch: Why OpenSearch is the Better Choice for AWS Users | dev.to | 2024-09-25
OpenSearch Project on GitHub
-
starrocks
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
Project mention: A MySQL compatible database engine written in pure Go | news.ycombinator.com | 2024-04-09tidb has been around for a while, it is distributed, written in Go and Rust, and MySQL compatible. https://github.com/pingcap/tidb
Somewhat relatedly, StarRocks is also MySQL compatible, written in Java and C++, but it's tackling OLAP use-cases. https://github.com/StarRocks/starrocks
-
Crate
CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.
Great initiative making a list of possible Rockset replacements. Would it be possible to open the Notion page for guest contributions?
I would like to add CrateDB (I work there) to the list. CrateDB is a distributed SQL database purposely built for real-time analytics across large datasets of structured and semi-structured data. Similarly to Rockset, it indexes all data in real-time (text, vector, geospatial, time-series, and JSON) for the most efficient search and fast ad hoc query execution at any scale. It is built on top of Apache Lucene and unlike Rockset is open-source (https://github.com/crate/crate).
Rocket frequently comes up among other solutions our users were looking at before choosing CrateDB. For example https://cratedb.com/customers/govspend.
-
Project mention: Shades of Open Source - Understanding The Many Meanings of "Open" | dev.to | 2024-06-15
This practice, in itself, isn't inherently bad. Many businesses maintain commercial proprietary forks of open-source projects, but usually, the commercial version has a different name than the open-source project. For example, in the world of data catalogs, Dremio is the main developer of Nessie, and Snowflake drives Polaris. Both aim to become community-driven projects over time but will also drive integrated features in their respective commercial products under different names. For instance, if you set up your own Nessie catalog, it has a distinct name compared to the Dremio Enterprise Catalog (formerly Arctic) integrated into Dremio Cloud. The Dremio Enterprise Catalog is powered by Nessie but has additional features, so the different names prevent confusion about available features or which documentation to reference.
-
-
Elide
Elide is a Java library that lets you stand up a GraphQL/JSON-API web service with minimal effort.
-
-
Plan
Player Analytics plugin for Minecraft Server platforms - View player activity of your server with ease. :calendar: (by plan-player-analytics)
-
Rakam
📈 Collect customer event data from your apps. (Note that this project only includes the API collector, not the visualization platform)
Project mention: Show HN: Monitor your webapp with minimal setup | news.ycombinator.com | 2023-11-20 -
Smooks
Extensible data integration Java framework for building XML and non-XML fragment-based applications
-
-
fili
Easily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.
-
-
fhir-data-pipes
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Project mention: Launch HN: Metriport (YC S22) – Open-source API for healthcare data exchange | news.ycombinator.com | 2024-05-23Thank you - glad to see there are others that are aware of the mess of healthcare data!
> Would it make sense to go one step further and bet on the future being the cloud - and start supporting existing cloud solution like Google Healthcare (FHIR) API (and others) as storage layers?
Oh for sure - to clarify, we're open-source, but we definitely have a managed cloud solution. For our backend, we currently self-host the OSS version of HAPI FHIR on AWS: https://github.com/metriport/fhir-server. It works pretty well for our purposes, and we'd prefer to not use a managed solution like the Google FHIR storage for this. Mainly for customizability, control, and to keep things OSS.
With that being said, people using Metriport can store the FHIR data and raw docs coming from our API in whatever solution they wish - including the Google FHIR storage! Everything is standardized to FHIR R4, so syncing to another backend is straightforward.
In fact, a customer of ours recently used this OSS solution to sync Metriport data to their Google cloud: https://github.com/google/fhir-data-pipes
-
-
-
-
-
Java Analytics discussion
Java Analytics related posts
-
OpenSearch vs. Elasticsearch: Why OpenSearch is the Better Choice for AWS Users
-
turbopuffer: Fast Search on Object Storage
-
Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis
-
StarRocks – sub-second MPP OLAP database for full analytics scenarios
-
Let's Talk about Joins
-
Guiding Principles
-
Apache Pinot 1.0
-
A note from our sponsor - SaaSHub
www.saashub.com | 4 Oct 2024
Index
What are some of the best open-source Analytic projects in Java? This list will help you:
Project | Stars | |
---|---|---|
1 | QuestDB | 14,372 |
2 | Trino | 10,248 |
3 | OpenSearch | 9,607 |
4 | starrocks | 8,742 |
5 | Crate | 4,065 |
6 | dremio-oss | 1,359 |
7 | Mixpanel | 1,019 |
8 | Elide | 1,001 |
9 | zingg | 950 |
10 | Plan | 855 |
11 | Rakam | 798 |
12 | Smooks | 395 |
13 | binjr | 283 |
14 | fili | 172 |
15 | firebase-analytics | 160 |
16 | fhir-data-pipes | 151 |
17 | hits | 97 |
18 | mparticle-android-sdk | 58 |
19 | RiceStats | 11 |
20 | dead-salmon-brain | 11 |
21 | spigot-agent | 1 |