Java Analytics

Open-source Java projects categorized as Analytics

Top 21 Java Analytic Projects

  • QuestDB

    QuestDB is an open source time-series database for fast ingest and SQL queries

    Project mention: QuestDB is an open source time-series database for fast ingest and SQL queries | news.ycombinator.com | 2024-08-31
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • Trino

    Official repository of Trino, the distributed SQL query engine for big data, former

    Project mention: Trino: A fast distributed SQL query engine for big data analytics | news.ycombinator.com | 2024-07-09
  • OpenSearch

    🔎 Open source distributed and RESTful search engine.

    Project mention: OpenSearch vs. Elasticsearch: Why OpenSearch is the Better Choice for AWS Users | dev.to | 2024-09-25

    OpenSearch Project on GitHub

  • starrocks

    StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.

    Project mention: A MySQL compatible database engine written in pure Go | news.ycombinator.com | 2024-04-09

    tidb has been around for a while, it is distributed, written in Go and Rust, and MySQL compatible. https://github.com/pingcap/tidb

    Somewhat relatedly, StarRocks is also MySQL compatible, written in Java and C++, but it's tackling OLAP use-cases. https://github.com/StarRocks/starrocks

  • Crate

    CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

    Project mention: OpenAI Acquires Rockset | news.ycombinator.com | 2024-06-21

    Great initiative making a list of possible Rockset replacements. Would it be possible to open the Notion page for guest contributions?

    I would like to add CrateDB (I work there) to the list. CrateDB is a distributed SQL database purposely built for real-time analytics across large datasets of structured and semi-structured data. Similarly to Rockset, it indexes all data in real-time (text, vector, geospatial, time-series, and JSON) for the most efficient search and fast ad hoc query execution at any scale. It is built on top of Apache Lucene and unlike Rockset is open-source (https://github.com/crate/crate).

    Rocket frequently comes up among other solutions our users were looking at before choosing CrateDB. For example https://cratedb.com/customers/govspend.

  • dremio-oss

    Dremio - the missing link in modern data

    Project mention: Shades of Open Source - Understanding The Many Meanings of "Open" | dev.to | 2024-06-15

    This practice, in itself, isn't inherently bad. Many businesses maintain commercial proprietary forks of open-source projects, but usually, the commercial version has a different name than the open-source project. For example, in the world of data catalogs, Dremio is the main developer of Nessie, and Snowflake drives Polaris. Both aim to become community-driven projects over time but will also drive integrated features in their respective commercial products under different names. For instance, if you set up your own Nessie catalog, it has a distinct name compared to the Dremio Enterprise Catalog (formerly Arctic) integrated into Dremio Cloud. The Dremio Enterprise Catalog is powered by Nessie but has additional features, so the different names prevent confusion about available features or which documentation to reference.

  • Mixpanel

    Official Android Tracking Library for Mixpanel Analytics

  • Elide

    Elide is a Java library that lets you stand up a GraphQL/JSON-API web service with minimal effort.

  • zingg

    Scalable identity resolution, entity resolution, data mastering and deduplication using ML

  • Plan

    Player Analytics plugin for Minecraft Server platforms - View player activity of your server with ease. :calendar: (by plan-player-analytics)

  • Rakam

    📈 Collect customer event data from your apps. (Note that this project only includes the API collector, not the visualization platform)

    Project mention: Show HN: Monitor your webapp with minimal setup | news.ycombinator.com | 2023-11-20
  • Smooks

    Extensible data integration Java framework for building XML and non-XML fragment-based applications

  • binjr

    A Time Series Data Browser

  • fili

    Easily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.

  • firebase-analytics

    Enable Firebase Analytics for Capacitor Apps

  • fhir-data-pipes

    A collection of tools for extracting FHIR resources and analytics services on top of that data.

    Project mention: Launch HN: Metriport (YC S22) – Open-source API for healthcare data exchange | news.ycombinator.com | 2024-05-23

    Thank you - glad to see there are others that are aware of the mess of healthcare data!

    > Would it make sense to go one step further and bet on the future being the cloud - and start supporting existing cloud solution like Google Healthcare (FHIR) API (and others) as storage layers?

    Oh for sure - to clarify, we're open-source, but we definitely have a managed cloud solution. For our backend, we currently self-host the OSS version of HAPI FHIR on AWS: https://github.com/metriport/fhir-server. It works pretty well for our purposes, and we'd prefer to not use a managed solution like the Google FHIR storage for this. Mainly for customizability, control, and to keep things OSS.

    With that being said, people using Metriport can store the FHIR data and raw docs coming from our API in whatever solution they wish - including the Google FHIR storage! Everything is standardized to FHIR R4, so syncing to another backend is straightforward.

    In fact, a customer of ours recently used this OSS solution to sync Metriport data to their Google cloud: https://github.com/google/fhir-data-pipes

  • hits

    :chart_with_upwards_trend: Hit Counter for Your GitHub or Any Kind of Websites You Want.

  • mparticle-android-sdk

    mParticle SDK for Android apps

  • RiceStats

    Tracks statistics with InfluxDB for timescale analytics as a Spigot (Paper) plugin

  • dead-salmon-brain

    Apache Spark based framework for analysis A/B experiments

  • spigot-agent

    Spigot plugin to capture and send analytics to the Aurinsk API

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Java Analytics discussion

Log in or Post with

Java Analytics related posts

Index

What are some of the best open-source Analytic projects in Java? This list will help you:

Project Stars
1 QuestDB 14,372
2 Trino 10,248
3 OpenSearch 9,607
4 starrocks 8,742
5 Crate 4,065
6 dremio-oss 1,359
7 Mixpanel 1,019
8 Elide 1,001
9 zingg 950
10 Plan 855
11 Rakam 798
12 Smooks 395
13 binjr 283
14 fili 172
15 firebase-analytics 160
16 fhir-data-pipes 151
17 hits 97
18 mparticle-android-sdk 58
19 RiceStats 11
20 dead-salmon-brain 11
21 spigot-agent 1

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com