Fahmatrix – A Lightweight, Pandas-Like DataFrame Library for Java (GitHub)

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Sevalla - Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
  1. Fahmatrix

    Fahmatrix is a lightweight, modern Java library for working with tabular data, inspired by Python's Pandas and rooted in the idea of making data understanding (fahm) easy on the JVM.

    Hey HN,

    I’ve built Fahmatrix, a minimal, fast Java library for working with tabular data — inspired by Python’s pandas, but designed for performance and simplicity on the JVM.

    After working extensively with Python’s data stack, I often ran into limitations related to speed, especially in larger or long-running data workflows. So I built Fahmatrix from scratch to offer similar APIs for manipulating CSVs, performing summary statistics, slicing rows/columns, and more — but all in Java.

    Features:

    Lightweight and dependency-free

    CSV/TSV import with auto-headers

    Series/DataFrame structures (like pandas)

    describe(), mean(), stdDev(), percentile() and more

    Fast parallel operations on numeric columns

    Java 17+ support

    Docs: https://moustafa-nasr.github.io/Fahmatrix/

  2. Sevalla

    Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!

    Sevalla logo
  3. Tablesaw

    Java dataframe and visualization library

    Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

    https://github.com/jtablesaw/tablesaw

    https://github.com/dflib/dflib

    My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

  4. dflib

    In-memory Java DataFrame library

    Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:

    https://github.com/jtablesaw/tablesaw

    https://github.com/dflib/dflib

    My preferred way is just use duckdb java API. I didn't see anything better in performance/efficiency. Also a SQL query is often easier to write

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Kotlin DataFrame: Revolutionizing Data Processing on the JVM

    1 project | dev.to | 18 May 2025
  • Java DataFrame library 1.0 GA release

    1 project | news.ycombinator.com | 18 Dec 2024
  • Plotting Financial Data in Kotlin with Kandy

    3 projects | dev.to | 9 Apr 2024
  • Solomon: Data Exploration tool.

    1 project | /r/Python | 21 Mar 2023
  • Tablesaw: Java Dataframe and Visualization Library

    1 project | news.ycombinator.com | 6 Feb 2023

Did you know that Java is
the 8th most popular programming language
based on number of references?