Java Metadata

Open-source Java projects categorized as Metadata

Top 9 Java Metadata Projects

  1. datahub

    The Metadata Platform for your Data and AI Stack

    Project mention: DataHub: The Data Discovery Platform for the Modern Data Stack | news.ycombinator.com | 2025-02-24
  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

  3. grobid

    A machine learning software for extracting information from scholarly documents

    Project mention: Starting July 1, Academic Publishers Can't Paywall NIH-Funded Research | news.ycombinator.com | 2025-05-01

    What do you mean exactly? I was surprised how easily grobid converts many papers, at least those on arXiv, to XML, which is better for processing than PDF.

    Most of the papers are constructed from their LaTeX sources, so there's an easy way to undo it, I guess.

    https://github.com/kermitt2/grobid

  4. metadata-extractor

    Extracts Exif, IPTC, XMP, ICC and other metadata from image, video and audio files

  5. marquez

    Collect, aggregate, and visualize a data ecosystem's metadata

  6. gravitino

    World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

    Project mention: What is Data Agent and how to build it in 15 Minutes | news.ycombinator.com | 2024-08-16
  7. odd-platform

    First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.

  8. pdf-metadata-editor

    PDF Metadata Editor is a simple tool you can use to edit the metadata (Author, Keywords, etc.) of a PDF document.

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

  10. CommitCombo

    A project that beautifully decorates your GitHub commit history ⭐

  11. eitco-mavenizer

    Helps you to find or define Maven UIDs for any JAR file and generate corresponding artifact install scripts.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Java Metadata related posts

  • Starting July 1, Academic Publishers Can't Paywall NIH-Funded Research

    1 project | news.ycombinator.com | 1 May 2025
  • DataHub: The Data Discovery Platform for the Modern Data Stack

    1 project | news.ycombinator.com | 24 Feb 2025
  • DataHub: Open-Source Metadata Platform

    1 project | news.ycombinator.com | 23 Feb 2025
  • What is Data Agent and how to build it in 15 Minutes

    1 project | news.ycombinator.com | 16 Aug 2024
  • Gravitino: A Powerful Open Data Catalog for Geo-Distributed Metadata Lakes

    1 project | news.ycombinator.com | 6 Aug 2024
  • You don't need to worry about your data get kidnapped in lakehouses

    1 project | news.ycombinator.com | 21 Jun 2024
  • Gravitino: Powerful Open Data Catalog

    1 project | news.ycombinator.com | 14 Jun 2024

Index

What are some of the best open-source Metadata projects in Java? This list will help you:

#  Project              Stars
1  datahub              10,644
2  grobid                4,042
3  metadata-extractor    2,665
4  marquez               1,918
5  gravitino             1,500
6  odd-platform          1,317
7  pdf-metadata-editor     177
8  CommitCombo              37
9  eitco-mavenizer          12

Did you know that Java is the 8th most popular programming language based on number of references?