Java Data

Open-source Java projects categorized as Data

Top 9 Java Data Projects

  • GitHub repo OpenRefine

    OpenRefine is a free, open source power tool for working with messy data and improving it

    Project mention: Data mapping process | reddit.com/r/dataengineering | 2021-11-05

    In terms of open source - is OpenRefine what you are after?

  • GitHub repo airbyte

    Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.

    Project mention: Starting small Airbyte on GCP | dev.to | 2021-11-16

    The premise is that Airbyte is integrated into a data infrastructure centered on BigQuery, with the system built on GCP services. As a small start, I planned to put the ETL for one service in Zendesk into production and gradually add ETL for other services while confirming stable operation. The Airbyte repository describes how to deploy with docker-compose on GCE, so I decided to deploy that way and operate it for a while. There is also a Kubernetes deployment method, but GKE carries operational costs in addition to compute costs, so I decided against it this time.

  • GitHub repo data-transfer-project

    The Data Transfer Project makes it easy for people to transfer their data between online service providers. We are establishing a common framework, including data models and protocols, to enable direct transfer of data both into and out of participating online service providers.

    Project mention: De-monopolizing the internet through interoperability | reddit.com/r/france | 2021-10-12

  • GitHub repo proteus

    Proteus : A JSON based LayoutInflater for Android

    Project mention: Looking at options for adding layout files to an installed app, without updating APK | reddit.com/r/androiddev | 2021-11-21

    Flipkart has created Proteus

  • GitHub repo OpenMetadata

    Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

    Project mention: OpenMetadata | reddit.com/r/dataengineering | 2021-09-23

    Hi, We are a team building OpenMetadata, a single place to discover, collaborate and get your data right. Please check our announcement here https://blog.open-metadata.org/announcing-openmetadata-20399b816e60 Check out our code https://github.com/open-metadata/OpenMetadata . If you are interested in learning please do join our slack and ask any questions you may have http://openmetadata.slack.com

  • GitHub repo nessie

    Nessie: Transactional Catalog for Data Lakes with Git-like semantics

    Project mention: Project Nessie provides Git-like capabilities for your Data Lake | news.ycombinator.com | 2021-03-10

  • GitHub repo riot

    Get data in and out of Redis (by redis-developer)

    Project mention: Redis instance type switch | reddit.com/r/aws | 2021-04-13

    Give RIOT a try. https://github.com/redis-developer/riot

  • GitHub repo Db4o-gpl

    Db4o GPL version for .netstandard2.0 & Java7+ Android Xamarin..., the best database project to help you to learn how to write a database

  • GitHub repo SheetsIO

    Small configurable Java app that pulls data from a Google Spreadsheet (using v4 api) and writes to files and a local webserver.

    Project mention: OBS scoreboard for my students | reddit.com/r/obs | 2021-04-02

    SheetsIO, https://github.com/GrandyB/SheetsIO, uses a Google Sheet to write files on your computer which you can then read using OBS text or image sources. This does not require any programming knowledge outside of writing a JSON config file (to tell it what cell data to write to what file) which isn't that hard. There's an introduction video on the page that shows how to get started.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-11-21.

Index

What are some of the best open-source Data projects in Java? This list will help you:

#  Project                 Stars
1  OpenRefine              8,500
2  airbyte                 4,652
3  data-transfer-project   3,271
4  proteus                 1,215
5  OpenMetadata              514
6  nessie                    379
7  riot                       79
8  Db4o-gpl                   20
9  SheetsIO                    9