Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises Learn more →
Top 23 Java Machine Learning Projects
-
Project mention: Pyflink : Flink DataStream (KafkaSource) API to consume from Kafka | reddit.com/r/dataengineering | 2023-05-13
Does anyone have fully running Pyflink code snippet to read from Kafka using the new Flink DataStream (KafkaSource) API and just print out the output to console or write it out to a file. Most of the examples and the official Flink GitHubare using the old API (FlinkKafkaConsumer).
-
Project mention: Getting thousands of files of output back from a container | reddit.com/r/docker | 2023-05-02
Did you check out tools like https://hadoop.apache.org/ ?
-
InfluxDB
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
-
Deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learning using automatic differentiation.
-
mit-deep-learning-book-pdf
MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville
Project mention: Is supervised machine learning the same as linear regression? | reddit.com/r/learnmachinelearning | 2023-03-07 -
Check out Smile too.
-
Project mention: Understanding AI for coders: Tabnine (your alternative to GitHub Copilot) | news.ycombinator.com | 2022-06-21
There's a standard GitHub uses for license files (which must be at the root of the repo) which fills in the "license" field on the right column of the repo. If the standard isn't met then the link just says "View license". I imagine TabNine is pulling the license from the GitHub API.
https://docs.github.com/en/repositories/managing-your-reposi...
https://github.com/Vedenin/useful-java-links
When master branch is renamed to main, GitHub redirects any old links. https://github.com/github/renaming#renaming-existing-branche...
-
Vespa(4.3k ⭐) → A fully featured search engine and vector database. It supports vector search (ANN), lexical search, and search in structured data, all in the same query. Integrated machine-learned model inference allows you to apply AI to make sense of your data in real time.
-
ONLYOFFICE
ONLYOFFICE Docs — document collaboration in your environment. Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises
-
Project mention: Is there a course that teaches you how to make an API with a trained model? | reddit.com/r/learnmachinelearning | 2023-05-27
-
Project mention: Tablesaw: Java Dataframe and Visualization Library | news.ycombinator.com | 2023-02-06
-
It seems to have been picked up by Eclipse and there is also Oracle Labs' Tribuo and Deep Java Library. All seem active, but I don't know much about any of them. I agree it's probably best to follow the community and use a more popular tool like PyTorch.
-
Project mention: 🥪 Best Sites For ebooks, articles, research papers etc..🥪 | reddit.com/r/RockMods | 2023-05-17
-
-
Hi, I'm building a realtime analysis solution for our domain oriented microservice backend. All domain emit events in kafka. I'm looking for a solution to ingest data in an OLAP database based on processing those events (enrichment, filtering etc.). I found https://siddhi.io/ which looks promising. Since the last release (2019) the product is now part of WSo2 solution. I'm also looking at https://www.benthos.dev/. I'm more interested in a declarative solution than code.
-
elasticsearch-learning-to-rank
Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch
Project mention: New free tool that uses fine-tuned BERT model to surface answers from research papers | reddit.com/r/LanguageTechnology | 2022-10-28I worked on a learning-to-rank problem at a previous job (which unfortunately never got deployed womp, womp). This was early days, so at the time I was looking at using LambdaMART with solr or elasticsearch for reranking with a Bayesian click model to get pseudo-labels for relevance.
-
It seems to have been picked up by Eclipse and there is also Oracle Labs' Tribuo and Deep Java Library. All seem active, but I don't know much about any of them. I agree it's probably best to follow the community and use a more popular tool like PyTorch.
-
DatumBox
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
-
Project mention: Hopworks: MLOps platform with Python-centric Feature Store | news.ycombinator.com | 2022-12-02
-
lychee.js
:seedling: Next-Gen AI-Assisted Isomorphic Application Engine for Embedded, Console, Mobile, Server and Desktop
-
-
git clone https://github.com/apache/submarine.git cd submarine git checkout rel/release-0.7.0 helm install submarine ./helm-charts/submarine
-
I'm trying to run a minecraft mod that requires a special dependency to work properly. These are the instructions and this is the mod page. It only shows what to do on debian and redhat. I tried installing both the libgfortran and libgfortran-32bit packages, but it still doesn't work. I also tried installing apt, but it doesn't work and I was unable to find the instructions on how to use it on non-debian based distros.
-
Project mention: [R] LMFlow Benchmark: An Automatic Evaluation Framework for Open-Source LLMs | reddit.com/r/MachineLearning | 2023-05-09
This is super interesting! Thanks for sharing. We're also working on this research field from an open-source angle (https://github.com/Giskard-AI/giskard)
-
-
Sonar
Write Clean Java Code. Always.. Sonar helps you commit clean code every time. With over 600 unique rules to find Java bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
Java Machine Learning related posts
- Is there a course that teaches you how to make an API with a trained model?
- Pytorch eating memory on every api call
- 🥪 Best Sites For ebooks, articles, research papers etc..🥪
- Pyflink : Flink DataStream (KafkaSource) API to consume from Kafka
- Looking for opensource projects to contribute.
- Need statistic test library for Spark Scala
- Getting thousands of files of output back from a container
-
A note from our sponsor - ONLYOFFICE
www.onlyoffice.com | 30 May 2023
Index
What are some of the best open-source Machine Learning projects in Java? This list will help you:
Project | Stars | |
---|---|---|
1 | Apache Flink | 21,279 |
2 | Apache Hadoop | 13,530 |
3 | Deeplearning4j | 12,952 |
4 | mit-deep-learning-book-pdf | 11,230 |
5 | Smile | 5,743 |
6 | useful-java-links | 5,516 |
7 | vespa | 4,419 |
8 | serve | 3,468 |
9 | Tablesaw | 3,239 |
10 | Deep Java Library (DJL) | 3,220 |
11 | grobid | 2,140 |
12 | Apache Mahout | 2,062 |
13 | Siddhi | 1,439 |
14 | elasticsearch-learning-to-rank | 1,427 |
15 | Tribuo | 1,151 |
16 | DatumBox | 1,084 |
17 | hopsworks | 921 |
18 | lychee.js | 798 |
19 | JSAT | 759 |
20 | submarine | 642 |
21 | jblas | 576 |
22 | giskard | 463 |
23 | knime-core | 449 |