|2 months ago||3 months ago|
|Apache License 2.0||GNU General Public License v3.0 or later|
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
We haven't tracked posts mentioning Apache Mahout yet.
Tracking mentions began in Dec 2020.
What libraries do you use for machine learning and data visualizing in scala?
5 projects | reddit.com/r/scala | 27 Nov 2021
I use Smile https://github.com/haifengl/smile with Ammonite and it feels pretty easy/good to work with. Of course, for purely looking at data and exploring it, you're not going to beat Python.
Python VS Scala
2 projects | reddit.com/r/scala | 2 Jul 2021
Actually, it does. Scala has Spark for data science and some ML libs like Smile.
[R] NLP Machine Learning with low RAM
1 project | reddit.com/r/MachineLearning | 2 Jun 2021
I guess I must have a mistake somewhere. It's not much code. It's written in Kotlin with Smile. My dataset is only about 32MB. I load the dataset into memory. I then use 80% of the data for training and the other 20% for later testing. I get just the columns I need and store them in the variable dataset.
Kotlin with Random Forest Classifier
1 project | reddit.com/r/Kotlin | 19 Apr 2021
I've heard good things about Smile, probably beats libs like Weka by far. I'm not sure if you can load a scikit-learn model though, so you might need to retrain the model in Kotlin.
Machine learning on JVM
6 projects | reddit.com/r/scala | 5 Apr 2021
I used Smile for a while - https://haifengl.github.io/ - it's a fairly small, lightweight Java library with some very basic algorithms. I used it in particular for clustering. It also provides a Scala API.
What are some alternatives?
Apache Spark - A unified analytics engine for large-scale data processing
Deeplearning4j - Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for Keras, TensorFlow, and ONNX/PyTorch, a modular and tiny C++ library for running math code, and a Java-based math library on top of the core C++ library. Also includes SameDiff: a PyTorch/TensorFlow-like library for running deep learning using automatic differentiation.
Breeze - Breeze is a numerical processing library for Scala.
ND4S - N-Dimensional Arrays for Scala. Scientific computing à la NumPy. Based on ND4J.
H2O - Sparkling Water provides H2O functionality inside a Spark cluster
Apache Flink - Apache Flink
Deep Java Library (DJL) - An Engine-Agnostic Deep Learning Framework in Java
grobid - Machine learning software for extracting information from scholarly documents