saddle
Breeze
 | saddle | Breeze |
---|---|---|
Mentions | 1 | 3 |
Stars | 37 | 3,433 |
Growth | - | 0.2% |
Activity | 6.8 | 5.1 |
Last commit | 3 days ago | 2 months ago |
Language | Scala | Scala |
License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
saddle
- Data science in Scala
You might be interested in the saddle library, a dataframe manipulation library similar to Python's pandas.
Breeze
- Arbitrary functions of n dimensions in Scala
You can also look at breeze.generic.UFunc for inspiration.
- Data science in Scala
You can use https://github.com/scalanlp/breeze, a Scala library that is roughly a NumPy-plus-plotting equivalent. Unlike Spark, which covers more use cases than the classic data science workflow, Breeze is built specifically for "Data Science in Scala". The drawback is a classic one in Scala land: some major libraries get abandoned abruptly. Breeze's commits seem to have slowed down significantly, and the website linked from its GitHub page, www.scalanlp.org, is broken.
- Machine learning on JVM
I haven't checked in on this project in a long time, but Breeze is something akin to NumPy/SciPy.
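The breeze.generic.UFunc pointer above refers to Breeze's approach of defining a function once and letting implicit implementations decide how it applies to each argument type. The following is a minimal, dependency-free sketch of that general type-class idea, not Breeze's actual code or API: `Elementwise`, `Lifted`, and `Demo` are hypothetical names invented for illustration, and the sketch only handles `Double`s inside arbitrarily nested `Vector`s.

```scala
// Hypothetical type class: "V is a structure of Doubles that a scalar
// function can be mapped over". Not part of Breeze.
trait Elementwise[V] {
  def map(v: V)(f: Double => Double): V
}

object Elementwise {
  // Base case: a single Double.
  implicit val scalar: Elementwise[Double] = new Elementwise[Double] {
    def map(v: Double)(f: Double => Double): Double = f(v)
  }

  // Inductive case: a Vector of anything that is itself Elementwise.
  // The compiler chains this instance once per level of nesting.
  implicit def nested[A](implicit inner: Elementwise[A]): Elementwise[Vector[A]] =
    new Elementwise[Vector[A]] {
      def map(v: Vector[A])(f: Double => Double): Vector[A] =
        v.map(a => inner.map(a)(f))
    }
}

// Wraps a scalar function so it can be applied at any nesting depth.
final class Lifted(f: Double => Double) {
  def apply[V](v: V)(implicit ev: Elementwise[V]): V = ev.map(v)(f)
}

object Demo {
  // One definition, usable on a scalar, a vector, a vector of vectors, ...
  val square = new Lifted(x => x * x)
}
```

With these definitions, `Demo.square(3.0)` evaluates to `9.0` and `Demo.square(Vector(Vector(1.0, 2.0), Vector(3.0)))` evaluates to `Vector(Vector(1.0, 4.0), Vector(9.0))`. The nesting depth is resolved at compile time, so passing an unsupported type such as a `String` fails to compile rather than failing at runtime.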
What are some alternatives?
spark-nlp - State of the Art Natural Language Processing
ND4S - N-Dimensional Arrays for Scala. Scientific computing à la NumPy. Based on ND4J.
hamilton - Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage and metadata. Runs and scales everywhere Python does.
Spire - Powerful new number types and numeric abstractions for Scala.
SynapseML - Simple and Distributed Machine Learning
Apache Spark - A unified analytics engine for large-scale data processing
sketch - AI code-writing assistant that understands data content
Smile - Statistical Machine Intelligence & Learning Engine
photon-ml - A scalable machine learning library on Apache Spark
Saddle
Numsca - numsca is numpy for scala
Zeppelin - Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.