Breeze
SynapseML
Metric | Breeze | SynapseML
---|---|---
Mentions | 3 | 18
Stars | 3,416 | 4,858
Growth | 0.2% | 6.2%
Activity | 3.9 | 8.6
Last commit | over 1 year ago | 8 days ago
Language | Scala | Scala
License | Apache License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Breeze
- Data science in Scala
You can use https://github.com/scalanlp/breeze, a Scala library that is roughly a NumPy/plotting equivalent. Unlike Spark, which covers more use cases than the classic data science workflow, Breeze is built specifically for "Data Science in Scala". The drawback is a classic one in Scala land: some major libraries abruptly get abandoned. Breeze's commits seem to have slowed down significantly, and the website linked from its GitHub page, www.scalanlp.org, is broken.
- Machine learning on JVM
I haven't checked in on this project in a long time, but Breeze is something akin to NumPy/SciPy.
SynapseML
- FLaNK Stack Weekly for 12 September 2023
- Microsoft announces new tool for applying ChatGPT and GPT-4 at massive scales
- Data science in Scala
b) There are libraries around, e.g. Microsoft SynapseML and LinkedIn Photon ML.
- [P] Microsoft releases SynapseML v0.9.5 with support for speech synthesis, anomaly detection, and geospatial analytics on large-scale data
Link to Release Notes: https://github.com/microsoft/SynapseML/releases/tag/v0.9.5
- Machine learning on JVM
Microsoft ML for Spark gets you a range of powerful ML features on Spark.
What are some alternatives?
mmlspark - Simple and Distributed Machine Learning [Moved to: https://github.com/microsoft/SynapseML]
Spire - Powerful new number types and numeric abstractions for Scala.
ND4S - N-Dimensional Arrays for Scala. Scientific computing à la NumPy. Based on ND4J.
Smile - Statistical Machine Intelligence & Learning Engine
Apache Spark - A unified analytics engine for large-scale data processing
Saddle
Numsca - numsca is numpy for scala
Zeppelin - Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
isolation-forest - A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Algebird - Abstract Algebra for Scala
Squants - The Scala API for Quantities, Units of Measure and Dimensional Analysis
Compute.scala - Scientific computing with N-dimensional arrays