Scalding
Summingbird
Scalding | Summingbird | |
---|---|---|
- | 1 | |
3,471 | 2,118 | |
0.1% | - | |
2.5 | 1.7 | |
11 months ago | over 2 years ago | |
Scala | Scala | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Scalding
We haven't tracked posts mentioning Scalding yet.
Tracking mentions began in Dec 2020.
Summingbird
-
Pelikan, Twitterâs framework for building caches
I like this bird pun. Another project at Twitter with a bird pun for a name: Summingbird.
https://github.com/twitter/summingbird
What are some alternatives?
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
Deeplearning4j - Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learning using automatic differentiation.
Apache Flink - Apache Flink
spark-deployer - Deploy Spark cluster in an easy way.
Hail - Cloud-native genomic dataframes and batch computing
Scrunch - Mirror of Apache Crunch (Incubating)
Reactive-kafka - Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.
metorikku - A simplified, lightweight ETL Framework based on Apache Spark
Scio - A Scala API for Apache Beam and Google Cloud Dataflow.
GridScale - Scala library for accessing various file, batch systems, job schedulers and grid middlewares.