Summingbird
Scalding
Our great sponsors
Summingbird | Scalding | |
---|---|---|
1 | - | |
2,118 | 3,469 | |
- | 0.1% | |
1.7 | 2.5 | |
over 2 years ago | 11 months ago | |
Scala | Scala | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Summingbird
-
Pelikan, Twitterâs framework for building caches
I like this bird pun. Another project at Twitter with a bird pun for a name: Summingbird.
https://github.com/twitter/summingbird
Scalding
We haven't tracked posts mentioning Scalding yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
Apache Flink - Apache Flink
Deeplearning4j - Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learning using automatic differentiation.
Hail - Cloud-native genomic dataframes and batch computing
spark-deployer - Deploy Spark cluster in an easy way.
Reactive-kafka - Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.
Scrunch - Mirror of Apache Crunch (Incubating)
metorikku - A simplified, lightweight ETL Framework based on Apache Spark
GridScale - Scala library for accessing various file, batch systems, job schedulers and grid middlewares.
Scio - A Scala API for Apache Beam and Google Cloud Dataflow.