amazon-emr-with-delta-lake
BigDL
amazon-emr-with-delta-lake | BigDL | |
---|---|---|
1 | 5 | |
17 | 6,003 | |
- | 16.4% | |
4.0 | 9.9 | |
6 months ago | 2 days ago | |
Jupyter Notebook | Python | |
MIT No Attribution | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
amazon-emr-with-delta-lake
BigDL
- PyTorch Library for Running LLM on Intel CPU and GPU
-
LLaMA Now Goes Faster on CPUs
Any performance benchmark against intel's 'IPEX-LLM'[0] or others?
[0] - https://github.com/intel-analytics/ipex-llm
- BigDL-LLM: running LLM on your laptop using INT4
- Fast, distributed, secure AI for Big Data
-
Machine learning on JVM
Intel BigDL for Spark which again is for Spark.
What are some alternatives?
H2O - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
ngods-stocks - New Generation Opensource Data Stack Demo
Zeppelin - Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
demo-code - Bits of code I use during live demos
Axle - Axle Domain Specific Language for Scientific Cloud Computing and Visualization
data-engineering-zoomcamp - Free Data Engineering course!
deequ - Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Spire - Powerful new number types and numeric abstractions for Scala.
PredictionIO - PredictionIO, a machine learning server for developers and ML engineers.
Persist-Units - Scala Units of Measure Types
Tensorflow_scala - TensorFlow API for the Scala Programming Language