Zeppelin
BigDL
Our great sponsors
Zeppelin | BigDL | |
---|---|---|
8 | 5 | |
6,263 | 5,957 | |
0.4% | 19.8% | |
8.7 | 9.9 | |
3 days ago | 1 day ago | |
Java | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Zeppelin
-
Serverless Apache Zeppelin on AWS
Now we can proceed with the definition of Apache Zeppelin. It is a web-based notebook that enables data-driven, interactive data analytics and collaborative documents with Python, Scala, SQL, Spark, and more. You can execute code and even schedule a job (via cron) to run at regular intervals.
-
Visualization using Pyspark Dataframe
Have you tried Apache Zepellin I remember that you can pretty print spark dataframes directly on it with z.show(df)
-
Fast CSV Processing with SIMD
I used to use Zeppelin, some kind of Jupyter Notebook for Spark (that supports Parquet). But it may be better alternatives.
https://zeppelin.apache.org/
-
What libraries do you use for machine learning and data visualizing in scala?
Another more widely used notebooks for scala and spark: https://zeppelin.apache.org/
-
How to use IPython in Apache Zeppelin Notebook
[1] Apache Zeppelin http://zeppelin.apache.org/ [2] Zeppelin notebooks website http://zeppelin-notebook.com/. [3] Zeppelin notebooks git repo https://github.com/zjffdu/zeppelin-notebook
-
BI Application in Golang.
Apache Zeppelin
-
Using InterSystems Caché and Apache Zeppelin
For all who think: What the heck is Apache Zeppelin? Here are some details what the project site says:
-
Is there a way to collaborate in real-time for Jupyter Notebooks?
Check out Zeppelin. It's similar to Jupyter and allows real-time editing by multiple users. https://zeppelin.apache.org/
BigDL
- PyTorch Library for Running LLM on Intel CPU and GPU
-
LLaMA Now Goes Faster on CPUs
Any performance benchmark against intel's 'IPEX-LLM'[0] or others?
[0] - https://github.com/intel-analytics/ipex-llm
- BigDL-LLM: running LLM on your laptop using INT4
- Fast, distributed, secure AI for Big Data
-
Machine learning on JVM
Intel BigDL for Spark which again is for Spark.
What are some alternatives?
Breeze - Breeze is a numerical processing library for Scala.
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
Spark Notebook - Interactive and Reactive Data Science using Scala and Spark.
Axle - Axle Domain Specific Language for Scientific Cloud Computing and Visualization
Algebird - Abstract Algebra for Scala
deequ - Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Figaro - Figaro Programming Language and Core Libraries
Spire - Powerful new number types and numeric abstractions for Scala.
Smile - Statistical Machine Intelligence & Learning Engine
PredictionIO - PredictionIO, a machine learning server for developers and ML engineers.
Persist-Units - Scala Units of Measure Types