Zeppelin
Spark Notebook
Metric | Zeppelin | Spark Notebook
---|---|---
Mentions | 7 | 0
Stars | 5,784 | 3,125
Growth | 0.9% | 0.0%
Activity | 9.0 | 0.0
Last commit | 4 days ago | 10 months ago
Language | Java | JavaScript
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Zeppelin
-
Visualization using Pyspark Dataframe
Have you tried Apache Zeppelin? I remember that you can pretty-print Spark DataFrames directly in it with z.show(df).
-
Fast CSV Processing with SIMD
I used to use Zeppelin, some kind of Jupyter Notebook for Spark (that supports Parquet). But there may be better alternatives.
-
What libraries do you use for machine learning and data visualizing in scala?
Another, more widely used notebook for Scala and Spark: https://zeppelin.apache.org/
-
How to use IPython in Apache Zeppelin Notebook
[1] Apache Zeppelin http://zeppelin.apache.org/ [2] Zeppelin notebooks website http://zeppelin-notebook.com/. [3] Zeppelin notebooks git repo https://github.com/zjffdu/zeppelin-notebook
-
BI Application in Golang.
Apache Zeppelin
-
Using InterSystems Caché and Apache Zeppelin
For all who think: What the heck is Apache Zeppelin? Here is what the project site says:
-
Is there a way to collaborate in real-time for Jupyter Notebooks?
Check out Zeppelin. It's similar to Jupyter and allows real-time editing by multiple users. https://zeppelin.apache.org/
Spark Notebook
We haven't tracked posts mentioning Spark Notebook yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
Breeze - Breeze is a numerical processing library for Scala.
Figaro - Figaro Programming Language and Core Libraries
BigDL - Building Large-Scale AI Applications for Distributed Big Data
Smile - Statistical Machine Intelligence & Learning Engine
OpenMOLE - Workflow engine for exploration of simulation models using high throughput computing
PredictionIO - PredictionIO, a machine learning server for developers and ML engineers.
Algebird - Abstract Algebra for Scala
FACTORIE - FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
Chalk