Metric | geni | tablecloth
---|---|---
Mentions | 4 | 10
Stars | 275 | 265
Growth | 0.7% | 1.5%
Activity | 5.6 | 9.2
Last commit | 5 months ago | 14 days ago
Language | Clojure | HTML
License | Apache License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
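The page does not say how these numbers are actually computed, so the following is only a rough sketch of one way such metrics could work. It assumes an exponential decay for commit weights (the 30-day half-life is a made-up parameter), a percentile rank scaled to 0-10 for the activity rating, and a plain percentage change for growth. All function names and sample values are hypothetical.

```python
from datetime import date


def recency_weighted_activity(commit_dates, today, half_life_days=30.0):
    """Raw activity: each commit's weight halves every `half_life_days`.

    The decay shape and the half-life are assumptions for illustration,
    not the tracker's real formula.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_rating(raw_score, all_raw_scores):
    """Map a raw score to a 0-10 scale by percentile rank.

    A rating of 9.0 means the project scores higher than 90% of the
    tracked projects, i.e. it sits in the top 10%.
    """
    below = sum(1 for s in all_raw_scores if s < raw_score)
    return round(10.0 * below / len(all_raw_scores), 1)


def monthly_growth(stars_now, stars_month_ago):
    """Month-over-month growth in stars, as a percentage."""
    return round(100.0 * (stars_now - stars_month_ago) / stars_month_ago, 1)


# Hypothetical data: one commit today and one commit 30 days ago.
today = date(2024, 1, 31)
raw = recency_weighted_activity([date(2024, 1, 31), date(2024, 1, 1)], today)

# Hypothetical raw scores for ten tracked projects.
rating = activity_rating(9, list(range(10)))

# Hypothetical star counts.
growth = monthly_growth(203, 200)
```

Under these assumptions, the older commit contributes half as much as the one made today, and a project that outranks nine of ten tracked projects gets a rating of 9.0.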
geni
- Spark Anyone?
  sparkling is fine. There is also geni.
- LLVM!
- Scala is a Maintenance Nightmare
  I haven't tried Spark from Kotlin, but it's a nice experience working with it in Clojure, and I have yet to see a language more expressive than Clojure. :)
- Data engineering and Clojure?
  I think for the large-scale stuff, wrappers like geni are pretty nice and are built on top of established tech. Several distributed computing platforms, like Onyx and Storm, also popped up in Clojure and may be interesting to look at. Clojure Toolbox has a good index of libraries to examine.
tablecloth
- Is there a library for rank polymorphism in clojure?
  Another approach, and definitely a better one for serious projects, is to use https://github.com/scicloj/tablecloth or the things it mentions: tech.ml.dataset and dtype-next.
- Data-recur meeting 4: an intro to Tablecloth
  The fourth meeting will be at the end of October and will be dedicated to the Tablecloth dataset manipulation library by generateme, with an intro by Ethan Miller, who is nowadays involved in developing Tablecloth.
- Best Data Tools for my use case
  I really like geni: it is really idiomatic in its approach to Apache Spark. There are some gaps (no UDFs), and I am not sure that the project is as active as it used to be. But I still use it and find it very nice (I already have an Apache Spark background). tablecloth is an alternative dataframe library that is used by a lot of folks in the Clojure data science world. For that matter, you should check out scicloj, and also hang out in the data channel in Zulip.
- Why Clojure is not widely adopted like mainstream languages?
- re:Clojure 2021 workshop: Wrangling datasets with Tablecloth by Mey Beisaron (2021-11-07)
  At this re:Clojure workshop (Nov. 7th), @ladymeyy taught us about Tablecloth.
- On Sunday: a workshop by Mey Beisaron about Tablecloth
- Scicloj ml-study 15: data visualization
  In both sessions, we will practice data visualization on a real-world data problem. Among other things, we will try a new data visualization library that Ashima Panjwani is working on. We will assume basic familiarity with Clojure and with Tablecloth. Both sessions will be independent, probably overlapping in content.
- Scicloj study sessions this weekend: data wrangling with Tablecloth
  We are planning some Scicloj study sessions this weekend about data wrangling with Tablecloth.
- LLVM!
- Clojure High Performance Data Processing System
  In general, for R integration and more data science goodies, check out scicloj. And in the vein of dplyr-style, extremely well-thought-out interfaces, I highly recommend tablecloth.
What are some alternatives?
tech.ml.dataset - A Clojure high performance data processing system
jackdaw - A Clojure library for the Apache Kafka distributed streaming platform.
hanami - Interactive arts and charts plotting with Clojure(Script) and Vega-lite / Vega. Flower viewing 花見 (hanami)
holy-lambda - The extraordinarily simple, performant, and extensible custom AWS Lambda runtime for Clojure.
dtype-next - A Clojure library designed to aid in the implementation of high performance algorithms and systems.
kotlinx.collections.immutable - Immutable persistent collections for Kotlin
libpython-clj - Python bindings for Clojure
notespace - using your namespace as a notebook
tech.ml - This library has been superseded by https://github.com/scicloj/scicloj.ml.
frovedis - Framework of vectorized and distributed data analytics
geni-performance-benchmark