clojask
tablecloth
clojask | tablecloth | |
---|---|---|
5 | 10 | |
114 | 274 | |
- | 3.3% | |
4.2 | 9.1 | |
9 months ago | 23 days ago | |
Clojure | HTML | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
clojask
-
Data-recur meeting 2: general monthly - updates about Clojask, ds4clj, and more
Among other things, we will have a brief intro to Clojask - a library for parallel computing of larger-than-memory datasets developed at HKU Business School.
-
Question about data engineer in clojure
You can give Clojask a try, it's designed for larger-than-memory datasets. https://github.com/clojure-finance/clojask
-
A data science course for Clojurians – are you interested?
You could give Clojask a try. If you need to read from different file types other than .csv, you can also use the Clojask "plug-in" called clojask-io
- Clojask – data processing with parallel computing on larger-than-memory datasets
-
Clojask: A parallel data processing framework that is designed for large datasets
Clojask is a data processing framework that is designed for large datasets, inspired by Dask, Spark and NoSQL databases.
tablecloth
-
Is there a library for rank polymorphism in clojure?
Another, and definitely better for serious projects, approach is to use https://github.com/scicloj/tablecloth or things that it mentions - tech.ml.dataset and dtype-next.
-
Data-recur meeting 4: an intro to Tablecloth
The forth meeting will be at the end of October and will be dedicated to the Tablecloth dataset manipulation library by generateme, with an intro by Ethan Miller, who is nowadays involved in developing Tablecloth.
-
Best Data Tools for my use case
I really like geni: it is really idiomatic in its approach to Apache Spark. There are some gaps (no UDFs), and I am not sure that the project is as active as it used to be. But I still use it and find it very nice (I do have Apache Spark background already). tablecloth is an alternative dataframe library that is being used by a lot of folks in the Clojure data science world. For that matter, you should check out scicloj, and also hang out in the data channel in zulip.
- Why Clojure is not widely adopted like mainstream languages?
-
re:Clojure 2021 workshop: Wrangling datasets with Tablecloth by Mey Beisaron (2021-11-07)
At this re:Clojure workshop (Nov. 7th), @ladymeyy taught us about Tablecloth.
- On Sunday: a workshop by Mey Beisaron about Tablecloth
-
Scicloj ml-study 15: data visualization
In both sessions, we will practice data visualization on a real-world data problem. Among other things, we will try a new data visualization library that Ashima Panjwani is working on. We will assume basic familiarity with Clojure and with Tablecloth. Both sessions will be independent, probably overlapping in content.
-
Scicloj study sessions this weekend: data wrangling with Tablecloth
We are planning some Scicloj study sessions this weekend about data wrangling with Tablecloth.
- LLVM!
-
Clojure High Performance Data Processing System
And in general for R integration and more data science goodies checkout scicloj and in the vein of dplyr style extremely thought out interfaces I highly recommend tablecloth.
What are some alternatives?
cascalog - Data processing on Hadoop without the hassle.
tech.ml.dataset - A Clojure high performance data processing system
geni - A Clojure dataframe library that runs on Spark
hanami - Interactive arts and charts plotting with Clojure(Script) and Vega-lite / Vega. Flower viewing 花見 (hanami)
dtype-next - A Clojure library designed to aid in the implementation of high performance algorithms and systems.
clojask-io - Reading and writing various file formats for Clojask: clojask-io is a library designed to extend the file support for Clojask. This library can also be used alone to read in and output dataset files.
libpython-clj - Python bindings for Clojure
tech.ml - This library has been superceded by https://github.com/scicloj/scicloj.ml.
geni-performance-benchmark
deep-diamond - A fast Clojure Tensor & Deep Learning library
notespace - using your namespace as a notebook