squirrel-core
squirrel-datasets-core
Our great sponsors
squirrel-core | squirrel-datasets-core | |
---|---|---|
1 | 2 | |
279 | 43 | |
2.5% | - | |
5.9 | 2.3 | |
11 days ago | 8 months ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
squirrel-core
-
[P] Squirrel: A new OS library for fast & flexible large-scale data loading
Today we open-sourced Squirrel, a data infrastructure library that my colleagues and I have been working on over the past 1.5 years: https://github.com/merantix-momentum/squirrel-core
squirrel-datasets-core
-
[P] Squirrel: A new OS library for fast & flexible large-scale data loading
Have a look at this tutorial to learn how to convert to messagepack by using Spark.
What are some alternatives?
talking-head-anime-3-demo - Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
uda - Unsupervised Data Augmentation (UDA)
ML-YouTube-Courses - 📺 Discover the latest machine learning / AI courses on YouTube.
datasaurus - Do computer vision with 1000x less data
PDEBench - PDEBench: An Extensive Benchmark for Scientific Machine Learning
podium - Podium: a framework agnostic Python NLP library for data loading and preprocessing
deeplake - Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
speaking_with_plato - Exploring Plato's philosophy with AI - A Data Spiral blog article
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
papers-with-data - A curated list of papers that released datasets along with their work
Failed-ML - Compilation of high-profile real-world examples of failed machine learning projects
chicksexer - A Python package for gender classification.