lithops
squirrel-core
lithops | squirrel-core | |
---|---|---|
2 | 1 | |
306 | 279 | |
2.3% | 0.7% | |
9.5 | 5.6 | |
4 days ago | 8 days ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lithops
- Lithops: A multi-cloud framework for embarrassingly parallel jobs
-
[D] For those of you who don't own a GPU, how do you run your experiments or train your models?
At work for non-ML/non-GPU stuff I've been using Lithops for running code on dynamically-provisioned cloud resources (serverless or VM). It pickles your code & runtime variables, sends them to cloud storage, runs the code & downloads the results, all relatively transparently. You're just calling Python functions with Python objects on your local computer and not having to worry about deploying your code, packaging your data, etc. Better still, you can scale up for things like hyperparameter sweeps by just dispatching more calls in parallel, and it will provision more resources.
squirrel-core
-
[P] Squirrel: A new OS library for fast & flexible large-scale data loading
Today we open-sourced Squirrel, a data infrastructure library that my colleagues and I have been working on over the past 1.5 years: https://github.com/merantix-momentum/squirrel-core
What are some alternatives?
secrets-env - Extension to the crystal lang ENV module to support reading secrets
squirrel-datasets-core - Squirrel dataset hub
treequeues - High performance queues for pytree objects.
talking-head-anime-3-demo - Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
pyspark-on-aws-emr - The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
ML-YouTube-Courses - 📺 Discover the latest machine learning / AI courses on YouTube.
PDEBench - PDEBench: An Extensive Benchmark for Scientific Machine Learning
deeplake - Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
speaking_with_plato - Exploring Plato's philosophy with AI - A Data Spiral blog article
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
papers-with-data - A curated list of papers that released datasets along with their work
Failed-ML - Compilation of high-profile real-world examples of failed machine learning projects