-
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
llamazoo
Discontinued Large Model Collider - The Platform for serving LLM models [Moved to: https://github.com/gotzmann/collider]
The repo can be found here, the readme is not up-to-date. The code is a bit messy.
As /u/RabbitHole32 already mentioned, the speed increase stems from a patch which modifies, how a certain, large tensor is distributed between the GPU's. The patch was created by /u/emvw7yf. Here you can find the respective GitHub issue: https://github.com/huggingface/accelerate/issues/1394
Looks like exactly same idea I'm doing right now with LLaMAZoo: https://github.com/gotzmann/llamazoo
Related posts
-
NPi – An Open Source project for enhancing AI Agents in taking action
-
Show HN: K8sAI – open-source GPT CLI tool for Kubernetes
-
SUQL: Conversational Search over Structured and Unstructured Data with LLMs
-
Show HN: FileKitty – Combine and label text files for LLM prompt contexts
-
Ask HN: Freelancer? Seeking freelancer? (May 2024)