The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Python high-throughput Projects
-
warp-drive
Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Project mention: Run 70B LLM Inference on a Single 4GB GPU with This New Technique | news.ycombinator.com | 2023-12-03
NOTE:
The open source projects on this list are ordered by number of github stars.
The number of mentions indicates repo mentiontions in the last 12 Months or
since we started tracking (Dec 2020).
Python high-throughput related posts
- Run 70B LLM Inference on a Single 4GB GPU with This New Technique
- Colorful Custom RTX 4060 Ti GPU Clocks Outed, 8 GB VRAM Confirmed
- FlexGen: Running large language models on a single GPU
- FlexGen: Running large language models on a single GPU
- FlexGen: Running large language models on a single GPU
- FlexGen: Running large language models on a single GPU
- Could this new flexgen be used in place of GPTq? or is this different?
-
A note from our sponsor - WorkOS
workos.com | 27 Apr 2024
Index
Project | Stars | |
---|---|---|
1 | FlexGen | 8,999 |
2 | warp-drive | 434 |
Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com