gpt-fast Alternatives
Similar projects and alternatives to gpt-fast
- TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
- stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
gpt-fast reviews and mentions
- [D] GPT-Fast performance on larger batch sizes
I'm toying around with gpt-fast (https://github.com/pytorch-labs/gpt-fast) and was wondering if anyone has run experiments @ BS>1?
- Optimum-NVIDIA - 28x faster inference in just 1 line of code !?
- GPT-Fast: Simple and efficient GPT inference in <1000 LOC of Python
- GPT-Fast: A fast and hackable implementation of transformer inference in <1000 lines of native PyTorch with support for quantization, speculative decoding, TP, Nvidia/AMD support, and more!
And check out the code here: https://github.com/pytorch-labs/gpt-fast
- 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning
How does this compare to the PyTorch Labs optimizations for SAM and Llama 2?
https://github.com/pytorch-labs/segment-anything-fast
https://github.com/pytorch-labs/gpt-fast
- Fast and hackable PyTorch native transformer inference
- Accelerating Generative AI with PyTorch II: GPT, Fast
I'm wondering if gpt-fast has a version that can be run from Windows Command Prompt or PowerShell?
https://github.com/pytorch-labs/gpt-fast/issues/45
Stats
pytorch-labs/gpt-fast is an open source project licensed under the BSD 3-Clause "New" or "Revised" License, which is an OSI-approved license.
The primary programming language of gpt-fast is Python.