batch-inference

Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios. (by microsoft)

Batch-inference Alternatives

Similar projects and alternatives to batch-inference

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better batch-inference alternative or higher similarity.

batch-inference reviews and mentions

Posts with mentions or reviews of batch-inference. We have used some of these posts to build our list of alternatives and similar projects.
  • Tutorial to improve GPT throughput 16 times with dynamic batching
    1 project | /r/deeplearning | 18 May 2023
    I wrote a tutorial to improve GPT completion throughput with dynamic batching https://microsoft.github.io/batch-inference/examples/gpt_completion.html. And I can achieve 16 times throughput on V100 comparing to baseline. We built a python dynamic batching library so you can apply it on your own models easily https://github.com/microsoft/batch-inference.

Stats

Basic batch-inference repo stats
1
64
6.4
12 months ago

microsoft/batch-inference is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of batch-inference is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com