[P] Optimization of Hugging Face Transformer models to get inference latency below 1 millisecond, plus deployment on a production-ready inference server

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

  • triton_transformers

Discontinued: Deploy optimized transformer-based models in production. [Moved to: https://github.com/ELS-RD/transformer-deploy]

Will you also be adding OpenVINO for a CPU implementation to the repo?

  • optuna

    A hyperparameter optimization framework

There are plenty of options for doing that in open source, the best known being Optuna (https://github.com/optuna/optuna).


NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives; a higher number therefore means a more popular project.
