Convert Pegasus model to ONNX

fastT5

5 540 0.0 Python

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.

I am working on a project where I fine-tuned a Pegasus model on the Reddit dataset. Now I need to convert the fine-tuned model to ONNX for deployment. I followed this guide from Hugging Face on exporting unsupported architectures to ONNX. The export succeeded, but the resulting ONNX model can't generate text. It turns out that Pegasus is an encoder-decoder model, while most guides cover either encoder-only models (e.g. BERT) or decoder-only models (e.g. GPT-2). The only example I found of converting an encoder-decoder model to ONNX is https://github.com/Ki6an/fastT5.
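
One way to picture why a single exported graph may not be enough for text generation: an encoder-decoder model runs its encoder once over the input, then calls its decoder repeatedly, one token at a time. The sketch below is a toy illustration in plain Python, not Pegasus, fastT5, or ONNX Runtime code. The names `greedy_generate`, `toy_encode`, and `toy_decode_step` are hypothetical stand-ins, and it assumes the encoder and decoder would each be exported as a separate ONNX graph.

```python
from typing import Callable, List, Sequence

# Hypothetical stand-ins for two separately exported graphs. In a real
# deployment each callable would wrap an inference session over its own
# ONNX file; here they are plain Python functions.
EncoderFn = Callable[[Sequence[int]], List[float]]
DecoderFn = Callable[[List[float], Sequence[int]], List[float]]


def greedy_generate(
    encode: EncoderFn,
    decode_step: DecoderFn,
    input_ids: Sequence[int],
    start_id: int,
    eos_id: int,
    max_new_tokens: int = 16,
) -> List[int]:
    """Run the encoder once, then call the decoder step by step."""
    encoder_state = encode(input_ids)  # computed a single time
    output_ids = [start_id]
    for _ in range(max_new_tokens):
        # The decoder sees the encoder output plus everything generated so far.
        logits = decode_step(encoder_state, output_ids)
        next_id = max(range(len(logits)), key=logits.__getitem__)  # argmax
        output_ids.append(next_id)
        if next_id == eos_id:
            break
    return output_ids


# Toy "models" so the loop can be exercised without any ML library.
VOCAB_SIZE = 6
EOS_ID = 1


def toy_encode(input_ids: Sequence[int]) -> List[float]:
    # A real encoder returns hidden states; this one just copies the ids.
    return [float(i) for i in input_ids]


def toy_decode_step(encoder_state: List[float], output_ids: Sequence[int]) -> List[float]:
    # Copies the input token by token, then emits the end-of-sequence id.
    position = len(output_ids) - 1
    target = int(encoder_state[position]) if position < len(encoder_state) else EOS_ID
    return [1.0 if i == target else 0.0 for i in range(VOCAB_SIZE)]


result = greedy_generate(toy_encode, toy_decode_step, [3, 4, 5], start_id=0, eos_id=EOS_ID)
print(result)
```

The point of the sketch is the shape of the loop: an export that only captures one forward pass gives you the inside of `decode_step`, while the loop around it still has to be written or supplied by a library.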

Related posts

[P] boost T5 models speed up to 5x & reduce the model size by 3x using fastT5.
1 project | /r/MachineLearning | 25 Mar 2021

Creating Automatic Subtitles for Videos with Python, Faster-Whisper, FFmpeg, Streamlit, Pillow
7 projects | dev.to | 29 Apr 2024

Oracle-samples/sd4j: Stable Diffusion pipeline in Java using ONNX Runtime
2 projects | news.ycombinator.com | 1 Jan 2024

Shipping large ML models with electron
5 projects | dev.to | 11 Apr 2023

Speeding up T5
2 projects | /r/LanguageTechnology | 22 Jan 2023

This page summarizes the projects mentioned and recommended in the original post on /r/learnmachinelearning
Tags: Python, T5, ONNX, ONNX Runtime, quantization
Post date: 22 Sep 2022
