TransformerEngine vs ivy

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference. (by NVIDIA)

Source Code

docs.nvidia.com

Suggest alternative

Edit details

ivy

The Unified AI Framework (by unifyai)

Python Machine Learning Deep Learning neural-network GPU Autograd Ivy Abstraction Template Tensorflow Pytorch Mxnet Numpy Jax

Source Code

unify.ai

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

TransformerEngine		ivy
	Project
2	Mentions	17
1,428	Stars	14,021
13.1%	Growth	0.5%
9.5	Activity	10.0
4 days ago	Latest Commit	about 15 hours ago
Python	Language	Python
Apache License 2.0	License	GNU General Public License v3.0 or later

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

TransformerEngine

Posts with mentions or reviews of TransformerEngine. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-18.

Benchmarking Large Language Models on NVIDIA H100 GPUs with CoreWeave (Part 1)
1 project | /r/nvidia | 30 Apr 2023

4090 now has its 8-bit float enabled as well, see the [transformer engine issue](https://github.com/NVIDIA/TransformerEngine/issues/15)
GPUs for Deep Learning in 2023 – An In-depth Analysis
4 projects | news.ycombinator.com | 18 Jan 2023

Would be curious to see your benchmarks. Btw, Nvidia will be providing support for fp8 in a future release of CUDA - https://github.com/NVIDIA/TransformerEngine/issues/15
I think TMA may not matter as much for consumer cards given the disproportionate amount of fp32 / int32 compute that they have.
Would be interesting to see how close to theoretical folks are able to get once CUDA support comes through.

ivy

Posts with mentions or reviews of ivy. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-28.

Keras 3.0
4 projects | news.ycombinator.com | 28 Nov 2023

See also https://github.com/unifyai/ivy which I have not tried but seems along the lines of what you are describing, working with all the major frameworks
Show HN: Carton – Run any ML model from any programming language
4 projects | news.ycombinator.com | 27 Sep 2023

is this ancillary to what [these guys](https://github.com/unifyai/ivy) are trying to do?
Ivy: All in one machine learning framework
1 project | news.ycombinator.com | 1 Aug 2023
Ivy ML Transpiler and Framework
1 project | news.ycombinator.com | 1 Aug 2023
[D] Keras 3.0 Announcement: Keras for TensorFlow, JAX, and PyTorch
3 projects | /r/MachineLearning | 11 Jul 2023

https://unify.ai/ They are trying to do what Ivy is doing already.
Ask for help: what is the best way to have code both support torch and numpy?
1 project | /r/pytorch | 22 Feb 2023

Check Ivy.
CoreML Stable Diffusion
2 projects | news.ycombinator.com | 1 Dec 2022

ROCm's great for data centers, but good luck finding anything about desktop GPUs on their site apart from this lone blog post: https://community.amd.com/t5/instinct-accelerators/exploring...
There's a good explanation of AMD's ROCm targets here: https://news.ycombinator.com/item?id=28200477
It's currently a PITA to get common Python libs like Numba to even talk to AMD cards (admittedly Numba won't talk to older Nvidia cards either and they deprecate ruthlessly; I had to downgrade 8 versions to get it working with a 5yo mobile workstation). YC-backed Ivy claims to be working on unifying ML frameworks in a hardware-agnostic way but I don't have enough experience to assess how well they're succeeding yet: https://lets-unify.ai
I was happy to see DiffusionBee does talk the GPU in my late-model intel Mac, though for some reason it only uses 50% of its power right now. I'm sure the situation will improve as Metal 3.0 and Vulkan get more established.
DL Frameworks in a nutshell
1 project | /r/DataScienceMemes | 10 Sep 2022

Won't it all come together with https://lets-unify.ai/ ?
Unified Machine Learning
1 project | news.ycombinator.com | 26 Aug 2022
[Discussion] Opinions on unify AI
2 projects | /r/deeplearning | 25 Jul 2022

What do you think about unify AI https://lets-unify.ai.

What are some alternatives?

When comparing TransformerEngine and ivy you can also consider the following projects:

Whisper - High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

PaddleNLP - 👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

autocvd - Tool to automatically set CUDA_VISIBLE_DEVICES based on GPU utilization. Usable from command line and code.

ColossalAI - Making large AI models cheaper, faster and more accessible

warp-drive - Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)

DeepFaceLive - Real-time face swap for PC streaming or video calls

nanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs.

PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

fastaudio - 🔊 Audio and fastai v2

lisp - Toy Lisp 1.5 interpreter

liberate-fhe - A Fully Homomorphic Encryption (FHE) library for bridging the gap between theory and practice with a focus on performance and accuracy.

Kornia - Geometric Computer Vision Library for Spatial AI

TransformerEngine vs Whisper ivy vs PaddleNLP TransformerEngine vs autocvd ivy vs ColossalAI TransformerEngine vs warp-drive ivy vs DeepFaceLive TransformerEngine vs nanoGPT ivy vs PaddleOCR TransformerEngine vs fastaudio ivy vs lisp TransformerEngine vs liberate-fhe ivy vs Kornia

Compare TransformerEngine vs ivy and see what are their differences.

TransformerEngine

ivy

TransformerEngine

ivy

What are some alternatives?