tensorrtx
gpt-2
tensorrtx | gpt-2 | |
---|---|---|
3 | 64 | |
6,584 | 21,146 | |
- | 1.1% | |
8.4 | 2.5 | |
6 days ago | 25 days ago | |
C++ | Python | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tensorrtx
-
A Three-pronged Approach to Bringing ML Models Into Production
In terms of the latter, this is quite common when employing non-standard SOTA models. You may discover a variety of TensorRT implementations on the web if you want to use popular models—for example, in the project where we needed to train an object-detection algorithm on Rutorch and deploy it on Triton, we used many cases of PyTorch -> TensorRT -> Triton. The implementation of the model on TensoRT was taken from here. You may also be interested in this repository, as it contains many current implementations supported by developers.
-
Dall-E 2
I'll try them out. I have an RTX 2070, which apparently supports fp16. But it only has 8GB RAM.
I used the instructions here to check: https://github.com/wang-xinyu/tensorrtx/blob/master/tutorial...
-
Increasing usb cam FPS with Yolov5 on a Jetson Xavier NX?
Optimize your model using TensorRT. There is a good implementation here: https://github.com/wang-xinyu/tensorrtx/tree/master/yolov5
gpt-2
-
What are LLMs? An intro into AI, models, tokens, parameters, weights, quantization and more
Medium models: Roughly between 1B to 10B parameters. This is where Mistral 7B, Phi-3, Gemma from Google DeepMind, and wizardlm2 sit. Fun fact: GPT 2 was a medium sized model, much smaller than its latest versions.
- Sam Altman is still trying to return as OpenAI CEO
- Build Personal ChatGPT Using Your Data
-
Are the recent advancements in AI technology primarily driven by recent discoveries or the progress in hardware capabilities and the abundance of available data?
"Our model, called GPT-2 (a successor to GPT), was trained simply to predict the next word in 40GB of Internet text. Due to our concerns about malicious applications of the technology, we are not releasing the trained model. As an experiment in responsible disclosure, we are instead releasing a much smaller model for researchers to experiment with, as well as a technical paper. "
-
BING IS NOW THE DEFAULT SEARCH FOR CHATGPT
They did release GPT-2 under the MIT License.
-
Don Knuth Plays with ChatGPT
Did you arrive at this certainty through reading something other than what OpenAI has published? The document [0] that describes the training data for GPT-2 makes this assertion hilarious to me.
[0]: https://github.com/openai/gpt-2/blob/master/model_card.md#da...
- Was frustriert euch an der Nutzung oder der Diskussion um KI?
- The AI
-
Help with pet project to learn - Running ChatGPT-2 at home
I made a clone of https://github.com/openai/gpt-2 on my local laptop
- По поводу опасности ИИ и предложений остановить разработки на 6 месяцев.
What are some alternatives?
TensorRT - NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
dalle-mini - DALL·E Mini - Generate images from a text prompt
tensorflow-yolov4-tflite - YOLOv4, YOLOv4-tiny, YOLOv3, YOLOv3-tiny Implemented in Tensorflow 2.0, Android. Convert YOLO v4 .weights tensorflow, tensorrt and tflite
minGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
v-diffusion-pytorch - v objective diffusion inference code for PyTorch.
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
dalle-2-preview
sentencepiece - Unsupervised text tokenizer for Neural Network-based text generation.
SegmentationCpp - A c++ trainable semantic segmentation library based on libtorch (pytorch c++). Backbone: VGG, ResNet, ResNext. Architecture: FPN, U-Net, PAN, LinkNet, PSPNet, DeepLab-V3, DeepLab-V3+ by now.
jukebox - Code for the paper "Jukebox: A Generative Model for Music"