yolov7
NUWA
Our great sponsors
yolov7 | NUWA | |
---|---|---|
33 | 23 | |
12,530 | 2,795 | |
- | 0.7% | |
4.0 | 3.3 | |
about 1 month ago | 10 months ago | |
Jupyter Notebook | ||
GNU General Public License v3.0 only | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
yolov7
- FLaNK Stack Weekly 16 October 2023
-
Train a ML model able to identify animal species
If you want something off-the-shelf, try YoloV7.
-
[D] Extracting the class labels and bounding boxes for objects, from a YOLO7 model after converting to an ONNX model
(Please note, this is a re-post of my original question here, I think this subreddit might be more appropriate for asking this question)At work, we use Unity, we have a project that needs object detection and classification. We decided to use this YOLO7 model (for non-technical reasons, It had to be the exact same model as the company does have pre-trained weights for this exact model). However, Unity only supports ONNX so I exported the model as an ONNX model, using the code provided in the repo:
-
DL for the Web: Repository of Models
Github Projects offering pretrained weights and train / run scripts. Example
- [D] yolov7 not finding GPU device: Solution
-
yolov7-pose: What does the output layers of an exported onnx model mean?
I played around with the demo code of human pose estimation of the yolov7 model (https://github.com/WongKinYiu/yolov7/tree/pose) and wanted to convert this to onnx format in order to continue in a C++ environment.
- [P] Any object detection library
-
I made the “AI invisibility cloak." Ask AI expert Tom Goldstein about security and safety of AI systems, and how to hack them.
I would add that Yolo has evolved over time, and variants of the architecture continue to come out today. Here's one that came out this summer: https://github.com/WongKinYiu/yolov7
-
What is the correct strides when add decoding level to a mlmodel converted from yolov7
there is a trained custom model with yolov7 that was converted to mlmodel with export.py (from the yolov7 repository: https://github.com/WongKinYiu/yolov7).
-
Use YOLOv5 tensorflow.js models to speed up annotation
Hi u/DDDqp! YOLOv7 is for sure on my list. Problem is that, as it is right now YOLOv7 does not offer export to tensorflow.js. I actually created the issue by asking if they plan to add that export: https://github.com/WongKinYiu/yolov7/issues/885 No response yet. But I think it is more than possible. I can even work on that export for them. If they would have exported to tfjs, than I most certainly can work on the NPM package.
NUWA
-
How long until we can create full length movies in ai ?
Github: https://github.com/microsoft/NUWA/tree/main/assets/nuwa_infinity/animation
- [D] Most Popular AI Research July 2022 pt. 2 - Ranked Based On GitHub Stars
- Most Popular AI Research July 2022 pt. 2 - Ranked Based On GitHub Stars
-
I'm building a timeline for generative image ML models. What's missing?
Microsoft NUWA: https://github.com/microsoft/NUWA
-
With so many new Text to Image "AI" emerging lately, is it not crazy to speculate about Text to Video?
Microsoft NUWA
-
Have any researchers in the field discussed anything about the prospect of 'text-to-video' - something that's a bit like DALL-E 2, but with a video as the finished output?
NÜWA from Microsoft.
-
Imagine this: complete "fake AI people" are coming, and you didn't even see this coming!
P.S., Lucidrains remade it! AND he's adding an audio transformer to it tomorrow he says! But he needs feedback and someone to train it, I don't think there is enough resources helping this project's training. You can reach him through: https://github.com/microsoft/NUWA
-
NÜWA - text to image
(from here)
What are some alternatives?
yolov3 - YOLOv3 in PyTorch > ONNX > CoreML > TFLite
edgetpu - Coral issue tracker (and legacy Edge TPU API source)
edgetpu-yolo - Minimal-dependency Yolov5 export and inference demonstration for the Google Coral EdgeTPU
YOLOv4 - Port of YOLOv4 to C# + TensorFlow
darknet - Convolutional Neural Networks
XMem - [ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
BCNet - Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]
CogVideo - Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"
CATNet - 🛰️ Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images (TNNLS 2023)
yolov5 - YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
yolact - A simple, fully convolutional model for real-time instance segmentation.
DALLE2-video - Direct application of DALLE-2 to video synthesis, using factored space-time Unet and Transformers