YOLOX vs Swin-Transformer-Object-Detection

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/ (by Megvii-BaseDetection)

Source Code

Suggest alternative

Edit details

Swin-Transformer-Object-Detection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation. (by SwinTransformer)

mscoco swin-transformer cascade mask-rcnn object-detection reppoints swin

Source Code

arxiv.org

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

YOLOX		Swin-Transformer-Object-Detection
	Project
12	Mentions	4
9,012	Stars	1,710
1.5%	Growth	0.7%
1.5	Activity	0.0
about 2 months ago	Latest Commit	about 1 year ago
Python	Language	Python
Apache License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

YOLOX

Posts with mentions or reviews of YOLOX. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-18.

Learning Exchange, lets training YoloX
1 project | /r/deeplearning | 1 Mar 2023

So I am trying to do my best and train YOLOX for an object detection case using Google Colab.
Understanding heatmaps
1 project | /r/computervision | 20 Feb 2023

https://github.com/Megvii-BaseDetection/YOLOX I have only tried the pretrained yolo X nano. I get corner responses even if the inference image is padded with a large margin which is unexpected
Open discussion and useful links people trying to do Object Detection
4 projects | /r/deeplearning | 18 Feb 2023

* Nice implemention of Yolo that is BSD license (not GPL) https://github.com/Megvii-BaseDetection/YOLOX
[P] Image search with localization and open-vocabulary reranking.
8 projects | /r/MachineLearning | 15 Dec 2022

I wanted to have a few choices getting localization into image search (index and search time). I immediately thought of using a region proposal network (rpn) from mask-rcnn to create patches that can also be indexed and searched (and add the localisation). I figured it might be somewhat agnostic to classes. I did not want to use mmdetection or detectron2 due to their dependencies and just getting the rpn was not worth it. I was encouraged by the PyTorch native implementations of detection/segmentation models but ended up finding yolox the best.
DeepSort with PyTorch(support yolo series)
13 projects | /r/u_No_Experience9104 | 20 Sep 2022

Megvii-BaseDetection/YOLOX
[D][P] YOLOv6: state-of-the-art object detection at 1242 FPS
4 projects | /r/MachineLearning | 28 Jun 2022
Looking for help for hire
1 project | /r/tensorflow | 7 Jun 2022

Modern video can be broken into a series of still problems. AI vision models can make these types of classification in as fast as video. Here is a particularly there is a controversial company from China that does this very well on faces in video and they have open sourced the models: https://github.com/Megvii-BaseDetection/YOLOX
High-tech
1 project | /r/funny | 29 Mar 2022

Not really a problem, see results here. Just use yolox_x. Thank you for your attention.
Advice on Masters project | Vision transformers
3 projects | /r/MLQuestions | 5 Mar 2022

From what I understand the swin transformer outputs a single dimension feature vector and the yolo head takes inputs from 3 different layers from the backbone?? and I think I will need to write the backbone implementation here.
Is YOLOX object detector NMS free?
1 project | /r/MLQuestions | 16 Nov 2021

Swin-Transformer-Object-Detection

Posts with mentions or reviews of Swin-Transformer-Object-Detection. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-03-05.

Transfer Learning on Swin Transformer as a backbone for instance segmentation using MRCNN
1 project | /r/learnmachinelearning | 27 Feb 2023

I'm currently trying to transfer learn a set of custom classes of fish, for instance segmentation. I have found the official implementation of Swin Transformer as a backbone for instance segmentation using MRCNN: https://github.com/SwinTransformer/Swin-Transformer-Object-Detection.
Advice on Masters project | Vision transformers
3 projects | /r/MLQuestions | 5 Mar 2022

Hi, So my project is to do with object detection on trash in the wild on this fairly obscure dataset: http://tacodataset.org/ and I was thinking of applying vision transformers to it for feature extraction. I was thinking of taking the YOLOX implementation and swapping out the backbone with swin transformers and perform bunch of comparisons/experiments for the write up. Sort of like how they applied swin transformers to mask R-CNN here but I am struggling to understand where to begin.
[P] I implemented DeepMind's "Perceiver" in PyTorch
3 projects | /r/MachineLearning | 15 Apr 2021

Yes, have a look at this paper.
[P] Code and pretrained models for Swin Transformer are released (SOTA models on COCO and ADE20K)
3 projects | /r/MachineLearning | 14 Apr 2021

Object detection on COCO: https://github.com/SwinTransformer/Swin-Transformer-Object-Detection

What are some alternatives?

When comparing YOLOX and Swin-Transformer-Object-Detection you can also consider the following projects:

tensorflow-yolov4-tflite - YOLOv4, YOLOv4-tiny, YOLOv3, YOLOv3-tiny Implemented in Tensorflow 2.0, Android. Convert YOLO v4 .weights tensorflow, tensorrt and tflite

Mask_RCNN - Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

tensorrt_demos - TensorRT MODNet, YOLOv4, YOLOv3, SSD, MTCNN, and GoogLeNet

Video-Swin-Transformer - This is an official implementation for "Video Swin Transformers".

PINTO_model_zoo - A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.

Swin-Transformer-Tensorflow - Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030)

yolov5 - YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Swin-Transformer-Semantic-Segmentation - This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.

RPi_64-bit_Zero-2-image - Raspberry Pi Zero 2 W 64-bit OS image with OpenCV, TensorFlow Lite and ncnn Framework.

Perceiver - Implementation of Perceiver, General Perception with Iterative Attention in TensorFlow

YOLOv6 - YOLOv6: a single-stage object detection framework dedicated to industrial applications.

Swin-Transformer-Serve - Deploy Swin Transformer using TorchServe

YOLOX vs tensorflow-yolov4-tflite Swin-Transformer-Object-Detection vs Mask_RCNN YOLOX vs tensorrt_demos Swin-Transformer-Object-Detection vs Video-Swin-Transformer YOLOX vs PINTO_model_zoo Swin-Transformer-Object-Detection vs Swin-Transformer-Tensorflow YOLOX vs yolov5 Swin-Transformer-Object-Detection vs Swin-Transformer-Semantic-Segmentation YOLOX vs RPi_64-bit_Zero-2-image Swin-Transformer-Object-Detection vs Perceiver YOLOX vs YOLOv6 Swin-Transformer-Object-Detection vs Swin-Transformer-Serve

Compare YOLOX vs Swin-Transformer-Object-Detection and see what are their differences.

YOLOX

Swin-Transformer-Object-Detection

YOLOX

Swin-Transformer-Object-Detection

What are some alternatives?