clipseg
YOLOX
Metric | clipseg | YOLOX
---|---|---
Mentions | 7 | 12
Stars | 1,022 | 9,042
Growth | - | 1.1%
Activity | 3.8 | 1.0
Last commit | 4 months ago | 13 days ago
Language | Python | Python
License | GNU General Public License v3.0 or later | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
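As a rough illustration of the Growth figure, the sketch below uses the conventional month-over-month formula. The page does not state the exact formula it uses, so both helper functions are assumptions, and the previous-month star count is back-calculated from the table rather than taken from any source.

```python
def month_over_month_growth(previous_stars, current_stars):
    """Percentage change in stars from one month to the next (assumed formula)."""
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars * 100.0


def implied_previous_stars(current_stars, growth_percent):
    """Back out last month's star count from the current count and a growth rate."""
    return current_stars / (1.0 + growth_percent / 100.0)


if __name__ == "__main__":
    # YOLOX row from the table: 9,042 stars and 1.1% growth.
    previous = round(implied_previous_stars(9042, 1.1))
    print(previous)
    print(round(month_over_month_growth(previous, 9042), 1))
```

Under this assumed formula, 1.1% growth on roughly 9,000 stars corresponds to about a hundred new stars in a month.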
clipseg
- How to blend a logo or clip art to a design
Following the comments to this old post, I tried to use in-painting with manual mask selection. I didn't get beautiful results, but I'm sure that with some tweaking I could make it better. The main problem I had was having to manually select the area where I wanted to place the logo and trying to resize my logo mask to fit the segment. I tried some automatic segmentation tools (Clipseg and Segment Anything), but I couldn't tell the segmentation models to find a good area for logo placement (i.e. some small flat surface). Given the complexity of what I was dealing with, I think there could be a better way (XY problem).
- New Feature: "ZOOM ENHANCE" for the A111 WebUI. Automatically fix small details like faces and hands!
The addon utilizes clipseg for region masking, which was trained on "an extended version of the PhraseCut dataset" (many thousands of images.)
- Txt2mask just received a big update!! 🎅
You'll also need to make sure to update your clipseg repo. The script won't do this for you. Namely, you just need to update this models/clipseg.py file so that your clipseg supports the new model.
- [P] Image search with localization and open-vocabulary reranking.
For localisation at search time I ended up using OWL-ViT. This worked really well. I did not try Detic or CLIPseg but would be interested to hear if anyone else has tried these?
- Who needs prompt2prompt anyway? SD 1.5 inpainting model with clipseg prompt for "hair" and various prompts for different hair colors
clipseg is an image segmentation method used to find a mask for an image from a prompt. I implemented it as an executor for dalle-flow and added it to my bot yasd-discord-bot.
- txt2mask working in imaginAIry python library
Automated Replacement (txt2mask) by clipseg
- txt2mask was just released! We don't have to use the brush tool for inpainting anymore!
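One of the posts above describes clipseg as a way to find a mask for an image from a text prompt. The final step of such a pipeline can be sketched as follows, assuming the model returns one raw score (logit) per pixel for the prompt. The function names, the 0.5 threshold, and the example values are all hypothetical, and no clipseg code is called here.

```python
import math


def sigmoid(x):
    """Map a raw score to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-x))


def logits_to_mask(logits, threshold=0.5):
    """Turn a 2D grid of per-pixel logits into a binary mask (1 = matches the prompt)."""
    return [[1 if sigmoid(v) >= threshold else 0 for v in row] for row in logits]


# Hypothetical 3x3 logit map standing in for a model's output for a prompt such as "hair".
example_logits = [
    [-4.0, -1.0, 0.5],
    [-2.0, 2.0, 3.0],
    [-3.0, 1.5, -0.2],
]

mask = logits_to_mask(example_logits)
for row in mask:
    print(row)
```

A mask like this is what an inpainting tool would then use in place of a hand-drawn brush selection. Raising the threshold gives a tighter mask, and lowering it gives a more generous one.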
YOLOX
- Learning Exchange, let's train YoloX
So I am trying to do my best and train YOLOX for an object detection case using Google Colab.
- Understanding heatmaps
https://github.com/Megvii-BaseDetection/YOLOX I have only tried the pretrained YOLOX nano model. I get corner responses even if the inference image is padded with a large margin, which is unexpected.
- Open discussion and useful links people trying to do Object Detection
* Nice implementation of YOLO that is permissively licensed (Apache 2.0, not GPL) https://github.com/Megvii-BaseDetection/YOLOX
- [P] Image search with localization and open-vocabulary reranking.
I wanted to have a few options for getting localization into image search (at index and search time). I immediately thought of using a region proposal network (RPN) from Mask R-CNN to create patches that can also be indexed and searched (and add the localization). I figured it might be somewhat agnostic to classes. I did not want to use mmdetection or detectron2 due to their dependencies, and pulling them in just to get the RPN was not worth it. I was encouraged by the PyTorch-native implementations of detection/segmentation models but ended up finding yolox the best.
- DeepSort with PyTorch (supports YOLO series)
Megvii-BaseDetection/YOLOX
- [D][P] YOLOv6: state-of-the-art object detection at 1242 FPS
- Looking for help for hire
Modern video can be broken into a series of still-image problems. AI vision models can make these types of classifications as fast as the video itself. In particular, there is a controversial company from China that does this very well on faces in video, and they have open-sourced the models: https://github.com/Megvii-BaseDetection/YOLOX
- High-tech
Not really a problem, see results here. Just use yolox_x. Thank you for your attention.
- Advice on Masters project | Vision transformers
From what I understand, the Swin Transformer outputs a single one-dimensional feature vector, whereas the YOLO head takes inputs from three different layers of the backbone. I think I will need to write the backbone implementation here.
- Is YOLOX object detector NMS free?
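The Masters-project post above mentions that the YOLO head takes inputs from three backbone layers. A small arithmetic sketch shows what that implies for feature-map sizes. It assumes strides of 8, 16 and 32 and one prediction per grid cell, which is my understanding of the YOLOX defaults rather than something stated on this page. The function names are hypothetical.

```python
def grid_sizes(input_size, strides=(8, 16, 32)):
    """Side length of the feature map at each stride, for a square input image."""
    return [input_size // s for s in strides]


def total_predictions(input_size, strides=(8, 16, 32)):
    """Number of grid cells across all levels (one prediction per cell is assumed)."""
    return sum(g * g for g in grid_sizes(input_size, strides))


if __name__ == "__main__":
    print(grid_sizes(640))         # [80, 40, 20]
    print(total_predictions(640))  # 8400
```

A replacement backbone, such as a Swin Transformer, would therefore need to expose three feature maps at matching resolutions rather than a single pooled vector.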
What are some alternatives?
stable-diffusion - Latent Text-to-Image Diffusion
tensorflow-yolov4-tflite - YOLOv4, YOLOv4-tiny, YOLOv3, YOLOv3-tiny implemented in TensorFlow 2.0, Android. Convert YOLO v4 .weights to TensorFlow, TensorRT and TFLite
Detic - Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
Swin-Transformer-Object-Detection - This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
LAVIS - LAVIS - A One-stop Library for Language-Vision Intelligence
tensorrt_demos - TensorRT MODNet, YOLOv4, YOLOv3, SSD, MTCNN, and GoogLeNet
imaginAIry - Pythonic AI generation of images and videos
PINTO_model_zoo - A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.
txt2mask - Automatically create masks for Stable Diffusion inpainting using natural language.
yolov5 - YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
dalle-flow - 🌊 A Human-in-the-Loop workflow for creating HD images from text
RPi_64-bit_Zero-2-image - Raspberry Pi Zero 2 W 64-bit OS image with OpenCV, TensorFlow Lite and ncnn Framework.