Detic
lang-seg
Detic | lang-seg | |
---|---|---|
11 | 2 | |
1,769 | 672 | |
1.0% | 2.2% | |
1.9 | 1.1 | |
about 1 month ago | 5 months ago | |
Python | Jupyter Notebook | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Detic
-
Autodistill: A new way to create CV models
Some of the foundation/base models include: * GroundedSAM (Segment Anything Model) * DETIC * GroundingDINO
-
[P] Image search with localization and open-vocabulary reranking.
For localisation at search time I ended up using OWL-ViT. This worked really well. I did not try Detic or CLIPseg but would be interested to hear if anyone else has tried these?
-
training object detector using classified images?
git clone https://github.com/facebookresearch/Detic cd Detic pip install -r requirements python demo.py --config-file configs/Detic_LCOCOI21k_CLIP_SwinB_896b32_4x_ft4x_max-size.yaml --input desk.jpg --output out.jpg --vocabulary lvis --opts MODEL.WEIGHTS models/Detic_LCOCOI21k_CLIP_SwinB_896b32_4x_ft4x_max-size.pth
-
[P] Any object detection library
You might want to take a look at DETIC : https://github.com/facebookresearch/Detic (Open Vocabulary Object Detection, trained on thousands of classes)
-
[P] Awesome Image Segmentation Project Based on Deep Learning (5.6k star)
Are there any open-label segmentation model included in this repo, like Detic or LSeg?
-
[R] CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory + Code + Robot demo
We made this using pretty recent advances in web-data pretrained models like Detic and LSeg for detection, CLIP for visual queries, and Sentence BERT for semantic queries. Our "database" is really a neural field (Instant NGP) that maps from 3D coordinates to a high dimensional embedding vector in the same representation space as CLIP and SBERT.
-
[P] Using OpenAI's CLIP repository as a support, I was able to create a software to detect anything in an image at its original resolution!
Is it similar to the open vocabulary detic?
-
Researchers at Meta and the University of Texas at Austin Propose ‘Detic’: A Method to Detect Twenty-Thousand Classes using Image-Level Supervision
Code for https://arxiv.org/abs/2201.02605 found: https://github.com/facebookresearch/Detic
- Detecting Twenty-thousand Classes using Image-level Supervision
-
[R] Detecting Twenty-thousand Classes using Image-level Supervision
github: https://github.com/facebookresearch/Detic
lang-seg
-
[P] Awesome Image Segmentation Project Based on Deep Learning (5.6k star)
Are there any open-label segmentation model included in this repo, like Detic or LSeg?
-
[R] CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory + Code + Robot demo
We made this using pretty recent advances in web-data pretrained models like Detic and LSeg for detection, CLIP for visual queries, and Sentence BERT for semantic queries. Our "database" is really a neural field (Instant NGP) that maps from 3D coordinates to a high dimensional embedding vector in the same representation space as CLIP and SBERT.
What are some alternatives?
GroundingDINO - Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
clip-fields - Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields
FasterRCNN - Clean and readable implementations of Faster R-CNN in PyTorch and TensorFlow 2 with Keras.
sentence-transformers - Multilingual Sentence & Image Embeddings with BERT
ultralytics - NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
CLIP - CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
segment-anything - The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
instant-ngp - Instant neural graphics primitives: lightning fast NeRF and more
clipseg - This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".
super-gradients - Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.