multimodal-maestro
segment-anything-video
| | multimodal-maestro | segment-anything-video |
|---|---|---|
| | 1 | 1 |
| Stars | 955 | 914 |
| Growth | 2.6% | - |
| Activity | 8.6 | 6.1 |
| Last commit | 3 months ago | 8 days ago |
| Language | Python | Python |
| License | MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
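The page does not state the exact formula behind the activity number, so the following is only a minimal sketch of one way such a metric could work. It assumes an exponential decay with a hypothetical 30-day half-life to weight recent commits more heavily, and a percentile rank scaled to 0-10 so that 9.0 corresponds to the top 10% of tracked projects. The function names and the half-life are illustrative assumptions, not the site's implementation.

```python
from bisect import bisect_left


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum per-commit weights that halve every `half_life_days`.

    The exponential decay and the 30-day half-life are assumptions made
    for illustration; the source only says recent commits weigh more.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_index(score, all_scores):
    """Map a raw score to a 0-10 scale by percentile rank.

    An index of 9.0 means 90% of the tracked projects have a strictly
    lower raw score, i.e. the project is in the top 10%.
    """
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # projects with a strictly lower score
    return round(10.0 * below / len(ranked), 1)


# Usage: three commits in the last week outweigh three commits from months ago.
recent = weighted_commit_score([1, 3, 7])
stale = weighted_commit_score([90, 120, 200])
```

Ranking by percentile rather than by the raw score keeps the number comparable across projects of very different sizes, which matches the "relative number" wording above.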
What are some alternatives?
Mask_RCNN - Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
anylabeling - Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything, MobileSAM!!
InternGPT - InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
yolo_series_deepsort_pytorch - DeepSORT with the YOLO series. This project supports the existing YOLO detection models (YOLOV8, YOLOV7, YOLOV6, YOLOV5, YOLOV4Scaled, YOLOV4, YOLOv3, PPYOLOE, YOLOR, YOLOX).
LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
yolov7_d2 - 🔥🔥🔥🔥 (An earlier YOLOv7, not the official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥
multi_token - Embed arbitrary modalities (images, audio, documents, etc) into large language models.
mmyolo - OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.
yolov8-face - YOLOv8 face detection with landmarks
inference - A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.