SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python segment-anything Projects
-
Project mention: Show HN: Using YOLO to Detect Office Chairs in 40M Hotel Photos | news.ycombinator.com | 2025-01-25
They did it on their own computer. https://github.com/ultralytics/ultralytics
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
-
-
segment-geospatial
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
-
InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
-
anylabeling
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
-
autodistill
Images to inference with no labeling (use foundation models to train supervised models).
-
Sevalla
Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
-
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
-
awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
-
OpenAdapt
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Project mention: FastVLM: Dramatically Faster Vision Language Model from Apple | news.ycombinator.com | 2025-05-12 -
sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
-
GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale (by FoundationVision)
-
wunjo.wladradchenko.ru
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
I’ve been building Wunjo, an Open Source AI-powered video editing tool that can today automatically cut, highlight, and transform videos with a simple text prompt. Sounds cool, right? Yet, getting to 1K stars on GitHub feels like an endless grind. This is a set of tools in software to optimization process of video, photo editing and API (API Docs) inside for other pet-projects.
-
-
-
comfyui_segment_anything
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.
-
-
Almost everyone has heard of libraries like OpenCV, Pytorch, and Torchvision. But there have been incredible leaps and bounds in other libraries to help support new tasks that have helped push research even further. It would be impossible to thank each and every project and the thousands of contributors who have helped make the entire community better. MedSAM2 has been helping bring the awesomeness of SAM2 to the medical field, segmenting organs in a variety of medical imaging methods. Rerun has made it easier than ever to stream multimodal data for spatial and embodied AI.
-
awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
-
TinySAM
[AAAI 2025] Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"
-
Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
-
samexporter
Exporting Segment Anything, MobileSAM, and Segment Anything 2 into ONNX format for easy deployment
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python segment-anything discussion
Python segment-anything related posts
-
Why My Open Source Project Wunjo Can’t Reach 1K Stars? 😢
-
How to blend a logo or clip art to a design
-
Textual inversion. The best way to prepare photos of a person?
-
Keying/masking person on a footage
-
AnyLabeling Auto-labeling with MobileSAM - the newest and fastest variant of Segment Anything
-
Best way to mask images automatically?
-
Advice for multi-animal tracking for scientific research?
-
A note from our sponsor - SaaSHub
www.saashub.com | 1 Sep 2025
Index
What are some of the best open-source segment-anything projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | ultralytics | 44,791 |
2 | Track-Anything | 6,795 |
3 | sd-webui-segment-anything | 3,508 |
4 | segment-geospatial | 3,391 |
5 | InternGPT | 3,216 |
6 | anylabeling | 2,801 |
7 | autodistill | 2,384 |
8 | Caption-Anything | 1,754 |
9 | awesome-openai-vision-api-experiments | 1,684 |
10 | OpenAdapt | 1,365 |
11 | sd-webui-inpaint-anything | 1,272 |
12 | GLEE | 1,149 |
13 | wunjo.wladradchenko.ru | 1,062 |
14 | sam-pt | 1,011 |
15 | segment-anything-video | 986 |
16 | comfyui_segment_anything | 974 |
17 | SegmentAnythingin3D | 960 |
18 | Medical-SAM2 | 808 |
19 | awesome-foundation-and-multimodal-models | 630 |
20 | TinySAM | 506 |
21 | Instruct2Act | 365 |
22 | samexporter | 349 |
23 | Hi-SAM | 308 |