Python segment-anything

Open-source Python projects categorized as segment-anything

Top 23 Python segment-anything Projects

  • Track-Anything

    Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

  • Project mention: Keying/masking person on a footage | /r/davinciresolve | 2023-10-21

    I was doing rotoscoping for a silhouette of a girl dancing in front of a building, then I saw this amazing tool: https://github.com/gaomingqi/Track-Anything

  • sam-hq

    Segment Anything in High Quality [NeurIPS 2023]

  • Project mention: 12-Jun-2023 | /r/dailyainews | 2023-06-12

    Segment Anything in High Quality (https://arxiv.org/abs/2306.01567)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • sd-webui-segment-anything

    Segment Anything for Stable Diffusion WebUI

  • Project mention: Textual inversion. The best way to prepare photos of a person? | /r/StableDiffusion | 2023-12-06

    One idea would be to use Segment Anything to cut out the character/face from the background and then replace with random backgrounds that you generate with stable diffusion. Here's an extension for Automatic1111 :) https://github.com/continue-revolution/sd-webui-segment-anything

  • InternGPT

    InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

  • Project mention: How do I use the programs on Github? | /r/github | 2023-06-16

    You can also create an issue and ask the developers for help.

  • segment-geospatial

    A Python package for segmenting geospatial data with the Segment Anything Model (SAM)

  • Project mention: Anyone using segment-geospatial? | /r/gis | 2023-04-28

    Maybe try reporting as a GitHub issue? https://github.com/opengeos/segment-geospatial/issues

  • anylabeling

    Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything, MobileSAM!!

  • Project mention: AnyLabeling Auto-labeling with MobileSAM - the newest and fastest variant of Segment Anything | /r/computervision | 2023-06-28

    Check out AnyLabeling v0.3.2 today: https://github.com/vietanhdev/anylabeling/releases/tag/v0.3.2.

  • Caption-Anything

    Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • autodistill

    Images to inference with no labeling (use foundation models to train supervised models).

  • Project mention: Ask HN: Who is hiring? (February 2024) | news.ycombinator.com | 2024-02-01

    Roboflow | Open Source Software Engineer, Web Designer / Developer, and more. | Full-time (Remote, SF, NYC) | https://roboflow.com/careers?ref=whoishiring0224

    Roboflow is the fastest way to use computer vision in production. We help developers give their software the sense of sight. Our end-to-end platform[1] provides tooling for image collection, annotation, dataset exploration and curation, training, and deployment.

    Over 250k engineers (including engineers from 2/3 Fortune 100 companies) build with Roboflow. We now host the largest collection of open source computer vision datasets and pre-trained models[2]. We are pushing forward the CV ecosystem with open source projects like Autodistill[3] and Supervision[4]. And we've built one of the most comprehensive resources for software engineers to learn to use computer vision with our popular blog[5] and YouTube channel[6].

    We have several openings available but are primarily looking for strong technical generalists who want to help us democratize computer vision and like to wear many hats and have an outsized impact. Our engineering culture is built on a foundation of autonomy & we don't consider an engineer fully ramped until they can "choose their own loss function". At Roboflow, engineers aren't just responsible for building things but also for helping us figure out what we should build next. We're builders & problem solvers; not just coders. (For this reason we also especially love hiring past and future founders.)

    We're currently hiring full-stack engineers for our ML and web platform teams, a web developer to bridge our product and marketing teams, several technical roles on the sales & field engineering teams, and our first applied machine learning researcher to help push forward the state of the art in computer vision.

    [1]: https://roboflow.com/?ref=whoishiring0224

    [2]: https://roboflow.com/universe?ref=whoishiring0224

    [3]: https://github.com/autodistill/autodistill

    [4]: https://github.com/roboflow/supervision

    [5]: https://blog.roboflow.com/?ref=whoishiring0224

    [6]: https://www.youtube.com/@Roboflow

  • multimodal-maestro

    Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

  • Project mention: Show HN: Multimodal Maestro – Prompt tools for use with LMMs | news.ycombinator.com | 2023-11-29
  • sd-webui-inpaint-anything

    Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

  • Project mention: How to send masked photo to img2img inpainting | /r/StableDiffusion | 2023-10-12

    Hey everyone,Today I installed inpaint-anything extension from github(https://github.com/Uminosachi/sd-webui-inpaint-anything) to Stable Diffusion. It works perfectly.

  • segment-anything-video

    MetaSeg: Packaged version of the Segment Anything repository

  • Project mention: Sharing an automatic tool I built for video background removal (free and better than runwayml) | /r/StableDiffusion | 2023-05-04
  • sam-pt

    SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

  • Project mention: using AI to fill the scenes vertically | /r/StableDiffusion | 2023-07-12

    Part of this process can be a remastering step also for old videos. Masking backgrounds across frames and performing super-resolution with all known references and scaling details for characters using fine-tuned models for each actor. We have a lot new SAM tools to assist with this process. It probably won't be magically done for a while, but a few people could remaster a show rather than a large team.

  • GLEE

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale (by FoundationVision)

  • Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01
  • SegmentAnythingin3D

    Segment Anything in 3D with NeRFs (NeurIPS 2023)

  • Project mention: When SAM Meets NeRF: This AI Model Can Segment Anything in 3D | /r/machinelearningnews | 2023-05-22
  • awesome-foundation-and-multimodal-models

    👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

  • Project mention: Foundation Multimodal Models | news.ycombinator.com | 2024-03-01
  • comfyui_segment_anything

    Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.

  • Project mention: How to blend a logo or clip art to a design | /r/StableDiffusion | 2023-12-08

    Following the comments to this old post, I tried to use in-painting with manual mask selection. I didn't get beautiful results but I'm sure with some tweaking, I could make it better. The main problem I had was having to manually select the area where I wanted to place the logo and trying to resize my logo mask to the fit the segment. I tried some automatic segmentation tools (Clipseg and Segment Anything). I couldn't tell the segmentation models to find a good area to for logo placement (i.e. some small flat surface). Given the complexity of what I was dealing with, I think there could be a better way (XY problem).

  • TinySAM

    Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

  • Project mention: TinySAM – trim-down version of Sam | news.ycombinator.com | 2023-12-26
  • Instruct2Act

    Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

  • Project mention: [R]Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model | /r/MachineLearning | 2023-05-20

    Code: https://github.com/OpenGVLab/Instruct2Act

  • inpaint-anything

    Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything. (by Uminosachi)

  • Project mention: How to change / custom Model ID of HuggingFace Model | /r/StableDiffusion | 2023-10-15

    I am using Inpaint-Anything. In that they are using these & these type of models for inpainting, I think they called diffusers, sorry if I am wrong idk couz I am new to this and I want to create absolutereality_v181INPAINTING.safetensors model for inpainting like above type of models.

  • Hi-SAM

    [arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

  • Project mention: Hi-Sam: Marrying Segment Anything Model for Hierarchical Text Segmentation | news.ycombinator.com | 2024-02-21
  • zero-shot-prediction-plugin

    Run zero-shot prediction models on your data

  • Project mention: Zero-Shot Prediction Plugin for FiftyOne | dev.to | 2024-03-13

    fiftyone plugins download https://github.com/jacobmarks/zero-shot-prediction-plugin

  • samat

    SAM Annotaton Tool

  • Project mention: [P] Made a simple semantic segmentation annotation tool with segment-anything masks support in PyQt5 | /r/MachineLearning | 2023-09-26

    I just open-sourced (MIT License) semantic segmentation annotation tool powered by segment-anything model that I used for a while in my projects. Hopefully it will help someone as it seems to me that it is more suitable for small projects than popular huge web based annotation tools. Link to the project: SAMAT (any feedback in Discussions section on GitHub is appreciated) Features:

  • sam-clip

    Use Grounding DINO, Segment Anything, and CLIP to label objects in images.

  • Project mention: SAM-CLIP: Combine Grounded Sam and Clip for Instance Segmentation | news.ycombinator.com | 2023-11-17
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python segment-anything related posts

Index

What are some of the best open-source segment-anything projects in Python? This list will help you:

Project Stars
1 Track-Anything 6,090
2 sam-hq 3,347
3 sd-webui-segment-anything 3,192
4 InternGPT 3,121
5 segment-geospatial 2,652
6 anylabeling 1,852
7 Caption-Anything 1,600
8 autodistill 1,529
9 multimodal-maestro 942
10 sd-webui-inpaint-anything 914
11 segment-anything-video 907
12 sam-pt 905
13 GLEE 879
14 SegmentAnythingin3D 785
15 awesome-foundation-and-multimodal-models 509
16 comfyui_segment_anything 384
17 TinySAM 356
18 Instruct2Act 254
19 inpaint-anything 154
20 Hi-SAM 120
21 zero-shot-prediction-plugin 23
22 samat 23
23 sam-clip 20

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com