lang-segment-anything

SAM with text prompt (by luca-medeiros)

Lang-segment-anything Alternatives

Similar projects and alternatives to lang-segment-anything

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better lang-segment-anything alternative or higher similarity.

lang-segment-anything reviews and mentions

Posts with mentions or reviews of lang-segment-anything. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-23.
  • Show HN: OK-Robot: open, modular home robot framework for pick-and-drop anywhere
    5 projects | news.ycombinator.com | 23 Feb 2024
    User fishbotics already answers a lot of these questions downstream, but just confirming it here as an author of the project/paper:

    > - How does it know what objects are? Does it use some sort of realtime object classifier neural net? What limitations are there here?

    We use Lang-SAM (https://github.com/luca-medeiros/lang-segment-anything) to do most of this, with CLIP embeddings (https://openai.com/research/clip) doing most of the heavy lifting of connecting image and text. One of the nice properties of using CLIP-like models is that you don't have to specify the classes you may want to query later, you can just come up with them during runtime.

    > - Does the robot know when it can't perform a request? I.e. if you ask it to move a large box or very heavy kettlebell?

    Nope! As it is right now, the models are very simple and they don't try to do anything fancy. However, that's why we open up our code! So the community can build smarter robots on top of this project that can use even more visual cues about the environment.

    > - How well does it do if the object is hidden or obscured? Does it go looking for it? What if it must move another object to get access to the requested one?

    It fails when the object is hidden or obscured in the initial scan, but once again we think it could be a great starting point for further research :) One of the nice things, however, is that we take full 3D information in consideration, and so even if some object is visible from only some of the angles, the robot has a chance to find it.

  • FLaNK Stack Weekly for 30 Oct 2023
    24 projects | dev.to | 30 Oct 2023
  • Language Segment Anything
    1 project | news.ycombinator.com | 6 Oct 2023
  • Is the Segment Anything Model (SAM) useful for actual object detection?
    1 project | /r/computervision | 17 Apr 2023
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 9 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Stats

Basic lang-segment-anything repo stats
4
1,194
6.5
25 days ago

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com