[R] CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory + Code + Robot demo

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

  • clip-fields

    Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields

  • The best part, I believe, is that you should be able to train your own CLIP-Field for your living room if you have an hour, a decent GPU, and a way to capture RGB-D video (an iPhone 13 Pro works great!). I hope you can give the code a try: https://github.com/notmahi/clip-fields, or check out the website https://mahis.life/clip-fields/ for more interactive demos. Our arXiv submission is also out now at https://arxiv.org/abs/2210.05663, and if you want a longer tl;dr with a couple more videos, check out this tweet. Thanks!

  • Detic

    Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

  • We made this using pretty recent advances in web-data pretrained models like Detic and LSeg for detection, CLIP for visual queries, and Sentence BERT for semantic queries. Our "database" is really a neural field (Instant NGP) that maps from 3D coordinates to a high dimensional embedding vector in the same representation space as CLIP and SBERT.
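  • The lookup described above can be sketched in a few lines. This is a toy illustration only, with hypothetical stand-ins: `semantic_field` plays the role of the trained Instant-NGP field, and `encode_text` stands in for the real CLIP/SBERT text encoders.

    ```python
    import numpy as np

    # Toy sketch of the open-vocabulary lookup: a "field" maps 3D points to
    # embeddings, and a text query is answered by cosine similarity in that
    # shared space. All components are hypothetical stand-ins, not the real
    # Instant-NGP field or CLIP/SBERT encoders.

    EMB_DIM = 8                      # real CLIP embeddings are 512+ dimensional
    rng = np.random.default_rng(0)
    field_weights = rng.normal(size=(3, EMB_DIM))  # frozen random "field"

    def semantic_field(points: np.ndarray) -> np.ndarray:
        """Stand-in neural field: (N, 3) coordinates -> (N, EMB_DIM)
        embeddings in the same space as the text encoder's output."""
        return np.tanh(points @ field_weights)

    def encode_text(query: str) -> np.ndarray:
        """Stand-in text encoder: deterministic per-query unit vector."""
        seed = sum(query.encode())   # avoid hash(): not stable across runs
        v = np.random.default_rng(seed).normal(size=EMB_DIM)
        return v / np.linalg.norm(v)

    def locate(query: str, points: np.ndarray):
        """Return the candidate point whose field embedding has the highest
        cosine similarity with the query embedding."""
        text_emb = encode_text(query)
        embs = semantic_field(points)
        embs /= np.linalg.norm(embs, axis=1, keepdims=True)
        best = int(np.argmax(embs @ text_emb))
        return points[best], float((embs @ text_emb)[best])

    points = rng.normal(size=(100, 3))  # e.g. points sampled from a room scan
    best_point, score = locate("a red mug on the table", points)
    print(best_point, score)
    ```

    In the real system the field is trained so that a point's embedding matches the CLIP/SBERT embeddings of the weakly supervised labels observed at that point, which is what makes the similarity search meaningful.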

  • lang-seg

    Language-Driven Semantic Segmentation

  • CLIP

    CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

  • sentence-transformers

    Multilingual Sentence & Image Embeddings with BERT

  • instant-ngp

    Instant neural graphics primitives: lightning fast NeRF and more

NOTE: The number of mentions on this list reflects mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
