Top 22 JavaScript Computer Vision Projects

label-studio

49 16,385 9.8 JavaScript

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Project mention: First 15 Open Source Advent projects | dev.to | 2023-12-15

14. LabelStudio by Human Signal | Github | tutorial

ai-deadlines

3 5,380 8.8 JavaScript

:alarm_clock: AI conference deadline countdowns
SurveyJS

surveyjs.io sponsored

Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
Universal Data Tool

0 1,871 0.0 JavaScript

Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
DeepCamera

9 1,703 2.3 JavaScript

Open-Source AI Camera. Empower any camera/CCTV with state-of-the-art AI, including facial recognition, person recognition(RE-ID) car detection, fall detection and more

Project mention: Open-Source AI Camera | news.ycombinator.com | 2023-06-19

lego-art-remix

2 243 0.0 JavaScript

Powerful computer vision assisted Lego mosaic creator · Over 1 million images created (so far!)
handpose-facemesh-demos

3 183 1.8 JavaScript

🎥🤟 8 minimalistic templates for tfjs mediapipe handpose and facemesh
FacePause

1 176 1.8 JavaScript

Look Away to Pause Youtube - Experimental Chrome Extension
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
cep

2 162 0.0 JavaScript

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
dji-aerial-georeferencing

8 158 3.2 JavaScript

Detect objects in drone videos and plot them on a map

Project mention: DJI drone flight log viewer | news.ycombinator.com | 2023-05-08

How does this compare with the data you get from Airdata?
When I was working on an aerial georeferencing project[1] I found that the on-device flight logs didn’t contain all the info in the serverside logs (eg heading, gimbal orientation, GPS coords, altitude)
[1] https://github.com/roboflow/dji-aerial-georeferencing

parker

7 129 8.3 JavaScript

Parking detection and monitoring webapp that runs entirely in the browser

Project mention: What personal project best demonstrates your exceptional development skills and creative prowess? | /r/webdev | 2023-07-07

Alt-Text-Generator

3 128 6.8 Svelte

Generate alt text for your images!

Project mention: Alt Text Generator | news.ycombinator.com | 2023-07-21

TFJS-object-detection

1 124 0.0 JavaScript

Real-time custom object detection in the browser using tensorflow.js
WebAR.rocks.face

1 77 4.4 JavaScript

WebAR.rocks face detection and tracking JavaScript library
yolov7-tfjs

1 75 1.3 JavaScript

Object Detection using Yolov7 in tensorflow.js
compreface-javascript-sdk

1 65 0.0 JavaScript

JavaScript SDK for CompreFace - free and open-source face recognition system from Exadel
teachablemachine-node

1 53 0.0 JavaScript

Using Teachable Machine Models in Node.js
blackjack-basic-strategy

23 26 2.0 JavaScript

A computer vision powered Blackjack basic strategy app powered by Roboflow.

Project mention: Show HN: Pip install inference, open source computer vision deployment | news.ycombinator.com | 2023-08-23

It’s an easy to use inference server for computer vision models.
The end result is a Docker container that serves a standardized API as a microservice that your application uses to get predictions from computer vision models (though there is also a native Python interface).
It’s backed by a bunch of component pieces:
* a server (so you don’t have to reimplement things like image processing & prediction visualization on every project)
* standardized APIs for computer vision tasks (so switching out the model weights and architecture can be done independently of your application code)
* model architecture implementations (which implement the tensor parsing glue between images & predictions) for supervised models that you've fine-tuned to perform custom tasks
* foundation model implementations (like CLIP & SAM) that tend to chain well with fine-tuned models
* reusable utils to make adding support for new models easier
* a model registry (so your code can be independent from your model weights & you don't have to re-build and re-deploy every time you want to iterate on your model weights)
* data management integrations (so you can collect more images of edge cases to improve your dataset & model the more it sees in the wild)
* ecosystem (there are tens of thousands of fine-tuned models shared by users that you can use off the shelf via Roboflow Universe[1])
Additionally, since it's focused specifically on computer vision, it has specific CV-focused features (like direct camera stream input) and makes some different tradeoffs than other more general ML solutions (namely, optimized for small-fast models that run at the edge & need support for running on many different devices like NVIDIA Jetsons and Raspberry Pis in addition to beefy cloud servers).
[1] https://universe.roboflow.com

poor-mans-vr

3 26 7.0 JavaScript

Watch videos in VR using only your front camera + tensorflow.js

Project mention: Ask HN: Show me your half baked project | news.ycombinator.com | 2023-10-12

https://github.com/muxamilian/poor-mans-vr
A poor man's VR: Using the front camera and tensorflow.js, the smartphone becomes a “window” into the real world. Video and image content appear as if they were seen through this window. To do this, the viewer’s position is determined using a neural network. The viewed content is then moved according to the viewer’s position. This makes it seem like the content is physically behind the smartphone and is viewed through the smartphone’s screen. This effect is especially useful for content captured using an ultra-wide lens.

rickblocker

1 14 0.0 JavaScript

Audio visual mitigation of Rickrolls using computer vision.
Yolov8-Segmentation-on-Browser

1 13 4.6 JavaScript

Example of YOLOv8 Segmentation on Browser. It is powered by Onnx and served through JavaScript without any frameworks
checkpose

5 12 6.5 JavaScript

Offline Body Movement Analysis in the Browser
grassland

4 0 4.8 JavaScript

The Grassland Client (desktop version)

Project mention: Are there any blockchain data science groups? | /r/datascience | 2023-07-11

What we're doing might be of interest to you. https://www.grassland.network. All data is stored on a custom, spatio-temporal blockchain in real-time and can be used as a data source for ML training and inference.

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

JavaScript Computer Vision related posts

Building an Instill AI Pipeline in 5 minutes
1 project | dev.to | 22 Oct 2023
A poor man’s VR (front camera and tensorflow.js)
1 project | /r/virtualreality | 9 Oct 2023
Show HN: Real-Time 3D Gaussian Splatting in WebGL
4 projects | news.ycombinator.com | 11 Sep 2023
Alt Text Generator
1 project | news.ycombinator.com | 21 Jul 2023
Are there any blockchain data science groups?
1 project | /r/datascience | 11 Jul 2023
Looking for contributors for an Open Source - Rust - Computer Vision Project
2 projects | /r/ProgrammingBuddies | 1 Jun 2023
DeepStack (dead?) vs CompreFace (slow?)
3 projects | /r/selfhosted | 7 May 2023
A note from our sponsor - InfluxDB
www.influxdata.com | 23 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source Computer Vision projects in JavaScript? This list will help you:

	Project	Stars
1	label-studio	16,385
2	ai-deadlines	5,380
3	Universal Data Tool	1,871
4	DeepCamera	1,703
5	lego-art-remix	243
6	handpose-facemesh-demos	183
7	FacePause	176
8	cep	162
9	dji-aerial-georeferencing	158
10	parker	129
11	Alt-Text-Generator	128
12	TFJS-object-detection	124
13	WebAR.rocks.face	77
14	yolov7-tfjs	75
15	compreface-javascript-sdk	65
16	teachablemachine-node	53
17	blackjack-basic-strategy	26
18	poor-mans-vr	26
19	rickblocker	14
20	Yolov8-Segmentation-on-Browser	13
21	checkpose	12
22	grassland	0