JavaScript Computer Vision

Open-source JavaScript projects categorized as Computer Vision

Top 22 JavaScript Computer Vision Projects

  • label-studio

    Label Studio is a multi-type data labeling and annotation tool with standardized output format

  • Project mention: First 15 Open Source Advent projects | dev.to | 2023-12-15

    14. LabelStudio by Human Signal | Github | tutorial

  • ai-deadlines

    :alarm_clock: AI conference deadline countdowns

  • SurveyJS

    Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

    SurveyJS logo
  • Universal Data Tool

    Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.

  • DeepCamera

    Open-Source AI Camera. Empower any camera/CCTV with state-of-the-art AI, including facial recognition, person recognition(RE-ID) car detection, fall detection and more

  • Project mention: Open-Source AI Camera | news.ycombinator.com | 2023-06-19
  • lego-art-remix

    Powerful computer vision assisted Lego mosaic creator · Over 1 million images created (so far!)

  • handpose-facemesh-demos

    🎥🤟 8 minimalistic templates for tfjs mediapipe handpose and facemesh

  • FacePause

    Look Away to Pause Youtube - Experimental Chrome Extension

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • cep

    CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

  • dji-aerial-georeferencing

    Detect objects in drone videos and plot them on a map

  • Project mention: DJI drone flight log viewer | news.ycombinator.com | 2023-05-08

    How does this compare with the data you get from Airdata?

    When I was working on an aerial georeferencing project[1] I found that the on-device flight logs didn’t contain all the info in the serverside logs (eg heading, gimbal orientation, GPS coords, altitude)

    [1] https://github.com/roboflow/dji-aerial-georeferencing

  • parker

    Parking detection and monitoring webapp that runs entirely in the browser

  • Project mention: What personal project best demonstrates your exceptional development skills and creative prowess? | /r/webdev | 2023-07-07
  • Alt-Text-Generator

    Generate alt text for your images!

  • Project mention: Alt Text Generator | news.ycombinator.com | 2023-07-21
  • TFJS-object-detection

    Real-time custom object detection in the browser using tensorflow.js

  • WebAR.rocks.face

    WebAR.rocks face detection and tracking JavaScript library

  • yolov7-tfjs

    Object Detection using Yolov7 in tensorflow.js

  • compreface-javascript-sdk

    JavaScript SDK for CompreFace - free and open-source face recognition system from Exadel

  • teachablemachine-node

    Using Teachable Machine Models in Node.js

  • blackjack-basic-strategy

    A computer vision powered Blackjack basic strategy app powered by Roboflow.

  • Project mention: Show HN: Pip install inference, open source computer vision deployment | news.ycombinator.com | 2023-08-23

    It’s an easy to use inference server for computer vision models.

    The end result is a Docker container that serves a standardized API as a microservice that your application uses to get predictions from computer vision models (though there is also a native Python interface).

    It’s backed by a bunch of component pieces:

    * a server (so you don’t have to reimplement things like image processing & prediction visualization on every project)

    * standardized APIs for computer vision tasks (so switching out the model weights and architecture can be done independently of your application code)

    * model architecture implementations (which implement the tensor parsing glue between images & predictions) for supervised models that you've fine-tuned to perform custom tasks

    * foundation model implementations (like CLIP & SAM) that tend to chain well with fine-tuned models

    * reusable utils to make adding support for new models easier

    * a model registry (so your code can be independent from your model weights & you don't have to re-build and re-deploy every time you want to iterate on your model weights)

    * data management integrations (so you can collect more images of edge cases to improve your dataset & model the more it sees in the wild)

    * ecosystem (there are tens of thousands of fine-tuned models shared by users that you can use off the shelf via Roboflow Universe[1])

    Additionally, since it's focused specifically on computer vision, it has specific CV-focused features (like direct camera stream input) and makes some different tradeoffs than other more general ML solutions (namely, optimized for small-fast models that run at the edge & need support for running on many different devices like NVIDIA Jetsons and Raspberry Pis in addition to beefy cloud servers).

    [1] https://universe.roboflow.com

  • poor-mans-vr

    Watch videos in VR using only your front camera + tensorflow.js

  • Project mention: Ask HN: Show me your half baked project | news.ycombinator.com | 2023-10-12

    https://github.com/muxamilian/poor-mans-vr

    A poor man's VR: Using the front camera and tensorflow.js, the smartphone becomes a “window” into the real world. Video and image content appear as if they were seen through this window. To do this, the viewer’s position is determined using a neural network. The viewed content is then moved according to the viewer’s position. This makes it seem like the content is physically behind the smartphone and is viewed through the smartphone’s screen. This effect is especially useful for content captured using an ultra-wide lens.

  • rickblocker

    Audio visual mitigation of Rickrolls using computer vision.

  • Yolov8-Segmentation-on-Browser

    Example of YOLOv8 Segmentation on Browser. It is powered by Onnx and served through JavaScript without any frameworks

  • checkpose

    Offline Body Movement Analysis in the Browser

  • grassland

    The Grassland Client (desktop version)

  • Project mention: Are there any blockchain data science groups? | /r/datascience | 2023-07-11

    What we're doing might be of interest to you. https://www.grassland.network. All data is stored on a custom, spatio-temporal blockchain in real-time and can be used as a data source for ML training and inference.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

JavaScript Computer Vision related posts

Index

What are some of the best open-source Computer Vision projects in JavaScript? This list will help you:

Project Stars
1 label-studio 16,385
2 ai-deadlines 5,380
3 Universal Data Tool 1,871
4 DeepCamera 1,703
5 lego-art-remix 243
6 handpose-facemesh-demos 183
7 FacePause 176
8 cep 162
9 dji-aerial-georeferencing 158
10 parker 129
11 Alt-Text-Generator 128
12 TFJS-object-detection 124
13 WebAR.rocks.face 77
14 yolov7-tfjs 75
15 compreface-javascript-sdk 65
16 teachablemachine-node 53
17 blackjack-basic-strategy 26
18 poor-mans-vr 26
19 rickblocker 14
20 Yolov8-Segmentation-on-Browser 13
21 checkpose 12
22 grassland 0

Sponsored
Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com