mediapipe
tfjs-models
Our great sponsors
mediapipe | tfjs-models | |
---|---|---|
49 | 50 | |
25,331 | 13,677 | |
1.8% | 1.0% | |
9.9 | 7.7 | |
5 days ago | 7 days ago | |
C++ | TypeScript | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
mediapipe
-
MEDIAPIPE on-device diffusion plugins for conditioned text-to-image generation
Today, we announce MediaPipe diffusion plugins, which enable controllable text-to-image generation to be run on-device. Expanding upon our prior work on GPU inference for on-device large generative models, we introduce new low-cost solutions for controllable text-to-image generation that can be plugged into existing diffusion models and their Low-Rank Adaptation (LoRA) variants.
-
Running a TensorFlow object detector model and drawing boxes around objects at 60 FPS - all in React Native / JavaScript!
You can just grab the TFLite version! https://github.com/google/mediapipe/blob/master/docs/solutions/models.md
-
Open source Background Remover: Remove Background from images and video using AI
I was going to say that I like the MediaPipe Selfie Segmentation model for doing this sort of thing in a web page, but I've just noticed (when getting the GitHub link[1]) that Google have marked the code as legacy[2] ... no idea if the new solution is better/easier to use[3].
For what it's worth, my CodePen using the old model is here: https://codepen.io/kaliedarik/pen/PopBxBM
[1] - https://github.com/google/mediapipe/blob/master/docs/solutio...
[2] - "Attention: Thank you for your interest in MediaPipe Solutions. As of April 4, 2023, this solution was upgraded to a new MediaPipe Solution."
[3] - https://developers.google.com/mediapipe/solutions/vision/ima...
-
Getting face feature pose statistics
I found MediaPipe's Face Mesh and was impressed with how simple it was to get going, but it just gives you the landmark points and I've not gone any further yet.
-
New ControlNet Face Model
We've trained ControlNet on a subset of the LAION-Face dataset using modified output from MediaPipe's face mesh annotator to provide a new level of control when generating images of faces.
-
Trained an ML model using TensorFlow.js to classify American Sign Language (ASL) alphabets on browser. We are creating an open-source platform and would love to receive your feedback on our project.
Medipaipe library link: https://mediapipe.dev/
-
mediapipe VS daisykit - a user suggested alternative
2 projects | 24 Mar 2023
- Google Summer of code 2023 is coming
-
10$ Full Body Tracking! I'm proud to release ToucanTrack (in Beta!). Get decent FBT with the power of 2 PS3 Eye Cameras and AI!
If you're looking for the differences in terms of how inference is done, I recommend you take a look at MediaPipe's source code. MediaPipe doesn't use raw code, but uses a "graph" instead (eg. pose_landmark_cpu.pbtxt), which can be visualised using MediaPipe Viz. I also used axinc-ai/ailia-models as the base (preprocessing, inference, postprocessing, etc...) which I further built upon (keypoint refinement, roi from keypoints, filtering / smoothing, etc...)
-
Started working on this motion tracking prototype demo game in python and Unity!
I thought of doing that but unfortunately medipipe requires a RGB input and performs better with it more on that here
tfjs-models
-
Running a TensorFlow object detector model and drawing boxes around objects at 60 FPS - all in React Native / JavaScript!
I am wondering, will this also work with tensoflow.js or only tflite? I'd like to use this hand pose estimation from mediapipe: https://github.com/tensorflow/tfjs-models/tree/master/hand-pose-detection
-
ML tool to read PDF file and answer questions from its content
I got the basic concept working using TensorFlow QnA model but the answers don't seem very accurate. Infact, unless you specifically ask the exact question, you dont get the right answer. Its not intelligent enough because the entire PDF content becomes a bag of words instead of having context to those words. for eg. When someone types "languages", it should search within a section named Languages.
-
React + Tensorflow.js , a cool recipe for AI powered applications
I think you are beginning to connect the dots by now 😉 What we will do is to build a small proof-of-concept (POC) by writing a simple react app and hook up a pre-trained tensorflow.js model, The text toxicity model to "moderate" the user's text input and show a notification of what's wrong with it, a text toxicity meter if you will...
-
Chat moderation in Daily using TensorFlow.js
TensorFlow.js is a JavaScript library developers can use to run pre-trained machine-learning models in the browser. The library has a variety of models for tasks such as object identification and language processing. One of these models is the text toxicity detection model.
- Does tensorflow offer a 3d meshing model for body parts?
-
Seeking Advice on Resources for Creating an Image Analysis and Manipulation AI
No clue if it will work on drawn images. It depends on what the model was trained on. The models github is a good place to get started on figuring that out: https://github.com/tensorflow/tfjs-models/tree/body-pix-v2.0.4/body-pix
-
[AskJS] Rate a string on how much sense it makes
You probably want to check out something like tensorflow https://github.com/tensorflow/tfjs-models where you can build and test your models.
-
Exploring face mapping with TFJS
All data to be used with a model needs to be prepared to be in the correct input format. In this case images needs to be turned into tensors. The exact format is usually documented but I found the underlying documentation for this model a bit lacking in details about the exact input/outputs but still useful for interpreting them https://github.com/tensorflow/tfjs-models/tree/master/face-landmarks-detection. Instead we can examine the model.json which is more explict. We can find a few keys that define the input and output:
-
[N] Body tracking with TensorFlow
The BlazePose software determines the position of the human body based on the camera image. For instance, a fitness app can automatically evaluate your technique, an online store can suggest suitable clothing styles, or games - create an avatar that repeats your actions. Model code is available on GitHub, there is also a demo in the browser.
-
Building UIs in Figma with hand movements
Since the release of the latest version of the MediaPipe handpose detection machine learning model that allows the detection of multiple hands, I've had in mind to try to use it to create UIs, and here's the result of a quick prototype built in a few hours!
What are some alternatives?
openpose - OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
ue4-mediapipe-plugin - UE4 MediaPipe plugin
Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
AlphaPose - Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
BlazePose-tensorflow - A third-party Tensorflow Implementation for paper "BlazePose: On-device Real-time Body Pose tracking".
jeelizFaceFilter - Javascript/WebGL lightweight face tracking library designed for augmented reality webcam filters. Features : multiple faces detection, rotation, mouth opening. Various integration examples are provided (Three.js, Babylon.js, FaceSwap, Canvas2D, CSS3D...).
pifuhd - High-Resolution 3D Human Digitization from A Single Image.
flutter_hand_tracking_plugin - 这是一个 Flutter Packge 以实现摄像头精确追踪并识别十指的运动路径/轨迹和手势动作, 且输出22个手部关键点以支持更多手势自定义. 基于这个包可以编写业务逻辑将手势信息实时转化为指令信息: 一二三四五, rock, spiderman...还可以对不同手势编写不同特效. 可用于短视频直播特效, 智能硬件等领域, 为人机互动带来更自然丰富的体验
bevy - A refreshingly simple data-driven game engine built in Rust
Unity-Robotics-Hub - Central repository for tools, tutorials, resources, and documentation for robotics simulation in Unity.
stylegan2-pytorch - Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
Bazel - a fast, scalable, multi-language and extensible build system