BoofCV
mediapipe
Our great sponsors
BoofCV | mediapipe | |
---|---|---|
20 | 49 | |
1,034 | 25,331 | |
- | 1.8% | |
8.5 | 9.9 | |
about 2 months ago | 5 days ago | |
Java | C++ | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
BoofCV
-
Recommended camera/projector calibration software?
BoofCV https://github.com/lessthanoptimal/BoofCV
-
JDK 21 - Image Performance Improvements
Is there any fast way to get pixel values and pixel coordinate? I had to jump through a lot of hoops to get convert BufferedImages into a format that's useful for image processing it to be a reasonable speed in BoofCV. getRGB() is glacial. At one point I was trying to convince the JDK team to make private data structures public again. Right now it's inconsistent what you have access too.
-
Good Open Source Repositories that Accepts New Contributors
Speaking of using the vector api, I see there's a class in boofcv that converts rgb to hsv. I've previously written a simd accelerated version of rgb to hsv using the java vector api. For anybody looking to do a bit of code janitor work, converting the one-off library into something that could be contributed to /u/lessthanoptimal's project might be a worthwhile contribution.
-
JavaFX .jar (from clojure) won't find "glass" when run via `java -jar`
For instance I looked at BoofCV and all I found was : https://github.com/lessthanoptimal/BoofCV/issues/265 " the question now seems to be can you compile the library as native. The answer is probably but someone needs to try it."
-
BoofCV v0.38 Release Summary
It's an all-Java computer vision library, see https://boofcv.org/ and https://github.com/lessthanoptimal/BoofCV
-
BoofCV v0.38: Much improved scene reconstruction, loop closure, more concurrency. Also updated PyBoof
project website: https://boofcv.org
-
Tesseract OCR
Image processing strongly depends on what image you wanna use. To find an "auto" approach, that works for every image is nearly impossible...
I once wrote a bookscanner app in Java (https://boofcv.org), where everything was done automatically (preprocessing, object detection / book extraction, skin detection / finger removal, deskewing, line-slope-correction and so on). It was very difficult to adjust the parameters, that at least most of the books looked good.
- Does anyone know of an open-source library that not only has object-detection but also object following?
mediapipe
-
MEDIAPIPE on-device diffusion plugins for conditioned text-to-image generation
Today, we announce MediaPipe diffusion plugins, which enable controllable text-to-image generation to be run on-device. Expanding upon our prior work on GPU inference for on-device large generative models, we introduce new low-cost solutions for controllable text-to-image generation that can be plugged into existing diffusion models and their Low-Rank Adaptation (LoRA) variants.
-
Running a TensorFlow object detector model and drawing boxes around objects at 60 FPS - all in React Native / JavaScript!
You can just grab the TFLite version! https://github.com/google/mediapipe/blob/master/docs/solutions/models.md
-
Open source Background Remover: Remove Background from images and video using AI
I was going to say that I like the MediaPipe Selfie Segmentation model for doing this sort of thing in a web page, but I've just noticed (when getting the GitHub link[1]) that Google have marked the code as legacy[2] ... no idea if the new solution is better/easier to use[3].
For what it's worth, my CodePen using the old model is here: https://codepen.io/kaliedarik/pen/PopBxBM
[1] - https://github.com/google/mediapipe/blob/master/docs/solutio...
[2] - "Attention: Thank you for your interest in MediaPipe Solutions. As of April 4, 2023, this solution was upgraded to a new MediaPipe Solution."
[3] - https://developers.google.com/mediapipe/solutions/vision/ima...
-
Getting face feature pose statistics
I found MediaPipe's Face Mesh and was impressed with how simple it was to get going, but it just gives you the landmark points and I've not gone any further yet.
-
New ControlNet Face Model
We've trained ControlNet on a subset of the LAION-Face dataset using modified output from MediaPipe's face mesh annotator to provide a new level of control when generating images of faces.
-
Trained an ML model using TensorFlow.js to classify American Sign Language (ASL) alphabets on browser. We are creating an open-source platform and would love to receive your feedback on our project.
Medipaipe library link: https://mediapipe.dev/
-
mediapipe VS daisykit - a user suggested alternative
2 projects | 24 Mar 2023
- Google Summer of code 2023 is coming
-
10$ Full Body Tracking! I'm proud to release ToucanTrack (in Beta!). Get decent FBT with the power of 2 PS3 Eye Cameras and AI!
If you're looking for the differences in terms of how inference is done, I recommend you take a look at MediaPipe's source code. MediaPipe doesn't use raw code, but uses a "graph" instead (eg. pose_landmark_cpu.pbtxt), which can be visualised using MediaPipe Viz. I also used axinc-ai/ailia-models as the base (preprocessing, inference, postprocessing, etc...) which I further built upon (keypoint refinement, roi from keypoints, filtering / smoothing, etc...)
-
Started working on this motion tracking prototype demo game in python and Unity!
I thought of doing that but unfortunately medipipe requires a RGB input and performs better with it more on that here
What are some alternatives?
openpose - OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
ue4-mediapipe-plugin - UE4 MediaPipe plugin
Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
AlphaPose - Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
BlazePose-tensorflow - A third-party Tensorflow Implementation for paper "BlazePose: On-device Real-time Body Pose tracking".
jeelizFaceFilter - Javascript/WebGL lightweight face tracking library designed for augmented reality webcam filters. Features : multiple faces detection, rotation, mouth opening. Various integration examples are provided (Three.js, Babylon.js, FaceSwap, Canvas2D, CSS3D...).
pifuhd - High-Resolution 3D Human Digitization from A Single Image.
flutter_hand_tracking_plugin - 这是一个 Flutter Packge 以实现摄像头精确追踪并识别十指的运动路径/轨迹和手势动作, 且输出22个手部关键点以支持更多手势自定义. 基于这个包可以编写业务逻辑将手势信息实时转化为指令信息: 一二三四五, rock, spiderman...还可以对不同手势编写不同特效. 可用于短视频直播特效, 智能硬件等领域, 为人机互动带来更自然丰富的体验
tfjs-models - Pretrained models for TensorFlow.js
bevy - A refreshingly simple data-driven game engine built in Rust
OpenCV - Open Source Computer Vision Library
Unity-Robotics-Hub - Central repository for tools, tutorials, resources, and documentation for robotics simulation in Unity.