Pytorch
mediapipe

Metric | Pytorch | mediapipe
---|---|---
Mentions | 381 | 51
Stars | 86,719 | 28,512
Growth | 1.5% | 1.5%
Activity | 10.0 | 9.9
Last commit | 1 day ago | 5 days ago
Language | Python | C++
License | BSD 3-clause "New" or "Revised" License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Pytorch
-
Must-Know 2025 Developer’s Roadmap and Key Programming Trends
Python’s Growth in Data Work and AI: Python continues to lead because of its easy-to-read style and the huge number of libraries available for tasks from data work to artificial intelligence. Tools like TensorFlow and PyTorch make it a must-have. Whether you’re experienced or just starting, Python’s clear style makes it a good choice for diving into machine learning. Actionable Tip: If you’re new to Python, try projects that combine data with everyday problems. For example, build a simple recommendation system using Pandas and scikit-learn.
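As a rough illustration of the tip above, here is a minimal item-similarity sketch using Pandas and scikit-learn. The toy ratings, the column names, and the `similar_items` helper are invented for this example and do not come from the quoted article; a real recommender would need far more data and evaluation.

```python
import pandas as pd
from sklearn.metrics.pairwise import cosine_similarity

# Toy ratings, made up purely for illustration.
ratings = pd.DataFrame({
    "user":   ["ann", "ann",    "bob", "bob",    "cat",   "cat",  "dan"],
    "item":   ["tea", "coffee", "tea", "coffee", "juice", "soda", "juice"],
    "rating": [5,     4,        4,     5,        5,       4,      4],
})

# Rows are items, columns are users; unrated cells become 0.
matrix = ratings.pivot_table(index="item", columns="user",
                             values="rating", fill_value=0)

# Cosine similarity between every pair of item rating vectors.
sim = pd.DataFrame(cosine_similarity(matrix),
                   index=matrix.index, columns=matrix.index)


def similar_items(item, top_n=2):
    """Return the top_n items most similar to `item` (hypothetical helper)."""
    scores = sim[item].drop(item).sort_values(ascending=False)
    return scores.head(top_n).index.tolist()
```

On this toy data, `similar_items("tea", top_n=1)` ranks coffee first, because the same two users rated both items.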
-
Decorator JITs: Python as a DSL
Basically this style of code - https://github.com/pytorch-labs/attention-gym/pull/84/files - has issues like this - https://github.com/pytorch/pytorch/pull/137452 https://github.com/pytorch/pytorch/issues/144511 https://github.com/pytorch/pytorch/issues/145869
For some higher level context, see https://pytorch.org/blog/flexattention/
-
Building an AI-powered Financial Data Analyzer with NodeJS, Python, SvelteKit, and TailwindCSS - Part 0
The AI Service will be built using aiohttp (asynchronous Python web server) and integrates PyTorch, Hugging Face Transformers, numpy, pandas, and scikit-learn for financial data analysis.
- PyTorch 2.6.0 Release
-
Responsible Innovation: Open Source Best Practices for Sustainable AI
Open source frameworks like PyTorch are already enabling Machine Learning breakthroughs because they’re living communities where great things happen through:
-
Golang Vs. Python Performance: Which Programming Language Is Better?
- Data Science and AI: TensorFlow, PyTorch, and scikit-learn are only a few of the standard Python libraries. - Web Development: frameworks such as Flask and Django make developing web applications simple. - Prototyping: Python's ease of use lets you quickly iterate and test concepts.
-
How to resolve the dlopen problem with Nvidia and PyTorch or Tensorflow inside a virtual env
Fortunately, TensorFlow or PyTorch can work with the pip packages from Nvidia.
- Making VLLM work on WSL2
-
2025’s Must-Know Tech Stacks
PyTorch
-
Experiments with Byte Matrix Multiplication
> It's quite common in machine learning operations to multiply a matrix of unsigned byte by a matrix of signed byte. Don't ask me why, but that's the case.
Overflow is the reason. Intel's vpmaddubsw takes int8_t and uint8_t to give you results in int16_t. If both were unsigned, 255 * 255 = 65025 would be out of range for int16_t, so the instruction is likely designed to take int8_t and uint8_t. Overflow (or rather saturation, with this instruction) can still occur because it sums two adjacent products. See my comment in PyTorch. https://github.com/pytorch/pytorch/blob/a37db5ae3978010e1bb7...
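The arithmetic in the comment above can be checked with a small pure-Python emulation. This is only a sketch of the behaviour as described (unsigned byte times signed byte, adjacent products summed, result saturated to int16), not the intrinsic itself; the function name `maddubs_emulated` is hypothetical.

```python
INT16_MIN, INT16_MAX = -32768, 32767


def maddubs_emulated(a_u8, b_s8):
    """Multiply unsigned bytes by signed bytes, sum adjacent pairs of
    products, and saturate each sum to the int16 range."""
    if len(a_u8) != len(b_s8) or len(a_u8) % 2:
        raise ValueError("inputs must have the same even length")
    out = []
    for i in range(0, len(a_u8), 2):
        pair_a = a_u8[i:i + 2]
        pair_b = b_s8[i:i + 2]
        if not all(0 <= a <= 255 for a in pair_a):
            raise ValueError("a_u8 values must fit in uint8")
        if not all(-128 <= b <= 127 for b in pair_b):
            raise ValueError("b_s8 values must fit in int8")
        # Each single product fits in int16 (at most 255 * 127 = 32385),
        # but the sum of two products can exceed the range.
        total = pair_a[0] * pair_b[0] + pair_a[1] * pair_b[1]
        out.append(max(INT16_MIN, min(INT16_MAX, total)))
    return out


# 255 * 127 + 255 * 127 = 64770, which saturates to 32767.
print(maddubs_emulated([255, 255], [127, 127]))
```

A single unsigned-by-signed product always fits in int16, which is why the mixed signedness helps; it is only the pairwise sum that can saturate.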
mediapipe
-
Integrating MediaPipe with DeepSeek for Enhanced AI Performance
Code Examples: Check out the MediaPipe and LLM Integration Examples provided by Google AI Edge.
-
Developing an ASL App with Kaggle’s Top Model and Customized MediaPipe Gesture Model
Hand gesture recognition model customization guide
-
Mediapipe openpose Controlnet model for SD
mediapipe/docs/solutions/pose.md at master · google/mediapipe · GitHub
-
MEDIAPIPE on-device diffusion plugins for conditioned text-to-image generation
Today, we announce MediaPipe diffusion plugins, which enable controllable text-to-image generation to be run on-device. Expanding upon our prior work on GPU inference for on-device large generative models, we introduce new low-cost solutions for controllable text-to-image generation that can be plugged into existing diffusion models and their Low-Rank Adaptation (LoRA) variants.
-
Running a TensorFlow object detector model and drawing boxes around objects at 60 FPS - all in React Native / JavaScript!
You can just grab the TFLite version! https://github.com/google/mediapipe/blob/master/docs/solutions/models.md
-
OpenAI came after our domain because we use GPT in it
I believe Google already released transformers under an Apache 2 license with a patent grant:
https://github.com/google/mediapipe/blob/master/mediapipe/mo...
-
Open source Background Remover: Remove Background from images and video using AI
I was going to say that I like the MediaPipe Selfie Segmentation model for doing this sort of thing in a web page, but I've just noticed (when getting the GitHub link[1]) that Google have marked the code as legacy[2] ... no idea if the new solution is better/easier to use[3].
For what it's worth, my CodePen using the old model is here: https://codepen.io/kaliedarik/pen/PopBxBM
[1] - https://github.com/google/mediapipe/blob/master/docs/solutio...
[2] - "Attention: Thank you for your interest in MediaPipe Solutions. As of April 4, 2023, this solution was upgraded to a new MediaPipe Solution."
[3] - https://developers.google.com/mediapipe/solutions/vision/ima...
-
[P] Pattern recognition
I have used mediapipe very successfully in multiple projects and it's very easy to get running. You can choose from many different vision tasks including hand landmarks ( https://github.com/google/mediapipe/blob/master/docs/solutions/hands.md )
-
Getting face feature pose statistics
I found MediaPipe's Face Mesh and was impressed with how simple it was to get going, but it just gives you the landmark points and I've not gone any further yet.
-
New ControlNet Face Model
We've trained ControlNet on a subset of the LAION-Face dataset using modified output from MediaPipe's face mesh annotator to provide a new level of control when generating images of faces.
What are some alternatives?
tinygrad - You like pytorch? You like micrograd? You love tinygrad! ❤️
openpose - OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
tensorflow - An Open Source Machine Learning Framework for Everyone
AlphaPose - Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
pifuhd - High-Resolution 3D Human Digitization from A Single Image.
Flux.jl - Relax! Flux is the ML library that doesn't make you tensor
ue4-mediapipe-plugin - UE4 MediaPipe plugin
Deep Java Library (DJL) - An Engine-Agnostic Deep Learning Framework in Java
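flutter_hand_tracking_plugin - A Flutter package that uses the camera to accurately track and recognize the motion paths/trajectories and gestures of all ten fingers, and outputs 22 hand keypoints to support further gesture customization. Based on this package you can write business logic that converts gesture information into commands in real time: one, two, three, four, five, rock, spiderman... You can also write different effects for different gestures. It can be used for short-video and live-streaming effects, smart hardware, and other fields, bringing a more natural and richer experience to human-computer interaction.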
CNTK - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
BlazePose-tensorflow - A third-party Tensorflow Implementation for paper "BlazePose: On-device Real-time Body Pose tracking".
