Top 16 Python Vision Projects
-
Project mention: Show HN: MCP Server to let agents control the browser | news.ycombinator.com | 2025-04-03
Hey HN, we were playing around with MCPs over the weekend and thought it would be cool to build an MCP that lets Claude / Cursor / Windsurf control your browser: https://github.com/Skyvern-AI/skyvern/tree/main/integrations...
Just for context, we’re building Skyvern, an open source AI Agent that can control and interact with browsers using prompts, similar to OpenAI’s Operator.
The MCP Server can:
- This allows Claude to navigate to docs websites / Stack Overflow and look up information like the top posts on Hacker News
-
deepdrive
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
-
LLaVA-Mini
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
Project mention: LLaVA-Mini: Efficient Image and Video Large Multimodal Models | news.ycombinator.com | 2025-01-12
-
ChatGPT-OpenAI-Smart-Speaker
This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and Langchain agents.
-
CPPE-Dataset
Code for our paper CPPE-5 (Medical Personal Protective Equipment), a new challenging object detection dataset
Index
What are some of the best open-source Vision projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | skyvern | 13,738 |
2 | donkeycar | 3,271 |
3 | SimpleCV | 2,719 |
4 | tsdf-fusion-python | 1,307 |
5 | deepdrive | 918 |
6 | caer | 788 |
7 | vector-python-sdk | 573 |
8 | LLaVA-Mini | 504 |
9 | ChatGPT-OpenAI-Smart-Speaker | 295 |
10 | MLP-Mixer-pytorch | 217 |
11 | eqxvision | 107 |
12 | MP4-Mux-Tool | 71 |
13 | mtgscan | 69 |
14 | CPPE-Dataset | 69 |
15 | text-generator.io | 35 |
16 | Auto-GPT | 7 |