Python Vision

Open-source Python projects categorized as Vision

Top 16 Python Vision Projects

  1. skyvern

    Automate browser-based workflows with LLMs and Computer Vision

    Project mention: Show HN: MCP Server to let agents control the browser | news.ycombinator.com | 2025-04-03

    Hey HN, we were playing around with MCPs over the weekend and thought it would be cool to build an MCP that lets Claude / Cursor / Windsurf control your browser: https://github.com/Skyvern-AI/skyvern/tree/main/integrations...

    Just for context, we’re building Skyvern, an open source AI Agent that can control and interact with browsers using prompts, similar to OpenAI’s Operator.

    The MCP Server can:

    - This allows Claude to navigate to docs websites / stack overflow and look up information like the top posts on hackernews

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. donkeycar

    Open source hardware and software platform to build a small scale self driving car.

  4. SimpleCV

    The Open Source Framework for Machine Vision

  5. tsdf-fusion-python

    Python code to fuse multiple RGB-D images into a TSDF voxel volume.

  6. deepdrive

    Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving

  7. caer

    High-performance Vision library in Python. Scale your research, not boilerplate.

  8. vector-python-sdk

    Anki Vector Python SDK

  9. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  10. LLaVA-Mini

    LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

    Project mention: LLaVA-Mini: Efficient Image and Video Large Multimodal Models | news.ycombinator.com | 2025-01-12
  11. ChatGPT-OpenAI-Smart-Speaker

    This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and Langchain agents.

  12. MLP-Mixer-pytorch

    Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision

  13. eqxvision

    A Python package of computer vision models for the Equinox ecosystem.

  14. MP4-Mux-Tool

    Mp4Box GUI

  15. mtgscan

    Recognition of Magic cards on images. Detection with OCR.

  16. CPPE-Dataset

    Code for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset

  17. text-generator.io

    Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io

  18. Auto-GPT

    Auto-GPT + CLIP vision for stable v0.3.1 (by zer0int)

  19. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Vision discussion

Log in or Post with

Python Vision related posts

  • AIM Weekly for 07 Oct 2024

    16 projects | dev.to | 7 Oct 2024
  • Is there a way for people to translate scans if I buy the raws? there are a lot of older series that sound interesting but no ones ever translated them.

    2 projects | /r/mangadex | 18 Dec 2021
  • [R] MLP-Mixer: An all-MLP Architecture for Vision

    1 project | /r/ResearchML | 5 May 2021

Index

What are some of the best open-source Vision projects in Python? This list will help you:

# Project Stars
1 skyvern 13,738
2 donkeycar 3,271
3 SimpleCV 2,719
4 tsdf-fusion-python 1,307
5 deepdrive 918
6 caer 788
7 vector-python-sdk 573
8 LLaVA-Mini 504
9 ChatGPT-OpenAI-Smart-Speaker 295
10 MLP-Mixer-pytorch 217
11 eqxvision 107
12 MP4-Mux-Tool 71
13 mtgscan 69
14 CPPE-Dataset 69
15 text-generator.io 35
16 Auto-GPT 7

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?