notebooks
supervision
notebooks | supervision | |
---|---|---|
19 | 15 | |
4,164 | 14,068 | |
3.2% | 4.8% | |
8.3 | 9.9 | |
17 days ago | 1 day ago | |
Jupyter Notebook | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
notebooks
-
Supervision: Reusable Computer Vision
Yeah, inference[1] is our open source package for running locally (either directly in Python or via a Docker container). It works with all the models on Universe, models you train yourself (assuming we support the architecture; we have a bunch of notebooks available[2]), or train in our platform, plus several more general foundation models[3] (for things like embeddings, zero-shot detection, question answering, OCR, etc).
We also have a hosted API[4] you can hit for most models we support (except some of the large vision models that are really GPU-heavy) if you prefer.
[1] https://github.com/roboflow/inference
[2] https://github.com/roboflow/notebooks
[3] https://inference.roboflow.com/foundation/about/
[4] https://docs.roboflow.com/deploy/hosted-api
-
Roboflow Notebooks: 30+ tutorials on using SOTA models and vision techniques
We (the Roboflow open source team) actively write open source Google Colab notebooks showing how to use new SOTA models. Our library covers SAM, CLIP, Detectron2, YOLOv8, RTMDet, DINOv2, and more. These notebooks helped me cross the chasm from "how do I use X model?" to being able to both write and understand inference code.
- Notebooks: How to tutorials for computer vision models and techniques
-
Training Instance Segmentation models on custom dataset
Here's an open source SegFormer notebook and guide: https://github.com/roboflow/notebooks/blob/main/notebooks/train-segformer-segmentation-on-custom-data.ipynb
-
[Advice request] How on earth am I supposed to break into machine learning research as an undergraduate?
Great ways to get some experience in general ML: * https://kaggle.com/learn to up your skill-set, practice a bit, and improve breadth of knowledge in topics like deep learning and computer vision * https://huggingface.co/learn free NLP courses that will really beef up your skillset * https://madewithml.com - robust tutorials for the end-to-end deep learning MLOps process * https://roboflow.com/learn - intro course material and some advanced topics in computer vision; tutorial walkthroughs for model training: https://github.com/roboflow/notebooks
-
Generate Synthetic Computer Vision Data with Stable Diffusion Image-to-Image
Repo: https://github.com/roboflow/notebooks/blob/main/notebooks/sa...
-
Rich Jupyter Notebook Diffs on GitHub... Finally.
Here are the notebooks I spend day and night refining: https://github.com/roboflow/notebooks
-
Tools for object detection on satellite images
You’ll just need to have labeled solar panel images, and pick a model architecture and tutorial to train with: https://github.com/roboflow/notebooks
-
[OC] Football Players Tracking with YOLOv5 + ByteTrack + OpenCV
dataset: https://universe.roboflow.com/roboflow-jvuqo/football-players-detection-3zvbc/dataset/4 code: https://github.com/roboflow/notebooks/blob/main/notebooks/how-to-track-football-players.ipynb video: https://youtu.be/QCG8QMhga9k
-
Should I get a Google Coral USB Accelerator for my RPI4 or should I just buy a Nvidia Jetson Nano?
Have fun! Great field. Just also try out the first few OpenCV tutorials, and train a few custom model to deploy to see what you think. Here’s a ton of free open source notebooks: https://github.com/roboflow/notebooks
supervision
-
Supervision: Reusable Computer Vision
You can always slice the images into smaller ones, run detection on each tile, and combine results. Supervision has a utility for this - https://supervision.roboflow.com/latest/detection/tools/infe..., but it only works with detections. You can get a much more accurate result this way. Here is some side-by-side comparison: https://github.com/roboflow/supervision/releases/tag/0.14.0.
- Supervision – reusable computer vision tools
-
Ask HN: Who is hiring? (February 2024)
Roboflow | Open Source Software Engineer, Web Designer / Developer, and more. | Full-time (Remote, SF, NYC) | https://roboflow.com/careers?ref=whoishiring0224
Roboflow is the fastest way to use computer vision in production. We help developers give their software the sense of sight. Our end-to-end platform[1] provides tooling for image collection, annotation, dataset exploration and curation, training, and deployment.
Over 250k engineers (including engineers from 2/3 Fortune 100 companies) build with Roboflow. We now host the largest collection of open source computer vision datasets and pre-trained models[2]. We are pushing forward the CV ecosystem with open source projects like Autodistill[3] and Supervision[4]. And we've built one of the most comprehensive resources for software engineers to learn to use computer vision with our popular blog[5] and YouTube channel[6].
We have several openings available but are primarily looking for strong technical generalists who want to help us democratize computer vision and like to wear many hats and have an outsized impact. Our engineering culture is built on a foundation of autonomy & we don't consider an engineer fully ramped until they can "choose their own loss function". At Roboflow, engineers aren't just responsible for building things but also for helping us figure out what we should build next. We're builders & problem solvers; not just coders. (For this reason we also especially love hiring past and future founders.)
We're currently hiring full-stack engineers for our ML and web platform teams, a web developer to bridge our product and marketing teams, several technical roles on the sales & field engineering teams, and our first applied machine learning researcher to help push forward the state of the art in computer vision.
[1]: https://roboflow.com/?ref=whoishiring0224
[2]: https://roboflow.com/universe?ref=whoishiring0224
[3]: https://github.com/autodistill/autodistill
[4]: https://github.com/roboflow/supervision
[5]: https://blog.roboflow.com/?ref=whoishiring0224
[6]: https://www.youtube.com/@Roboflow
-
Image segmentation in huggingface
You'll need to plot the predictions. There are a few open source tools to do that, supervision is one you can use (https://github.com/roboflow/supervision) and opencv is another common option (https://github.com/opencv/opencv)
- Show HN: Supervision, reusable computer vision utilities
- FLaNK Stack Weekly 28 August 2023
-
Show HN: Pip install inference, open source computer vision deployment
Thanks for the suggestion! Definitely agree, we’ve seen that work extremely well for Supervision[1] and Autodistill, some of our other open source projects.
There’s still a lot of polish like this we need to do; we’ve spent most of our effort cleaning up the code and documentation to prep for open sourcing the repo.
Next step is improving the usability of the pip pathway (that interface was just added; the http server was all we had for internal use). Then we’re going to focus on improving the content and expanding the models it supports.
[1] https://github.com/roboflow/supervision
[2] https://github.com/autodistill/autodistill
-
Show HN: VisionScript, abstract programming language for computer vision
a new, popular library for basic functionality (converting between annotation formats, evaluating models, doing object tracking) has been supervision: https://github.com/roboflow/supervision
-
Ask HN: Who is hiring? (August 2023)
Roboflow | Multiple Roles | Full-time (Remote, SF, NYC) | https://roboflow.com/careers?ref=whoishiring0823
Roboflow is the fastest way to use computer vision in production. We help developers give their software the sense of sight. Our end-to-end platform[1] provides tooling for image collection, annotation, dataset exploration and curation, training, and deployment.
Over 250k engineers (including engineers from 2/3 Fortune 100 companies) build with Roboflow. We now host the largest collection of open source computer vision datasets and pre-trained models[2]. We are pushing forward the CV ecosystem with open source projects like Autodistill[3] and Supervision[4]. And we've built one of the most comprehensive resources for software engineers to learn to use computer vision with our popular blog[5] and YouTube channel[6].
We have several openings available, but are primarily looking for strong technical generalists who want to help us democratize computer vision and like to wear many hats and have an outsized impact. Our engineering culture is built on a foundation of autonomy & we don't consider an engineer fully ramped until they can "choose their own loss function". At Roboflow, engineers aren't just responsible for building things but also for helping figure out what we should build next. We're builders & problem solvers; not just coders. (For this reason we also especially love hiring past and future founders.)
We're currently hiring full-stack engineers for our ML and web platform teams, a web developer to bridge our product and marketing teams, several technical roles on the sales & field engineering teams, and our first applied machine learning researcher to help push forward the state of the art in computer vision.
[1]: https://roboflow.com/?ref=whoishiring0823
[2]: https://roboflow.com/universe?ref=whoishiring0823
[3]: https://github.com/autodistill/autodistill
[4]: https://github.com/roboflow/supervision
[5]: https://blog.roboflow.com/?ref=whoishiring0823
[6]: https://www.youtube.com/@Roboflow
-
python tools to load, save, split, and convert computer vision datasets | link in comment
repository: https://github.com/roboflow/supervision documentation: https://roboflow.github.io/supervision
What are some alternatives?
ultralytics - NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
refact - WebUI for Fine-Tuning and Self-hosting of Open-Source Large Language Models for Coding
rankseg - [JMLR 2023] RankSEG: A consistent ranking-based framework for segmentation
seamless_communication - Foundational Models for State-of-the-Art Speech and Text Translation
Made-With-ML - Learn how to design, develop, deploy and iterate on production-grade ML applications.
opencopilot - 🕊️ Build and embed open-source AI Copilots into your product with ease
glami-1m - The largest multilingual image-text classification dataset. It contains fashion products.
shell-ai - LangChain powered shell command generator and runner CLI
make-sense - Free to use online tool for labelling photos. https://makesense.ai
FaceFusion - Next generation face swapper and enhancer
uav-detection - Drone / Unmanned Aerial Vehicle (UAV) Detection is a very safety critical project. It takes in Infrared (IR) video streams and detects drones in it with high accuracy.
visionscript - A high-level programming language for using computer vision.