automatic-video-processing VS deep_sort

Compare automatic-video-processing vs deep_sort and see what are their differences.

automatic-video-processing

Turn any live video stream or locally stored video into a dataset of interesting samples for ML training, or any other type of analysis. (by Sieve-Data)

deep_sort

Simple Online Realtime Tracking with a Deep Association Metric (by nwojke)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
automatic-video-processing deep_sort
23 10
72 5,059
- -
5.0 0.0
about 2 years ago 13 days ago
Python Python
- GNU General Public License v3.0 only
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

automatic-video-processing

Posts with mentions or reviews of automatic-video-processing. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-04-21.
  • Building an API + query language for rich data like images and video
    2 projects | /r/datascience | 21 Apr 2022
    I'm one of the creators of Sieve, and I'm looking for general thoughts on this problem.
  • I built the easiest way to process and tag videos with AI
    3 projects | /r/SideProject | 21 Apr 2022
  • The YC Winter 2022 Batch
    1 project | news.ycombinator.com | 29 Mar 2022
    https://sievedata.com seems very promising, a search engine for videos, with specific tags, sounds like a very good idea.

    I'd like the same for all my photos and videos: that would be so much easier to find specific pictures by keywords

  • Locally vs cloud stored management systems
    1 project | /r/SecurityCamera | 25 Feb 2022
    The reason I ask is because I'm working on something called Sieve. We're basically making it really easy for any software developer to process and understand video content. This includes applications from home security, to pet monitoring, baby monitoring, sports analytics, and media understanding.
  • AI video understanding in games
    1 project | /r/esports | 25 Feb 2022
    Hey everyone! I'm the creator of Sieve, an API for AI-based video understanding. One application we're starting to support in beta is tracking player / object movements, speed, etc in video games. All you do is push video to our API, which we then process, after which you can search + query using API calls. We're starting by supporting a few popular games like League of Legends, Dota 2, CSGO, and Overwatch. Here are the docs.
  • Gauging sentiment in sales calls?
    1 project | /r/sales | 24 Feb 2022
    For context I'm the founder of a company called Sieve which is starting to work with some of these tools to automatically gauge things like attentiveness and facial expressions by automatically analyzing the video. Would be interesting to hear what you as users actually want.
  • [D] How computer vision will take over the world
    1 project | /r/MachineLearning | 23 Feb 2022
    P.S. I am potentially very bias because I'm working on Sieve which is trying to work with these applications.
  • Smart features that are actually helpful?
    1 project | /r/VideoEditing | 20 Feb 2022
    Hey everyone! I recently started building Sieve, a really easy way for devs to understand video content. We've just started to work with quite a few video editing tools / companies (both online and offline ones) after having primarily focused on real-world applications like security, supply chain, and general media.
  • [P] Sieve: Process 24 hours of video in 10 mins (UPDATE - try it yourself!)
    1 project | /r/MachineLearning | 2 Feb 2022
    Hey everyone! I’m one of the creators of Sieve. I posted about it here a while back and thought I'd share that r/MachineLearning can now try it for free :)
  • Launch HN: Sieve (YC W22) – Pluggable APIs for Video Search
    1 project | news.ycombinator.com | 2 Feb 2022
    Hi HN, we’re Mokshith and Abhi from Sieve (https://sievedata.com). We’re building an API that lets you add video search to internal tools or customer applications, instantly. Sieve can process 24 hours of video in less than 10 minutes, and makes it easy to search video by detected objects / characteristics, motion data, and visual similarity. You can use our models out of the box, or plug-in your own model endpoints into our infrastructure. Models can mean any software that produces output given an image.

    Every industry from security, to media, supply chain, construction, retail, sports, and agriculture is being transformed by video analytics—but setting up the infrastructure to process video data quickly is difficult. Having to deal with video ingestion pipelines, computer-vision model training, and search functionality is not pretty. We’re building a platform that takes care of all of this so teams can focus on their domain-expertise, building industry-specific software.

    We met in high school, and were on the robotics team together. It was our first exposure to computer vision, and something we both deeply enjoyed. We ended up going to UC Berkeley together and worked on computer vision at places like Scale AI, Niantic, Ford, NVIDIA, Microsoft, and Second Spectrum. We were initially trying to solve problems for ourselves as computer vision developers but quickly realized the unique problems in video having to do with cost, efficiency, and scale. We also realized how important video would be in lots of verticals, and saw an opportunity to build infrastructure which wouldn’t have to be rebuilt by a fullstack dev at any company again.

    Let’s take the example of cloud software for construction which might include tons of features from asset trackers to rental management and compliance checks. It doesn’t make sense for them to build their own video processing for telematics—the density and scale of video make this a difficult task. A single 30 FPS camera generates over 2.5M frames within a day of recording. Imagine this across thousands of cameras and many weeks of footage—not to mention the actual vertical-specific software they’re building for end users.

    Sieve takes care of everything hard about processing and searching video. Our API allows you to process and search video with just two API calls. We use filtering, parallelization, and interpolation techniques to keep costs low, while being able to process 24 hours of video in under 10 minutes. Users can choose from our pre-existing set of models, or use their own models with our video processing engine. Our pricing can range anywhere from $0.08-$0.45 per minute of video processed based on the models clients are interested in and usage volume. Our FAQ page (https://sievedata.com/faq) explains these factors in more detail.

    Our backend is built on serverless functions. We split each video into individual chunks which are processed in parallel and passed through multiple layers of filters to determine which chunks are “important”. We’re able to algorithmically ignore parts of video which are static, or change minimally, and focus on the parts that contain real action. We then run more expensive models on the most “important” parts of video, and interpolate results across frames to return information to customers at 30 FPS granularity. Our customers simply push signed video URLs to our platform, and this happens automatically. You can then use our API to query for intervals of interest.

    We haven’t built an automated sign up flow yet because of our focus on the core product, but we still wanted to give all of you the chance to try Sieve on your own videos for free. You’ll be emailed a personal, limited-access API key.

    Try it out: https://sieve-data.notion.site/Trying-Sieve-s-Video-Search-4...

    Visual dashboard demo: https://www.youtube.com/watch?v=_uyjp_HGZl4

    We’d love to hear what you think about the product and vision, and ideas on how we can improve it. Thanks for taking the time to read this, we’re grateful to be posting here :)

deep_sort

Posts with mentions or reviews of deep_sort. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-16.
  • Similari 0.26.2: MOT framework with Python bindings
    3 projects | /r/computervision | 16 May 2023
    Similari is a Rust/Python framework aimed at building sophisticated tracking systems. With Similari, you can develop highly efficient parallelized SORT, DeepSORT, and other sophisticated multiple-object tracking engines.
  • How to integrate DeepSORT with YOLOv8
    2 projects | /r/computervision | 25 Mar 2023
    I'm doing a Python personal project where I'm trying to use YOLOv8 and DeepSORT to detect vehicles from a car's dash cam footage. I succeeded in using YOLOv8 to output the correct bounding boxes by processing each camera frame. However, I tried to add on DeepSORT code, but it made the detection accuracy significantly worse. I'm pretty sure I need to train my own "feature extractor" for DeepSORT to create a new .pb file. I got this information from the deep_sort GitHub link: https://github.com/nwojke/deep_sort. I tried to find resources to do this but they are pretty scarce. Has anyone had experience with this problem?
  • Need to download resources for DeepSORT from pan.baidu.com
    3 projects | /r/computervision | 6 Dec 2022
    The feature model well that looks like it is at that domain you mentioned but why not instead of using this repo use the original authors repo
  • DeepSort with PyTorch(support yolo series)
    13 projects | /r/u_No_Experience9104 | 20 Sep 2022
    nwojke/deep_sort
  • Kalman filter in Rust runs 120+ times faster than NumPy, SciKit implementation
    4 projects | /r/rust | 25 Jul 2022
    I was implementing the Kalman filter for bounding boxes during the last two days. As an inspiration source, I looked at the Python3 Kalman filter implementation that is used in the DeepSORT algorithm and uses NumPy and SciKit under the hood, so it's pretty efficient because all the operations are run inside FFI.
  • [P] The easiest way to process and tag video data
    1 project | /r/MachineLearning | 2 May 2022
    There's tons of work out there when it comes to object tracking such as DeepSort. We've worked to build simpler, more efficient solutions in-house though. Then past that, it's a matter of treating everything in the video as an object (including the whole frame), tracking it, and saving it in a no-SQL DB such that it's easy to query in this way.
  • Building an API + query language for rich data like images and video
    2 projects | /r/datascience | 21 Apr 2022
    Right now, the way we're thinking about it is to turn videos into something that works with the structure of a database like MongoDB. Everything in an image or a video is an object (even the frame itself is an object with a large bounding box), and each of these objects has some attributes and can be tracked over time with some form of object tracking. Given that the objects are tracked, they can each basically be returned as a time-series of each of the attributes associated with that object.
  • Could someone suggest a good article that explains the implementation of deep sort algorithm ?
    1 project | /r/deeplearning | 21 Jan 2022
    Deep Sort
  • How do I train the DeepSORT tracker for a custom class?
    2 projects | /r/computervision | 13 Apr 2021
    I was wondering if I could use the same annotated data(in YOLO format) for the training of the tracker as well. I took a look at the original repo for DeepSORT, and it does mention the training using cosine metric learning, but I could not seem to understand how to replicate that for my own dataset(they show us how to do it for the MARS and Market1501 datasets).
  • Is it possible to track objects on the go?
    2 projects | /r/learnmachinelearning | 11 Mar 2021
    DeepSORT is one of the best trackers https://github.com/nwojke/deep_sortIt requires an object detector tho, like YOLO https://pjreddie.com/darknet/yolo/

What are some alternatives?

When comparing automatic-video-processing and deep_sort you can also consider the following projects:

nodejs-vision - Node.js client for Google Cloud Vision: Derive insight from images.

sort - Simple, online, and realtime tracking of multiple objects in a video sequence.

rpi-object-detection - Real-time object detection and tracking using Raspberry Pi and OpenCV!

yolov4-deepsort - Object tracking implemented with YOLOv4, DeepSort, and TensorFlow.

Similari - A framework for building high-performance real-time multiple object trackers

mmdetection - OpenMMLab Detection Toolbox and Benchmark

yolo_series_deepsort_pytorch - Deepsort with yolo series. This project support the existing yolo detection model algorithm (YOLOV8, YOLOV7, YOLOV6, YOLOV5, YOLOV4Scaled, YOLOV4, YOLOv3', PPYOLOE, YOLOR, YOLOX ).

yolov5 - YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

yolor - implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks (https://arxiv.org/abs/2105.04206)

DeepSORT - support deepsort and bytetrack MOT(Multi-object tracking) using yolov5 with C++

ScaledYOLOv4 - Scaled-YOLOv4: Scaling Cross Stage Partial Network

PyTorch_YOLOv4 - PyTorch implementation of YOLOv4