[D] Efficiently loading videos in PyTorch without extracting frames

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • DALI

    A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

  • SlowFast

    PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

  • Try using ffmpeg. They use av Python bindings in this repository: https://github.com/facebookresearch/SlowFast/tree/main/slowfast/datasets

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • vision

    Datasets, Transforms and Models specific to Computer Vision

  • Maybe VideoClips? see the discussion here: https://github.com/pytorch/vision/issues/1446.

  • imutils

    A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts