Python data-slicing

Open-source Python projects categorized as data-slicing

Python data-slicing Projects

data-slicing
  1. snorkel

    A system for quickly generating training data with weak supervision

    Project mention: Harnessing Weak Supervision to Isolate Sign Language in Crowded News Videos | news.ycombinator.com | 2024-08-15

    Hello everyone, we are trying to make a large dataset for Sign Language translation, inspired by BSL-1K [1]. As part of cleaning our collected videos, we use a nice technique for aggregating heuristic labels [2]. We thought it was interesting enough to share with people on here.

    [1] https://www.robots.ox.ac.uk/~vgg/research/bsl1k/

    [2] https://github.com/snorkel-team/snorkel

  2. Nutrient

    Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers. Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.

    Nutrient logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python data-slicing discussion

Log in or Post with

Index

# Project Stars
1 snorkel 5,828

Sponsored
Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers
Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.
www.nutrient.io