training-data

Open-source projects categorized as training-data

Top 7 training-data Open-Source Projects

  • snorkel

    A system for quickly generating training data with weak supervision

  • diffgram

    The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • ydata-synthetic

    Synthetic data generators for tabular and time-series data

  • Project mention: Coding Wonderland: Contribute to YData Profiling and YData Synthetic in this Advent of Code | dev.to | 2023-12-05

    Send us your North ⭐️: "On the first day of Christmas, my true contributor gave to me..." a star in my GitHub tree! 🎵 If you love these projects too, star ydata-profiling or ydata-synthetic and let your friends know why you love it so much!

  • skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

  • compose

    A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning. (by alteryx)

  • trainset

    A lightweight web application for brushing labels onto time series data; useful for building training sets.

  • BingImageAITrainer

    A tool for generating diverse synthetic training images using Bing Image Creator to facilitate the training of AI/ML image models.

  • Project mention: I made a tool for generating training images | /r/ArtificialInteligence | 2023-06-06

    https://github.com/atticusrussell/BingImageAITrainer Getting training data for AI models involving imaging is difficult. I created this tool to generate synthetic training images while changing as many or as few parameters about the images as desired. This is accomplished using Bing Image Creator, which uses the DALLE-2 model.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

training-data related posts

  • trainset: NEW Data - star count:120.0

    1 project | /r/algoprojects | 26 Nov 2022
  • trainset: NEW Data - star count:120.0

    1 project | /r/algoprojects | 25 Nov 2022
  • trainset: NEW Data - star count:120.0

    1 project | /r/algoprojects | 24 Nov 2022
  • trainset: NEW Data - star count:120.0

    1 project | /r/algoprojects | 23 Nov 2022
  • trainset: NEW Data - star count:120.0

    1 project | /r/algoprojects | 22 Nov 2022
  • trainset: NEW Data - star count:120.0

    1 project | /r/algoprojects | 21 Nov 2022
  • trainset: NEW Data - star count:120.0

    1 project | /r/algoprojects | 20 Nov 2022
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 3 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source training-data projects? This list will help you:

Project Stars
1 snorkel 5,712
2 diffgram 1,800
3 ydata-synthetic 1,292
4 skweak 909
5 compose 471
6 trainset 154
7 BingImageAITrainer 3

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com