[P] Training synthetic models on highly complex datasets.

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning.

  • trainer

    A simple interface for synthesizing complex, high-dimensional datasets using Gretel APIs. (by gretelai)

  • We published a notebook and a GitHub repo that help you train synthetic models on high-dimensional datasets (e.g. thousands of columns and millions of records). It works by using Gretel's open-source header clustering to group correlated columns, then parallelizing training across multiple GPUs. https://github.com/gretelai/trainer

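To make the idea of grouping correlated columns concrete, here is a minimal sketch in plain Python. It is not Gretel's header clustering implementation and does not use the trainer API. The `pearson` and `cluster_headers` helpers, the 0.8 threshold, and the toy data are all assumptions made for illustration. The sketch links any two columns whose absolute Pearson correlation meets the threshold and returns the resulting groups of headers.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # constant column: treat as uncorrelated
    return cov / sqrt(var_x * var_y)


def cluster_headers(columns, threshold=0.8):
    """Group column names whose |correlation| >= threshold (union-find).

    Hypothetical helper for illustration; not part of any Gretel library.
    """
    parent = {name: name for name in columns}

    def find(name):
        # Follow parent links to the group root, shortening the path as we go.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    # Merge the groups of every sufficiently correlated pair of columns.
    for a, b in combinations(columns, 2):
        if abs(pearson(columns[a], columns[b])) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in columns:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


# Toy data (made up for illustration): two strongly correlated pairs.
data = {
    "height_cm": [150, 160, 170, 180, 190],
    "height_in": [59.1, 63.0, 66.9, 70.9, 74.8],
    "clicks": [5, 1, 4, 2, 3],
    "clicks_x2": [10, 2, 8, 4, 6],
}
groups = cluster_headers(data)
print(groups)
```

Under this scheme, each group of headers could be handed to its own training job, which is one way the per-group work might be spread across several GPUs. The pairwise loop is quadratic in the number of columns, so a dataset with thousands of columns would likely need a vectorized correlation matrix rather than this pure-Python version.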