Nvidia Hopper Sweeps AI Inference Benchmarks in MLPerf Debut

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • m1_huggingface_diffusers_demo

    Demo of how to get HuggingFace Diffusers working on an M1 Mac

  • Out of interest I've been running a bunch of the huggingface version of StableDiffusion using the M1 accelerated branch on my M1 Max[1]. I'm getting 1.54 it/s compared to 2.0 it/s for a Nvidia T4 Tesla on Google Collab.

    T4 Tesla gets 21,691 queries/second for for ResNet, compared to 81,292 q/s for the new H100, 41,893 q/s for the A100 and 6164 q/s for the new Jetson.

    So you can expect maybe 15,000 q/s on a M1 Max. But some tests seem to indicate a lot less[2] - not sure what is happening there.

    [1] Setup like this: https://github.com/nlothian/m1_huggingface_diffusers_demo

    [2] https://tlkh.dev/benchmarking-the-apple-m1-max#heading-resne...

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Show HN: Hamilton's UI – observability, lineage, and catalog for data pipelines

    1 project | news.ycombinator.com | 2 May 2024
  • ESpeak-ng: speech synthesizer with more than one hundred languages and accents

    6 projects | news.ycombinator.com | 1 May 2024
  • Quantum Computing Collection of Resources

    1 project | news.ycombinator.com | 2 May 2024
  • 2024 Verizon Data Breach Investigation Report [pdf]

    1 project | news.ycombinator.com | 1 May 2024
  • Impact of Input Length on the Reasoning Performance of Large Language Models

    1 project | news.ycombinator.com | 1 May 2024