Google's First Tensor Processing Unit: Architecture

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • flash-attention-minimal

    Flash Attention in ~100 lines of CUDA (forward pass only)

  • Vulcan is a driver-level API. It competes with DirectX and OpenGL.

    CUDA is a language you write kernels. It competes with OpenAI's Triton language.

    Here's what CUDA looks like: https://github.com/tspeterkim/flash-attention-minimal/blob/m...

    This is what Triton looks like: https://triton-lang.org/main/getting-started/tutorials/06-fu...

    By contrast Vulcan looks like this: https://github.com/KhronosGroup/Vulkan-Samples/blob/main/sam...

    (It's true to some extent that maybe you could use Vulcan shaders to write deep learning kernels, maybe? I'm not aware of anyone doing it though)

  • Vulkan-Samples

    One stop solution for all Vulkan samples

  • Vulcan is a driver-level API. It competes with DirectX and OpenGL.

    CUDA is a language you write kernels. It competes with OpenAI's Triton language.

    Here's what CUDA looks like: https://github.com/tspeterkim/flash-attention-minimal/blob/m...

    This is what Triton looks like: https://triton-lang.org/main/getting-started/tutorials/06-fu...

    By contrast Vulcan looks like this: https://github.com/KhronosGroup/Vulkan-Samples/blob/main/sam...

    (It's true to some extent that maybe you could use Vulcan shaders to write deep learning kernels, maybe? I'm not aware of anyone doing it though)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts