[R] Greg Yang's work on a rigorous mathematical theory for neural networks

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • GP4A

    Code for NeurIPS 2019 paper: "Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes"

  • took a quick glance (https://arxiv.org/abs/1910.12478 and https://proceedings.mlr.press/v139/yang21c.html), a few theorems but where r the proofs?

  • mup

    maximal update parametrization (µP)

  • Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes: https://arxiv.org/abs/1910.12478 Tensor Programs II: Neural Tangent Kernel for Any Architecture: https://arxiv.org/abs/2006.14548 Tensor Programs III: Neural Matrix Laws: https://arxiv.org/abs/2009.10685 Tensor Programs IV: Feature Learning in Infinite-Width Neural Networks: https://proceedings.mlr.press/v139/yang21c.html Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer: https://arxiv.org/abs/2203.03466

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • NTK4A

    Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"

  • Found relevant code at https://github.com/thegregyang/NTK4A + all code implementations here

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Bard is getting better at logic and reasoning

    1 project | news.ycombinator.com | 7 Jun 2023
  • Cerebras Open Sources Seven GPT models and Introduces New Scaling Law

    3 projects | /r/mlscaling | 28 Mar 2023
  • OpenAI’s policies hinder reproducible research on language models

    2 projects | news.ycombinator.com | 23 Mar 2023
  • DeepMind’s New Language Model,Chinchilla(70B Parameters),Which Outperforms GPT-3

    3 projects | news.ycombinator.com | 11 Apr 2022
  • "Training Compute-Optimal Large Language Models", Hoffmann et al 2022 {DeepMind} (current LLMs are significantly undertrained)

    1 project | /r/mlscaling | 31 Mar 2022