Language models can explain neurons in language models

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

neural-network-from-scratch

3 114 0.0 Rust

A neural network library written from scratch in Rust along with a web-based application for building + training neural networks + visualizing their outputs

I built a toy neural network that runs in the browser[1] to model 2D functions with the goal of doing something similar to this research (in a much more limited manner, ofc). Since the input space is so much more limited than language models or similar, it's possible to examine the outputs for each neuron for all possible inputs, and in a continuous manner.
In some cases, you can clearly see neurons that specialize to different areas of the function being modeled, like this one: https://i.ameo.link/b0p.png
This OpenAI research seems to be feeding lots of varied input text into the models they're examining and keeping track of the activations of different neurons along the way. Another method I remember seeing used in the past involves using an optimizer to generate inputs that maximally activate particular neurons in vision models[2].
I'm sure that's much more difficult or even impossible for transformers which operate on sequences of tokens/embeddings rather than single static input vectors, but maybe there's a way to generate input embeddings and then use some method to convert them back into tokens.
[1] https://nn.ameo.dev/
[2] https://www.tensorflow.org/tutorials/generative/deepdream

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Examine individual neurons of a small neural network in the browser

1 project | news.ycombinator.com | 10 May 2023
C++ neural network low level engine

2 projects | /r/cpp | 19 Apr 2022
"If this one guy got hit by a bus, the software would fall apart."

1 project | news.ycombinator.com | 6 Apr 2024
Show HN: Kiwi – End-to-End Kafka Subscriptions with WebAssembly

2 projects | news.ycombinator.com | 6 Apr 2024
Show HN: A fast HNSW implementation in Rust

6 projects | news.ycombinator.com | 14 Mar 2024

Language models can explain neurons in language models

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
neural-network Rust WebAssembly gradient-descent Backpropagation
Post date: 9 May 2023

neural-network-from-scratch

InfluxDB

Related posts

Examine individual neurons of a small neural network in the browser

C++ neural network low level engine

"If this one guy got hit by a bus, the software would fall apart."

Show HN: Kiwi – End-to-End Kafka Subscriptions with WebAssembly

Show HN: A fast HNSW implementation in Rust

Language models can explain neurons in language models

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com neural-network Rust WebAssembly gradient-descent Backpropagation Post date: 9 May 2023

neural-network-from-scratch

InfluxDB

Related posts

Examine individual neurons of a small neural network in the browser

C++ neural network low level engine

"If this one guy got hit by a bus, the software would fall apart."

Show HN: Kiwi – End-to-End Kafka Subscriptions with WebAssembly

Show HN: A fast HNSW implementation in Rust

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
neural-network Rust WebAssembly gradient-descent Backpropagation
Post date: 9 May 2023