Ask HN: Explain how size of input changes ChatGPT performance


jsonformer

25 3,868 5.4 Jupyter Notebook

A Bulletproof Way to Generate Structured JSON from Language Models

You're correct in your interpretation of how the model works with respect to returning tokens one at a time. The model returns one token, and the entire context window gets shifted right by one to account for it when generating the next one.
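To make the one-token-at-a-time loop concrete, here is a minimal sketch in plain Python. The `toy_model` function is a hypothetical stand-in for a real language model, and `context_size` is an assumed fixed window length; neither comes from the original post.

```python
from typing import Callable, List


def generate(
    prompt: List[int],
    next_token: Callable[[List[int]], int],
    max_new_tokens: int,
    context_size: int,
) -> List[int]:
    """Generate tokens one at a time, feeding each one back in as input."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        # Only the most recent `context_size` tokens are visible to the model,
        # so the window slides forward by one after every generated token.
        window = tokens[-context_size:]
        tokens.append(next_token(window))
    return tokens


def toy_model(window: List[int]) -> int:
    """Hypothetical stand-in for a model: a deterministic function of the window."""
    return (sum(window) + 1) % 10


result = generate([1, 2, 3], toy_model, max_new_tokens=3, context_size=4)
print(result)
```

Once the sequence is longer than `context_size`, the oldest token falls out of the window, which is why the third generated token in this run no longer depends on the first prompt token.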
As for model performance at different context sizes, it seems a bit complicated. From what I understand, even if a model is tweaked to accept longer contexts (for example with the SuperHOT RoPE hack or sparse attention), it still has to be fine-tuned on inputs of that increased length to actually make use of them, and performance seems to degrade as input length grows regardless.
For your question about fine-tuning models to respond with only "yes" or "no", I recommend looking into how the jsonformer library works: https://github.com/1rgs/jsonformer . Essentially, you still let the model score many candidate tokens for the next position, and you only accept the ones that satisfy certain criteria (such as the token for "yes" and the token for "no").
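The idea of accepting only tokens that satisfy a criterion can be sketched as follows. This is not jsonformer's actual implementation; it is a toy illustration that assumes the model's next-token scores are available as a dictionary from token string to score, and the names `constrained_pick` and `scores` are made up for the example.

```python
from typing import Dict, Iterable


def constrained_pick(scores: Dict[str, float], allowed: Iterable[str]) -> str:
    """Return the highest-scoring token among the allowed ones."""
    allowed_set = set(allowed)
    # Discard every candidate the caller did not whitelist.
    candidates = {tok: s for tok, s in scores.items() if tok in allowed_set}
    if not candidates:
        raise ValueError("none of the allowed tokens appear in the scores")
    return max(candidates, key=candidates.get)


# Hypothetical scores for the next position; "maybe" would win unconstrained.
scores = {"maybe": 3.2, "Yes": 2.9, "yes": 1.5, "no": 0.7}

print(constrained_pick(scores, ["yes", "no"]))
print(constrained_pick(scores, ["yes", "Yes", "no"]))
```

The two calls return different tokens because "Yes" and "yes" are scored separately, so the choice of allowed set changes the answer.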
You can do this with the OpenAI API too, using tiktoken: https://twitter.com/AAAzzam/status/1669753722828730378?t=d_W... . Be careful, though: results will differ depending on which tokens you select, since "YES", "Yes", "yes", etc. are all different tokens, to the best of my knowledge.
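The post does not say which API parameter the linked tweet relies on. One plausible route, assumed here, is the `logit_bias` request parameter, which maps token IDs to a bias value between -100 and 100. The sketch below builds such a map from any encoder function; `build_logit_bias` and `fake_encode` are hypothetical names, and the fake vocabulary only stands in for a real tokenizer so the example runs without extra dependencies.

```python
from typing import Callable, Dict, Iterable, List


def build_logit_bias(
    encode: Callable[[str], List[int]],
    words: Iterable[str],
    bias: int = 100,
) -> Dict[int, int]:
    """Map each word's single token ID to a bias value."""
    logit_bias: Dict[int, int] = {}
    for word in words:
        ids = encode(word)
        # A word that splits into several tokens cannot be forced this way.
        if len(ids) != 1:
            raise ValueError(f"{word!r} is not a single token: {ids}")
        logit_bias[ids[0]] = bias
    return logit_bias


# Hypothetical tokenizer: each casing variant has its own token ID.
FAKE_VOCAB = {"yes": 1, "Yes": 2, "YES": 3, "no": 4}


def fake_encode(text: str) -> List[int]:
    return [FAKE_VOCAB[text]]


print(build_logit_bias(fake_encode, ["yes", "no"]))
```

With tiktoken installed, `tiktoken.encoding_for_model(...).encode` could be passed as `encode` instead of the fake one, and the resulting map sent as `logit_bias` alongside a one-token output limit. Adding or omitting "Yes" and "YES" changes which token IDs get biased, which is the caveat raised above.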




This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com. Post date: 9 Aug 2023
