[News] OpenAI Announced GPT-4

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

flash-attention

26 10,773 9.4 Python

Fast and memory-efficient exact attention

As posted above, it seems likely that GPT4 uses Flash Attention. Their GitHub page claims that an A100 tops out at 4k tokens. It was my understanding that this was a hard upper limit given the current hardware. So scaling to 32k wouldn't just mean throwing more compute at the problem, but rather a change in the architecture. Flash Attention is an architecture change that can achieve 32k (even 64k according to the GitHub page) context length on an A100.

RWKV-LM

84 11,619 8.8 Python

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Keep an eye on projects like this RWKV-LM that are looking promising in certain cases as they develop.

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Show HN: I made a ROS package for realtime semantic segmentation
1 project | news.ycombinator.com | 26 Apr 2024
OpenVoice: Instant Voice Cloning
1 project | news.ycombinator.com | 26 Apr 2024
Embracing Component-Based Templates with JinjaX
1 project | dev.to | 26 Apr 2024
Turbocharge your Lambda Functions with AWS Lambda Powertools for Python
1 project | dev.to | 25 Apr 2024
PyPy v7.3.16 Release
4 projects | news.ycombinator.com | 24 Apr 2024

[News] OpenAI Announced GPT-4

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning Post date: 15 Mar 2023

flash-attention

RWKV-LM

InfluxDB

Related posts