[D] GPT-3T: Can we train language models to think further ahead?

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

GPT-3T

7 11 5.3 Python

Building language models to predict more than one token ahead to enable further ahead predictions.

Link to the repo here

EfficientZero

9 825 0.0 Python

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Here's an algorithm that is more sample efficient : https://github.com/YeWR/EfficientZero

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Free open-source solution for cybersecurity posture management (GRC)
1 project | news.ycombinator.com | 25 Apr 2024
The Adventures of Blink #20: Facial Recognition with Python
1 project | dev.to | 25 Apr 2024
Tribler: An attack-resilient micro-economy for media
1 project | news.ycombinator.com | 25 Apr 2024
Show HN: Geopolitical and Environmental Risk Monitor for Companies House
1 project | news.ycombinator.com | 25 Apr 2024
Gemini API 102: Next steps beyond "Hello World!"
5 projects | dev.to | 24 Apr 2024

[D] GPT-3T: Can we train language models to think further ahead?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning Post date: 19 Apr 2023

GPT-3T

EfficientZero

WorkOS

Related posts