Top 23 Jupyter Notebook large-language-model Projects

llm-course

6 32,968 7.9 Jupyter Notebook

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Project mention: Ask HN: People who switched from GPT to their own models. How was it? | news.ycombinator.com | 2024-02-26

This is a very nice resource: https://github.com/mlabonne/llm-course

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
LLMs-from-scratch

11 19,418 9.6 Jupyter Notebook

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Project mention: Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama | news.ycombinator.com | 2024-06-13

DeepLearningExamples

7 12,821 5.7 Jupyter Notebook

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
FinGPT

11 12,396 9.4 Jupyter Notebook

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Project mention: GPT-4, without specialized training, beat a GPT-3.5 class model that cost $10B | news.ycombinator.com | 2024-03-24

There is also the open source FinGPT, that is claimed to beat GPT4 in some benchmarks at a fine tuning cost of $17.25.
https://github.com/AI4Finance-Foundation/FinGPT

Promptify

29 3,089 8.5 Jupyter Notebook

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Project mention: Promptify 2.0: More Structured, More Powerful LLMs with Prompt-Optimization, Prompt-Engineering, and Structured Json Parsing with GPT-n Models! 🚀 | /r/ArtificialInteligence | 2023-07-31

First up, a huge Thank You for making Promptify a hit with over 2.3k+ stars on Github ! 🌟

ReAct

1 1,679 4.8 Jupyter Notebook

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models (by ysymyth)
EasyEdit

6 1,523 9.8 Jupyter Notebook

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Project mention: ChatGPT provides false information about people, and OpenAI can't correct it | news.ycombinator.com | 2024-04-29

> The article talks about OpenAI being unwilling to correct errors. But they just can’t.
There are actually several algorithms intended to allow fact editing in LLMs: https://github.com/zjunlp/EasyEdit?tab=readme-ov-file#curren...
They don't work perfectly (e.g. "Tim Cook is CEO of Apple" and "The CEO of Apple is Tim Cook" for some reason have to be edited separately) but there are certainly techniques available.

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
alpaca_eval

5 1,224 9.6 Jupyter Notebook

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Project mention: MT-Bench: Comparing different LLM Judges | dev.to | 2024-06-08

Another popular option for LLM evaluation is AlpacaEval. This one uses a newer and cheaper GPT-4 Turbo model as a baseline. The authors of AlpacaEval provided correlation coefficients of different evals with LMSYS Arena showing a strong association between LLM judges' scores and human preferences at the Arena:

Get-Things-Done-with-Prompt-Engineering-and-LangChain

18 997 8.2 Jupyter Notebook

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

Project mention: Get-Things-Done-with-Prompt-Engineering-and-LangChain: NEW Data - star count:617.0 | /r/algoprojects | 2023-12-10

ontogpt

2 539 9.8 Jupyter Notebook

LLM-based ontological extraction tools, including SPIRES

Project mention: GPT-based ontological extraction tools, including SPIRES | news.ycombinator.com | 2023-07-24

xmtf

2 504 5.9 Jupyter Notebook

Crosslingual Generalization through Multitask Finetuning
fromage

2 462 6.3 Jupyter Notebook

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
KG_RAG

5 444 9.7 Jupyter Notebook

Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks

Project mention: A list of system prompts used for biomedical RAG (KG-RAG) using LLM | news.ycombinator.com | 2024-01-10

PIXIU

6 429 8.9 Jupyter Notebook

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

Project mention: PIXIU: NEW Data - star count:172.0 | /r/algoprojects | 2023-08-15

llm-search

2 405 8.2 Jupyter Notebook

Querying local documents, powered by LLM

Project mention: Querying local documents, powered by LLM | news.ycombinator.com | 2023-11-07

hyde

2 362 10.0 Jupyter Notebook

HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels (by texttron)

Project mention: Show HN: Hacker Search – A semantic search engine for Hacker News | news.ycombinator.com | 2024-05-02

HyDE apparently means “Hypothetical Document Embeddings”, which seems to be a kind of generative query expansion/pre-processing
https://arxiv.org/abs/2212.10496
https://github.com/texttron/hyde
From the abstract:
Given a query, HyDE first zero-shot instructs an instruction-following language model (e.g. InstructGPT) to generate a hypothetical document. The document captures relevance patterns but is unreal and may contain false details. Then, an unsupervised contrastively learned encoder~(e.g. Contriever) encodes the document into an embedding vector. This vector identifies a neighborhood in the corpus embedding space, where similar real documents are retrieved based on vector similarity. This second step ground the generated document to the actual corpus, with the encoder's dense bottleneck filtering out the incorrect details.

datablations

6 297 6.9 Jupyter Notebook

Scaling Data-Constrained Language Models

Project mention: Gemini is only 1x Chinchilla, so it undertrained for production | /r/singularity | 2023-12-07

1x chinchilla means it's not really undertrained but that more could be squeezed without excessive difficulty https://arxiv.org/abs/2305.16264

generativeAgent_LLM

4 264 5.8 Jupyter Notebook

Implementation of "Generative Agents: Interactive Simulacra of Human Behavior" paper with Guidance and Langchain. Full features and work with local LLMs.
ToolQA

1 213 6.1 Jupyter Notebook

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

Project mention: 🔍📊 Exciting development in the AI world: Introducing ToolQA, a new dataset that evaluates how well Large Language Models (LLMs) can use external tools for question answering. | /r/machinelearningnews | 2023-07-01

langforge

1 163 6.6 Jupyter Notebook

A Toolkit for Creating and Deploying LangChain Apps
localLLM_guidance

3 148 4.2 Jupyter Notebook

Local LLM ReAct Agent with Guidance
FastLoRAChat

2 119 7.2 Jupyter Notebook

Instruct-tune LLaMA on consumer hardware with shareGPT data
seemore

2 116 8.3 Jupyter Notebook

From scratch implementation of a vision language model in pure PyTorch

Project mention: A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file | news.ycombinator.com | 2024-05-05

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook large-language-models discussion

Jupyter Notebook large-language-models related posts

Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama

1 project | news.ycombinator.com | 13 Jun 2024
Finetuning an LLM-Based Spam Classifier with LoRA from Scratch

1 project | news.ycombinator.com | 11 May 2024
A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file

1 project | news.ycombinator.com | 5 May 2024
Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes

1 project | news.ycombinator.com | 3 May 2024
ChatGPT provides false information about people, and OpenAI can't correct it

1 project | news.ycombinator.com | 29 Apr 2024
Insights from Finetuning LLMs for Classification Tasks

1 project | news.ycombinator.com | 28 Apr 2024
Comparing 5 ways to implement Multihead Attention in PyTorch

1 project | news.ycombinator.com | 8 Mar 2024
A note from our sponsor - SaaSHub
www.saashub.com | 16 Jun 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source large-language-model projects in Jupyter Notebook? This list will help you:

	Project	Stars
1	llm-course	32,968
2	LLMs-from-scratch	19,418
3	DeepLearningExamples	12,821
4	FinGPT	12,396
5	Promptify	3,089
6	ReAct	1,679
7	EasyEdit	1,523
8	alpaca_eval	1,224
9	Get-Things-Done-with-Prompt-Engineering-and-LangChain	997
10	ontogpt	539
11	xmtf	504
12	fromage	462
13	KG_RAG	444
14	PIXIU	429
15	llm-search	405
16	hyde	362
17	datablations	297
18	generativeAgent_LLM	264
19	ToolQA	213
20	langforge	163
21	localLLM_guidance	148
22	FastLoRAChat	119
23	seemore	116

Jupyter Notebook large-language-models

Top 23 Jupyter Notebook large-language-model Projects

Jupyter Notebook large-language-models discussion

Jupyter Notebook large-language-models related posts

Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama

Finetuning an LLM-Based Spam Classifier with LoRA from Scratch

A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file

Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes

ChatGPT provides false information about people, and OpenAI can't correct it

Insights from Finetuning LLMs for Classification Tasks

Comparing 5 ways to implement Multihead Attention in PyTorch

Index