Jupyter Notebook large-language-models

Open-source Jupyter Notebook projects categorized as large-language-models

Top 23 Jupyter Notebook large-language-model Projects

large-language-models
  • llm-course

    Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

  • Project mention: Ask HN: People who switched from GPT to their own models. How was it? | news.ycombinator.com | 2024-02-26

    This is a very nice resource: https://github.com/mlabonne/llm-course

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • LLMs-from-scratch

    Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

  • Project mention: Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama | news.ycombinator.com | 2024-06-13
  • DeepLearningExamples

    State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

  • FinGPT

    FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

  • Project mention: GPT-4, without specialized training, beat a GPT-3.5 class model that cost $10B | news.ycombinator.com | 2024-03-24

    There is also the open source FinGPT, that is claimed to beat GPT4 in some benchmarks at a fine tuning cost of $17.25.

    https://github.com/AI4Finance-Foundation/FinGPT

  • Promptify

    Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

  • Project mention: Promptify 2.0: More Structured, More Powerful LLMs with Prompt-Optimization, Prompt-Engineering, and Structured Json Parsing with GPT-n Models! 🚀 | /r/ArtificialInteligence | 2023-07-31

    First up, a huge Thank You for making Promptify a hit with over 2.3k+ stars on Github ! 🌟

  • ReAct

    [ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models (by ysymyth)

  • EasyEdit

    [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

  • Project mention: ChatGPT provides false information about people, and OpenAI can't correct it | news.ycombinator.com | 2024-04-29

    > The article talks about OpenAI being unwilling to correct errors. But they just can’t.

    There are actually several algorithms intended to allow fact editing in LLMs: https://github.com/zjunlp/EasyEdit?tab=readme-ov-file#curren...

    They don't work perfectly (e.g. "Tim Cook is CEO of Apple" and "The CEO of Apple is Tim Cook" for some reason have to be edited separately) but there are certainly techniques available.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • alpaca_eval

    An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

  • Project mention: MT-Bench: Comparing different LLM Judges | dev.to | 2024-06-08

    Another popular option for LLM evaluation is AlpacaEval. This one uses a newer and cheaper GPT-4 Turbo model as a baseline. The authors of AlpacaEval provided correlation coefficients of different evals with LMSYS Arena showing a strong association between LLM judges' scores and human preferences at the Arena:

  • Get-Things-Done-with-Prompt-Engineering-and-LangChain

    LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

  • Project mention: Get-Things-Done-with-Prompt-Engineering-and-LangChain: NEW Data - star count:617.0 | /r/algoprojects | 2023-12-10
  • ontogpt

    LLM-based ontological extraction tools, including SPIRES

  • Project mention: GPT-based ontological extraction tools, including SPIRES | news.ycombinator.com | 2023-07-24
  • xmtf

    Crosslingual Generalization through Multitask Finetuning

  • fromage

    🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

  • KG_RAG

    Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks

  • Project mention: A list of system prompts used for biomedical RAG (KG-RAG) using LLM | news.ycombinator.com | 2024-01-10
  • PIXIU

    This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

  • Project mention: PIXIU: NEW Data - star count:172.0 | /r/algoprojects | 2023-08-15
    Project mention: Querying local documents, powered by LLM | news.ycombinator.com | 2023-11-07
  • hyde

    HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels (by texttron)

  • Project mention: Show HN: Hacker Search – A semantic search engine for Hacker News | news.ycombinator.com | 2024-05-02

    HyDE apparently means “Hypothetical Document Embeddings”, which seems to be a kind of generative query expansion/pre-processing

    https://arxiv.org/abs/2212.10496

    https://github.com/texttron/hyde

    From the abstract:

    Given a query, HyDE first zero-shot instructs an instruction-following language model (e.g. InstructGPT) to generate a hypothetical document. The document captures relevance patterns but is unreal and may contain false details. Then, an unsupervised contrastively learned encoder~(e.g. Contriever) encodes the document into an embedding vector. This vector identifies a neighborhood in the corpus embedding space, where similar real documents are retrieved based on vector similarity. This second step ground the generated document to the actual corpus, with the encoder's dense bottleneck filtering out the incorrect details.

  • datablations

    Scaling Data-Constrained Language Models

  • Project mention: Gemini is only 1x Chinchilla, so it undertrained for production | /r/singularity | 2023-12-07

    1x chinchilla means it's not really undertrained but that more could be squeezed without excessive difficulty https://arxiv.org/abs/2305.16264

  • generativeAgent_LLM

    Implementation of "Generative Agents: Interactive Simulacra of Human Behavior" paper with Guidance and Langchain. Full features and work with local LLMs.

  • ToolQA

    ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

  • Project mention: 🔍📊 Exciting development in the AI world: Introducing ToolQA, a new dataset that evaluates how well Large Language Models (LLMs) can use external tools for question answering. | /r/machinelearningnews | 2023-07-01
  • langforge

    A Toolkit for Creating and Deploying LangChain Apps

  • localLLM_guidance

    Local LLM ReAct Agent with Guidance

  • FastLoRAChat

    Instruct-tune LLaMA on consumer hardware with shareGPT data

  • seemore

    From scratch implementation of a vision language model in pure PyTorch

  • Project mention: A Simple Version of Grok 1.5/ GPT-4 Vision from scratch, in one PyTorch file | news.ycombinator.com | 2024-05-05
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook large-language-models discussion

Log in or Post with

Jupyter Notebook large-language-models related posts

Index

What are some of the best open-source large-language-model projects in Jupyter Notebook? This list will help you:

Project Stars
1 llm-course 32,968
2 LLMs-from-scratch 19,418
3 DeepLearningExamples 12,821
4 FinGPT 12,396
5 Promptify 3,089
6 ReAct 1,679
7 EasyEdit 1,523
8 alpaca_eval 1,224
9 Get-Things-Done-with-Prompt-Engineering-and-LangChain 997
10 ontogpt 539
11 xmtf 504
12 fromage 462
13 KG_RAG 444
14 PIXIU 429
15 llm-search 405
16 hyde 362
17 datablations 297
18 generativeAgent_LLM 264
19 ToolQA 213
20 langforge 163
21 localLLM_guidance 148
22 FastLoRAChat 119
23 seemore 116

Sponsored
Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com