Tuning and Testing Llama 2, Flan-T5, and GPT-J with LoRA, Sematic, and Gradio

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

training-code

1 103 7.0 Python

The code we currently use to fine-tune models.

https://github.com/PygmalionAI/training-code
Or, you can use this; for QLoRA

qlora

80 9,388 7.4 Jupyter Notebook

QLoRA: Efficient Finetuning of Quantized LLMs

https://github.com/artidoro/qlora
The tools and mechanisms to get a model to do what you want is ever so changing, ever so quickly. Build and understand a notebook yourself, and reduce dependencies. You will need to switch them.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

First impressions: GPU + GCP Batch
2 projects | dev.to | 26 Apr 2024
Searchformer: Beyond a* Better Planning with Transformers via Search Dynamics
1 project | news.ycombinator.com | 26 Apr 2024
Voxel51 Filtered Views Newsletter – April 26, 2024
1 project | dev.to | 26 Apr 2024
DataFrameAndNotebooksAmsterdam2024 – Discovering why trains come in late in NL
1 project | news.ycombinator.com | 25 Apr 2024
Why Vector Compression Matters
3 projects | dev.to | 24 Apr 2024

Tuning and Testing Llama 2, Flan-T5, and GPT-J with LoRA, Sematic, and Gradio

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Post date: 26 Jul 2023

training-code

qlora

WorkOS

Related posts