Fine-tuning Llama

This page summarizes the projects mentioned and recommended in the original post on /r/Oobabooga.

  • TencentPretrain

    Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

  • stanford_alpaca

    Code and documentation to train Stanford's Alpaca models, and generate the data.

  • Something else that might be worth keeping an eye on: https://github.com/tatsu-lab/stanford_alpaca

  • alpaca-lora

    Instruct-tune LLaMA on consumer hardware

  • Code and weights for replicating Alpaca on a single GPU are here: https://github.com/tloen/alpaca-lora

  • text-generation-webui

    A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

  • Discussion on GitHub: https://github.com/oobabooga/text-generation-webui/issues/332

