Detailed walkthrough of procedure to uncensor models

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA

  • alpaca_lora_4bit

  • To fine-tune uncensored models on personal data, I used finetune.py from https://github.com/johnsmith0031/alpaca_lora_4bit to train a 4-bit LoRA (a CLI workflow), along with various custom scripts I wrote with ChatGPT's help to gather and prepare my own dataset. I think training a 4-bit LoRA is now also easily doable through a nice UI in oobabooga's text-generation-webui with the monkey patch enabled.
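
    The comment above mentions custom scripts for gathering and preparing a dataset but does not show them. As a rough illustration only, here is a minimal sketch of what such a preparation step might look like, assuming the data is written as Alpaca-style JSON records (instruction / input / output). The function names, the sample data, and the output file name are hypothetical, and whether finetune.py expects exactly this layout should be checked against the repository's own documentation.

```python
import json


def to_alpaca_records(pairs):
    """Convert (instruction, response) pairs into Alpaca-style dicts.

    Hypothetical helper: the commenter's real scripts are not shown.
    """
    records = []
    for instruction, response in pairs:
        instruction = instruction.strip()
        response = response.strip()
        if not instruction or not response:
            continue  # skip incomplete examples rather than train on them
        records.append(
            {"instruction": instruction, "input": "", "output": response}
        )
    return records


def dump_records(records, path):
    """Write the records as a single JSON array, UTF-8 encoded."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)


if __name__ == "__main__":
    # Placeholder data; in practice this would come from personal notes, chats, etc.
    sample = [
        ("Summarize my notes on LoRA.", "LoRA trains small low-rank adapter matrices."),
        ("   ", "This pair has an empty instruction and is dropped."),
    ]
    dump_records(to_alpaca_records(sample), "my_dataset.json")
```

    Dropping empty pairs up front keeps obviously broken examples out of the training file; any further cleaning (deduplication, length limits) would depend on the data being used.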

  • LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

  • If someone could do this to [LLaVA](https://github.com/haotian-liu/LLaVA/) to get a multimodal model, that would be amazing!

  • prettier

    Prettier is an opinionated code formatter.

  • But for private, in-company or organizational use, I expect to see demand for strict enforcement of tone and style guidelines in employees' documents. Kind of like having different Prettier configurations for different kinds of projects, but for office documents.
