AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
File upload and automatic agents. It exists, it is just buggy. They are working at an insane pace building it, and it is practically broken 90% of the time. Maybe it's working better right now. I had success with v1.1.31 as well. https://github.com/Josh-xt/AGiXT
To merge a LoRA into an existing model, use this script:
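One common way to do such a merge, as a minimal sketch and not necessarily the script referred to above, assumes the Hugging Face `transformers` and `peft` libraries are installed. The function names, argument names, and paths here are hypothetical and only for illustration.

```python
import argparse


def parse_args(argv=None):
    """Parse the three paths the merge needs (all names are illustrative)."""
    parser = argparse.ArgumentParser(description="Merge a LoRA adapter into its base model.")
    parser.add_argument("--base", required=True, help="path or hub id of the base model")
    parser.add_argument("--lora", required=True, help="path to the LoRA adapter")
    parser.add_argument("--out", required=True, help="directory for the merged model")
    return parser.parse_args(argv)


def merge_lora(base, lora, out):
    """Fold the LoRA weights into the base model and save a plain checkpoint."""
    # Heavy imports stay local so this file can be loaded before they are needed.
    # Assumption: torch, peft and transformers are available in the environment.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Load the base model in fp16; the merge operates on unquantized weights.
    model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.float16)
    # Attach the adapter, then fold its low-rank deltas into the base weights.
    model = PeftModel.from_pretrained(model, lora)
    model = model.merge_and_unload()
    # Save the merged model and the tokenizer together in one directory.
    model.save_pretrained(out)
    AutoTokenizer.from_pretrained(base).save_pretrained(out)
```

Calling `merge_lora(**vars(parse_args()))` from a small entry point would run the merge with paths given on the command line. The output directory then holds an ordinary model with no adapter attached, which is what a quantizer expects as input.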
Once that is done, re-quantize the merged model with GPTQ-for-LLaMa. Many models, including LLaMA, are compatible with the regular Triton version. If yours is not, you may have to find a compatible fork.
If you are using the Triton version or my CUDA fork for inference, you can use act-order:
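As a rough sketch of what such a quantization call might look like, the helper below assembles a command line for GPTQ-for-LLaMa's `llama.py`. The flag names (`--wbits`, `--groupsize`, `--true-sequential`, `--act-order`, `--save_safetensors`) and the `c4` calibration dataset follow that project's README as I understand it and may differ between forks, so treat them as assumptions. The function name, paths, and default values are hypothetical.

```python
import shlex


def gptq_command(model_dir, out_file, wbits=4, groupsize=128, act_order=True):
    """Build the argv for a GPTQ-for-LLaMa quantization run (flags are assumptions)."""
    cmd = [
        "python", "llama.py", model_dir, "c4",  # c4 is the calibration dataset
        "--wbits", str(wbits),
        "--true-sequential",
        "--groupsize", str(groupsize),
    ]
    # act-order is optional; leave it out for forks that cannot load such models.
    if act_order:
        cmd.append("--act-order")
    cmd += ["--save_safetensors", out_file]
    return cmd


# Print the command for inspection instead of running it.
print(shlex.join(gptq_command("./merged-model", "merged-4bit-128g.safetensors")))
```

Passing `act_order=False` drops the flag, which may be needed when the inference backend is neither the Triton version nor a fork that supports act-order.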