| | minimal-text-diffusion | basaran |
|---|---|---|
| Mentions | 2 | 22 |
| Stars | 261 | 1,281 |
| Growth | - | - |
| Activity | 4.9 | 10.0 |
| Last commit | 12 months ago | 3 months ago |
| Language | Python | Python |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
minimal-text-diffusion
- Is there any nano-gpt/pico-gpt like implementation available for stable-diffusion models?
- [D] would diffusion language models make sense?
In case you’re interested, I have a minimal implementation here: https://github.com/madaan/minimal-text-diffusion
basaran
- OpenLLM
- Langchain and self hosted LLaMA hosted API
What are the current best "no reinventing the wheel" approaches to have Langchain use an LLM through a locally hosted REST API, the likes of Oobabooga or hyperonym/basaran with streaming support for 4-bit GPTQ?
- Run and create custom ChatGPT-like bots with OpenChat
Disclaimer: I am curating LLM-tools on github [1]
A few thoughts:
* allow for custom endpoint URLs, this way people can use open source LLMs with a fake openAI API backend like basaran[2] or llama-api-server[3]
* look into better embedding methods for info-retrieval like InstructorEmbeddings or Document Summary Index
* Don't use a single embedding per content item, use multiple to increase retrieval quality
[1] https://github.com/underlines/awesome-marketing-datascience/...
[2] https://github.com/hyperonym/basaran
[3] https://github.com/iaalm/llama-api-server
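The "custom endpoint URL" suggestion above can be sketched as follows. This is a minimal illustration, not code from any of the linked projects: it assumes the self-hosted server exposes an OpenAI-style `/v1/completions` route, and the host, port, and the helper name `build_completion_request` are hypothetical.

```python
import json
import urllib.request


def build_completion_request(base_url, prompt, max_tokens=32, stream=False):
    """Build (but do not send) a POST request for an OpenAI-style completion route.

    Assumption: the server behind base_url accepts the same JSON body as the
    OpenAI text completion API at /v1/completions.
    """
    url = base_url.rstrip("/") + "/v1/completions"
    payload = {"prompt": prompt, "max_tokens": max_tokens, "stream": stream}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical local address; actually sending the request
    # (urllib.request.urlopen(req)) requires a running server.
    req = build_completion_request("http://127.0.0.1:8000", "once upon a time,")
    print(req.full_url)
    print(req.data.decode("utf-8"))
```

Keeping the base URL as a parameter is the whole point: the same client code can target the hosted API or a local drop-in replacement by changing one string.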
- 1-Jun-2023
open-source alternative to the OpenAI text completion API (https://github.com/hyperonym/basaran)
- Introducing Basaran: self-hosted open-source alternative to the OpenAI text completion API
- Basaran is an open-source alternative to the OpenAI text completion API
- Ask HN: What's the best self hosted/local alternative to GPT-4?
Guanaco-65B[0] using Basaran[1] for your OpenAI compatible API. You can use any ChatGPT front-end which lets you change the OpenAI endpoint URL.
[0] A 4-bit finetune of LLaMA-65B by Tim Dettmers
[1] https://github.com/hyperonym/basaran
- Are all the finetunes stupid?
For lm-eval, I think you'd either need to take GPTQ's inference script and shim it into a model: https://github.com/EleutherAI/lm-evaluation-harness/tree/master/lm_eval/models or you might be able to use a project like https://github.com/hyperonym/basaran and then you could use the gpt3 model...
- Using the API in Node
There are also: - Basaran repo: "Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models". "...Compatibility with OpenAI API and client libraries..."; - llama-cpp-python repo: "Simple Python bindings for @ggerganov's llama.cpp library...". "...OpenAI-like API...".
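As a rough illustration of what a "compatible streaming API" means for a client, the sketch below parses OpenAI-style server-sent event lines into text fragments. It assumes the server follows the OpenAI convention of `data: {json}` lines ending with `data: [DONE]`; the function name `iter_completion_text` and the sample payloads are made up for the example and are not taken from either repo.

```python
import json


def iter_completion_text(lines):
    """Yield text fragments from OpenAI-style server-sent event lines."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and other SSE fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break  # end-of-stream marker used by the OpenAI API
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            yield choice.get("text", "")


# Hand-written sample stream; a real client would iterate over the HTTP response.
sample = [
    'data: {"choices": [{"text": "Hello"}]}',
    "",
    'data: {"choices": [{"text": ", world"}]}',
    "",
    "data: [DONE]",
]
print("".join(iter_completion_text(sample)))  # prints "Hello, world"
```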
- Researcher looking for help with how to prepare a finetuning dataset for models like Bloomz and Cerebras-GPT
I want to start with a totally freely available model, so again, that excludes things like LLaMA where the weights are only available through a wait list. The two models that most get my attention and (I think, and hope) fit my criteria of open availability are Cerebras-GPT (13b) and Bloomz (7b). The tools to process and fine-tune that seem most feasible to me, from my limited knowledge, are xturing and basaran.
What are some alternatives?
DiffSBDD - A Euclidean diffusion model for structure-based drug design.
text-generation-inference - Large Language Model Text Generation Inference
modular-diffusion - Python library for designing and training your own Diffusion Models with PyTorch.
openai-chatgpt-opentranslator - Python command that uses openai to perform text translations
ImageReward - [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
AutoGPTQ - An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
DDPM_inversion - Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.
NeMo-Guardrails - NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
llm-foundry - LLM training code for Databricks foundation models
alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM
lmql - A language for constraint-guided and efficient LLM programming.
llama.cpp - LLM inference in C/C++