The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 23 text-generation Open-Source Projects
-
LocalAI
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
textgenrnn
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Accelerated Text
Accelerated Text is a no-code natural language generation platform. It will help you construct document plans which define how your data is converted to textual descriptions varying in wording and structure.
-
Cornucopia-LLaMA-Fin-Chinese
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Has anyone tried fine tuning on a dataset of complex tasks that require tool use? | /r/LocalLLaMA | 2023-05-05
Try this: 1) (Not sure if that's necessary.) Uninstall textgenrnn: pip3 uninstall textgenrnn. 2) Install it using one of this commands: * pip3 install git+git://github.com/minimaxir/textgenrnn.git * pip3 install git+https://github.com/minimaxir/textgenrnn.git (Try the first one, but if it'll raise an error, try the second one.) That's discussion about this "multi_gpu_model not found" error: https://github.com/minimaxir/textgenrnn/issues/222.
Project mention: Show HN: I made an app to use local AI as daily driver | news.ycombinator.com | 2024-02-27
Project mention: Show HN: WhatsApp-Llama: A clone of yourself from your WhatsApp conversations | news.ycombinator.com | 2023-09-09Tap the contact's name in WhatsApp (I think it only works on a phone) and at the bottom of that screen there's Export Chat.
For finetuning GPT-2 I think I used this thing on Google Colab. (My friend ran it on his GPU, it should be doable on most modern-ish GPUs.)
https://github.com/minimaxir/gpt-2-simple
I tried doing something with this a few months ago though and it was a bit of a hassle to get running (needed to use a specific python version for some dependencies...), I forget the details sorry!
I think of guardrails as another dimension of human preferences: whether you are training a model to answer questions more gooder or avoid saying horrifying stuff, you are teaching the model a preference. So I thinks it's a straightforward RLHF problem but from a different perspective.
Project mention: Microsoft: Large-scale pretrained models for goal-directed dialog | news.ycombinator.com | 2023-06-05
Using the currently popular gptq the 3bit quantization hurts performance much more than 4bit, but there's also awq (https://github.com/mit-han-lab/llm-awq) and squishllm (https://github.com/SqueezeAILab/SqueezeLLM) which are able to manage 3bit without as much performance drop - I hope to see them used more commonly.
Project mention: Cornucopia-LLaMA-Fin-Chinese: NEW Textual - star count:263.0 | /r/algoprojects | 2023-07-31
Project mention: I Built a Modular Python Library for Designing and Training Diffusion Models from Scratch | /r/SideProject | 2023-09-06Last week, I released a project I've been working on for months: Modular Diffusion. It's a modular Python library for designing and training your own Diffusion Models in just a few lines of code. I also wrote a documentation page. The project has already gotten some great community feedback and I'm hoping you guys like it too!
Project mention: A LLM trained to follow annotation guidelines, for information extraction tasks | news.ycombinator.com | 2023-10-30
I've been playing around with DALL-E 3 a lot recently. One of the things they do is to expand a user's prompt in order to add a lot of detail. Via their API, you can see the expanded prompt (whereas, you can't through the ChatGPT interface).
They obviously have the power of their LLM behind them and can generate some really interesting prompts. There is an open source implementation that the creator of Fooocus made which attempts to expand on prompts using some commonly used keywords[1] with some sort of basic context.
e.g., I typed in "Brisket on a table" and got: "Brisket on a table, product photography, Michelin star, award winning photo, 8k, trending, HD. High quality image, highly detailed, stunning lighting, flawless render, masterpiece, still from the movie directed by Denis Villeneuve with art direction"
You get a much better image with that prompt vs just the basic: "Brisket on a table"
[1] https://huggingface.co/spaces/daspartho/prompt-extend
text-generation related posts
- Show HN: WhatsApp-Llama: A clone of yourself from your WhatsApp conversations
- Modern alternative to textgenrnn?
- Is there any nano-gpt/pico-gpt like implementation available for stable-diffusion models?
- indistinguishable
- Just a thought
- training gpt on your own sources - how does it work? gpt2 v gpt3? and how much does it cost?
- Gen Z says that school is not shipping them with the skills necessary to survive in a digital world
-
A note from our sponsor - WorkOS
workos.com | 25 Apr 2024
Index
What are some of the best open-source text-generation projects? This list will help you:
Project | Stars | |
---|---|---|
1 | LocalAI | 19,593 |
2 | MOSS | 11,808 |
3 | GPT2-Chinese | 7,342 |
4 | textgenrnn | 4,943 |
5 | lollms-webui | 3,762 |
6 | gpt-2-simple | 3,366 |
7 | DialoGPT | 2,315 |
8 | RL4LMs | 2,084 |
9 | GODEL | 834 |
10 | Accelerated Text | 789 |
11 | SqueezeLLM | 560 |
12 | Cornucopia-LLaMA-Fin-Chinese | 521 |
13 | commit-autosuggestions | 383 |
14 | gpt-2-cloud-run | 313 |
15 | minimal-text-diffusion | 258 |
16 | modular-diffusion | 253 |
17 | MAGIC | 245 |
18 | GoLLIE | 204 |
19 | LongForm | 196 |
20 | rant | 183 |
21 | KVQuant | 183 |
22 | genius | 175 |
23 | prompt-extend | 174 |
Sponsored