text-generation

Open-source projects categorized as text-generation

Top 23 text-generation Open-Source Projects

  • LocalAI

    :robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

  • Project mention: Drop-In Replacement for ChatGPT API | news.ycombinator.com | 2024-01-24
  • MOSS

    An open-source tool-augmented conversational language model from Fudan University

  • Project mention: Has anyone tried fine tuning on a dataset of complex tasks that require tool use? | /r/LocalLLaMA | 2023-05-05
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • GPT2-Chinese

    Chinese version of GPT2 training code, using BERT tokenizer.

  • textgenrnn

    Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

  • Project mention: Modern alternative to textgenrnn? | /r/MLQuestions | 2023-06-09

    Try this: 1) (Not sure if that's necessary.) Uninstall textgenrnn: pip3 uninstall textgenrnn. 2) Install it using one of this commands: * pip3 install git+git://github.com/minimaxir/textgenrnn.git * pip3 install git+https://github.com/minimaxir/textgenrnn.git (Try the first one, but if it'll raise an error, try the second one.) That's discussion about this "multi_gpu_model not found" error: https://github.com/minimaxir/textgenrnn/issues/222.

  • lollms-webui

    Lord of Large Language Models Web User Interface

  • Project mention: Show HN: I made an app to use local AI as daily driver | news.ycombinator.com | 2024-02-27
  • gpt-2-simple

    Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

  • Project mention: Show HN: WhatsApp-Llama: A clone of yourself from your WhatsApp conversations | news.ycombinator.com | 2023-09-09

    Tap the contact's name in WhatsApp (I think it only works on a phone) and at the bottom of that screen there's Export Chat.

    For finetuning GPT-2 I think I used this thing on Google Colab. (My friend ran it on his GPU, it should be doable on most modern-ish GPUs.)

    https://github.com/minimaxir/gpt-2-simple

    I tried doing something with this a few months ago though and it was a bit of a hassle to get running (needed to use a specific python version for some dependencies...), I forget the details sorry!

  • DialoGPT

    Large-scale pretraining for dialogue

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • RL4LMs

    A modular RL library to fine-tune language models to human preferences

  • Project mention: How To Setup a Model With Guardrails? | /r/LocalLLaMA | 2023-05-12

    I think of guardrails as another dimension of human preferences: whether you are training a model to answer questions more gooder or avoid saying horrifying stuff, you are teaching the model a preference. So I thinks it's a straightforward RLHF problem but from a different perspective.

  • GODEL

    Large-scale pretrained models for goal-directed dialog

  • Project mention: Microsoft: Large-scale pretrained models for goal-directed dialog | news.ycombinator.com | 2023-06-05
  • Accelerated Text

    Accelerated Text is a no-code natural language generation platform. It will help you construct document plans which define how your data is converted to textual descriptions varying in wording and structure.

  • SqueezeLLM

    SqueezeLLM: Dense-and-Sparse Quantization

  • Project mention: Llama33B vs Falcon40B vs MPT30B | /r/LocalLLaMA | 2023-07-05

    Using the currently popular gptq the 3bit quantization hurts performance much more than 4bit, but there's also awq (https://github.com/mit-han-lab/llm-awq) and squishllm (https://github.com/SqueezeAILab/SqueezeLLM) which are able to manage 3bit without as much performance drop - I hope to see them used more commonly.

  • Cornucopia-LLaMA-Fin-Chinese

    聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

  • Project mention: Cornucopia-LLaMA-Fin-Chinese: NEW Textual - star count:263.0 | /r/algoprojects | 2023-07-31
  • commit-autosuggestions

    A tool that AI automatically recommends commit messages.

  • gpt-2-cloud-run

    Text-generation API via GPT-2 for Cloud Run

  • minimal-text-diffusion

    A minimal implementation of diffusion models for text generation

  • modular-diffusion

    Python library for designing and training your own Diffusion Models with PyTorch.

  • Project mention: I Built a Modular Python Library for Designing and Training Diffusion Models from Scratch | /r/SideProject | 2023-09-06

    Last week, I released a project I've been working on for months: Modular Diffusion. It's a modular Python library for designing and training your own Diffusion Models in just a few lines of code. I also wrote a documentation page. The project has already gotten some great community feedback and I'm hoping you guys like it too!

  • MAGIC

    Language Models Can See: Plugging Visual Controls in Text Generation (by yxuansu)

  • GoLLIE

    Guideline following Large Language Model for Information Extraction

  • Project mention: A LLM trained to follow annotation guidelines, for information extraction tasks | news.ycombinator.com | 2023-10-30
  • LongForm

    Reverse Instructions to generate instruction tuning data with corpus examples (by akoksal)

  • rant

    Rant - The templating language for procedural generation.

  • KVQuant

    KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

  • Project mention: 10M Tokens LLM Context | news.ycombinator.com | 2024-02-02
  • genius

    💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.

  • prompt-extend

    extending stable diffusion prompts with suitable style cues using text generation

  • Project mention: ArtBot for Stable Diffusion | news.ycombinator.com | 2023-12-17

    I've been playing around with DALL-E 3 a lot recently. One of the things they do is to expand a user's prompt in order to add a lot of detail. Via their API, you can see the expanded prompt (whereas, you can't through the ChatGPT interface).

    They obviously have the power of their LLM behind them and can generate some really interesting prompts. There is an open source implementation that the creator of Fooocus made which attempts to expand on prompts using some commonly used keywords[1] with some sort of basic context.

    e.g., I typed in "Brisket on a table" and got: "Brisket on a table, product photography, Michelin star, award winning photo, 8k, trending, HD. High quality image, highly detailed, stunning lighting, flawless render, masterpiece, still from the movie directed by Denis Villeneuve with art direction"

    You get a much better image with that prompt vs just the basic: "Brisket on a table"

    [1] https://huggingface.co/spaces/daspartho/prompt-extend

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

text-generation related posts

Index

What are some of the best open-source text-generation projects? This list will help you:

Project Stars
1 LocalAI 19,593
2 MOSS 11,808
3 GPT2-Chinese 7,342
4 textgenrnn 4,943
5 lollms-webui 3,762
6 gpt-2-simple 3,366
7 DialoGPT 2,315
8 RL4LMs 2,084
9 GODEL 834
10 Accelerated Text 789
11 SqueezeLLM 560
12 Cornucopia-LLaMA-Fin-Chinese 521
13 commit-autosuggestions 383
14 gpt-2-cloud-run 313
15 minimal-text-diffusion 258
16 modular-diffusion 253
17 MAGIC 245
18 GoLLIE 204
19 LongForm 196
20 rant 183
21 KVQuant 183
22 genius 175
23 prompt-extend 174

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com