Most AI runners just ignore licensing and run LLaMA finetunes.
But if you want to avoid the non-commercial LLaMA license, you have 3 good options for a base model.
- OpenLLaMA 13B
- MPT 30B
- Falcon 40B
Of these, Falcon 40B is very difficult to run (slow in 4-bit, basically requires a professional GPU, no good CPU offloading yet).
OpenLLaMA 13B only supports a 2048-token context as of today... but that could change soon.
So you probably want MPT-30B-Instruct, specifically this one:
https://huggingface.co/TheBloke/mpt-30B-instruct-GGML
As the page says, you can try it out on a decent PC of your own with the OpenCL build of KoboldCPP: switch to "instruct" mode, use the prompt template from the page, and offload as many layers as you can to your PC's dGPU. It may already work for your summarization needs.
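The instruct template on that model page can be wrapped in a small helper. The wording below is paraphrased from TheBloke's model card, so double-check the exact text there before relying on it:

```python
def mpt_instruct_prompt(instruction: str) -> str:
    """Build an MPT-30B-Instruct style prompt.

    Template paraphrased from TheBloke's model card -- verify the
    exact wording there before relying on it.
    """
    return (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n"
        "### Instruction\n"
        f"{instruction}\n"
        "### Response\n"
    )

print(mpt_instruct_prompt("Summarize the following article: ..."))
```

In KoboldCPP's instruct mode you'd paste the `### Instruction` / `### Response` markers into its start/end sequence fields instead of building strings yourself.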
If not, you can finetune it with MPT's code and summarization data:
https://github.com/mosaicml/llm-foundry
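On the data side, supervised finetuning pipelines like llm-foundry's generally consume prompt/response pairs. Here's a hypothetical sketch of writing such a JSONL file; the exact schema llm-foundry expects may differ, so check the repo's finetuning docs:

```python
import json

# Hypothetical summarization pairs -- replace with your real data.
examples = [
    {
        "prompt": "Summarize the following article:\nThe quick brown fox ...",
        "response": "A fox jumps over a lazy dog.",
    },
]

# One JSON object per line, the usual JSONL convention.
with open("summarize_train.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")
```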
Or train OpenLLaMA 13B with SuperHOT + summarization data using QLoRA.
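For intuition: QLoRA keeps the (quantized) base weights frozen and learns a low-rank update, so the effective weight is roughly W + (alpha/r) * B @ A. A toy numpy illustration of that adapter arithmetic (shapes and names are illustrative, not taken from the QLoRA code):

```python
import numpy as np

rng = np.random.default_rng(0)

d_out, d_in, r, alpha = 8, 8, 2, 16   # tiny illustrative shapes
W = rng.normal(size=(d_out, d_in))    # frozen base weight (4-bit quantized in real QLoRA)
A = rng.normal(size=(r, d_in))        # trainable down-projection
B = np.zeros((d_out, r))              # trainable up-projection, initialized to zero

def forward(x):
    # Base path plus scaled low-rank adapter path.
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d_in)
# With B initialized to zero the adapter contributes nothing yet,
# so the output matches the frozen base model exactly.
assert np.allclose(forward(x), W @ x)
```

Training only A and B (a few million parameters) instead of the full 13B weights is what makes this feasible on a single consumer GPU.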