Simple LLM Watermarking - Open Lllama 3b LORA

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

alpaca-lora

107 18,197 3.6 Jupyter Notebook

Instruct-tune LLaMA on consumer hardware

There are a few papers on watermarking LLM output, but from what I have seen they all use complex methods of detection to allow the watermark to go unseen by the end user, only to be detected by algorithm. I believe that a more overt system of watermarking might also be beneficial. One simple method that I have tried is character substitution. For this model, I LORA finetuned openlm-research/open_llama_3b on the alpaca_data_cleaned_archive.json dataset from https://github.com/tloen/alpaca-lora/ modified by replacing all instances of the "." character in the outputs with a "ι" The results are pretty good, with the correct the correct substitutions being generated by the model in most cases. It doesn't always work, but this was only a LORA training and for two epochs of 400 steps each, and 100% substitution isn't really required.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

2024 Verizon Data Breach Investigation Report [pdf]

1 project | news.ycombinator.com | 1 May 2024
Impact of Input Length on the Reasoning Performance of Large Language Models

1 project | news.ycombinator.com | 1 May 2024
Kolmogorov-Arnold Networks

4 projects | news.ycombinator.com | 30 Apr 2024
Quick tip: Write numpy arrays directly to the SingleStore VECTOR data type

1 project | dev.to | 1 May 2024
Navigating the Risky Waters of Loan Defaults: A Predictive Beacon

1 project | dev.to | 30 Apr 2024

Simple LLM Watermarking - Open Lllama 3b LORA

This page summarizes the projects mentioned and recommended in the original post on /r/u_Unstable_Llama Post date: 11 Jun 2023

alpaca-lora

InfluxDB

Related posts

2024 Verizon Data Breach Investigation Report [pdf]

Impact of Input Length on the Reasoning Performance of Large Language Models

Kolmogorov-Arnold Networks

Quick tip: Write numpy arrays directly to the SingleStore VECTOR data type

Navigating the Risky Waters of Loan Defaults: A Predictive Beacon