memprompt
unilm
| | memprompt | unilm |
|---|---|---|
| Mentions | 4 | 40 |
| Stars | 320 | 18,319 |
| Growth | - | 5.9% |
| Activity | 1.7 | 9.0 |
| Last commit | about 1 year ago | 5 days ago |
| Language | Python | Python |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
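The exact formula behind the activity number isn't published here; as a purely hypothetical illustration (the half-life and scaling below are invented for the sketch), a recency-weighted score could be computed like this:

```python
import math
import time

def activity_score(commit_timestamps, half_life_days=30.0):
    # Hypothetical recency-weighted score: each commit contributes
    # exponentially less the older it is, so recent work dominates,
    # matching the description that recent commits have higher weight.
    now = time.time()
    decay = math.log(2) / (half_life_days * 86400)  # per-second decay rate
    return sum(math.exp(-decay * (now - t)) for t in commit_timestamps)
```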
memprompt
-
Allen Institute for Artificial Intelligence Introduces MemPrompt: A New Method to “fix” GPT-3 After Deployment with User Interaction
Quick Read: https://www.marktechpost.com/2022/12/18/allen-institute-for-artificial-intelligence-introduces-memprompt-a-new-method-to-fix-gpt-3-after-deployment-with-user-interaction/
Paper: https://arxiv.org/abs/2201.06009
Code: https://github.com/madaan/memprompt
-
Building a Virtual Machine Inside ChatGPT
It's already possible to get some of this effect with Codex. The trick is to keep appending the interaction to the prompt, to maintain a memory of sorts (see the sketch below).
For example, you can replicate all the prompts here: https://twitter.com/yoavgo/status/1599200756631887872 with prompt + memory.
The notebook at https://github.com/madaan/memprompt/blob/main/YoavsPythonPro... shows a demo of this.
Some of these ideas were earlier discussed in our work on memory-assisted prompting [1].
[1] https://arxiv.org/pdf/2201.06009.pdf
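To make the trick concrete, here is a minimal sketch of prompt + memory; `llm_complete` is a stand-in for whatever completion API you use (Codex, GPT-3, etc.), not a real library call:

```python
def llm_complete(prompt: str) -> str:
    # Stand-in for your actual LLM completion call.
    raise NotImplementedError("plug in your completion API here")

def run_session(turns):
    """Replay a conversation, carrying the full transcript as 'memory'."""
    memory = ""  # everything said so far, prepended to each new prompt
    for user_input in turns:
        prompt = memory + f"User: {user_input}\nAssistant:"
        reply = llm_complete(prompt)
        # Append both sides of the exchange so later turns can refer back.
        memory += f"User: {user_input}\nAssistant: {reply}\n"
    return memory
```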
-
[D] Paper Review Video - Memory-assisted prompt editing to improve GPT-3 after deployment
Code for https://arxiv.org/abs/2201.06009 found: https://github.com/madaan/memprompt
unilm
- The Era of 1-Bit LLMs: Training Tips, Code and FAQ [pdf]
- The Era of 1-Bit LLMs: Training Tips, Code and FAQ
-
The Era of 1-bit LLMs: ternary parameters for cost-effective computing
+1 on this; the real proof would have been testing both models side by side.
It seems that it may be published on GitHub [1], according to Hugging Face [2].
[1] https://github.com/microsoft/unilm/tree/master/bitnet
[2] https://huggingface.co/papers/2402.17764
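For context, "ternary parameters" means every weight is constrained to {-1, 0, +1}. Here is a rough sketch of the absmean quantization as I read it in the BitNet b1.58 paper; this is my illustration, not code from the repo:

```python
import torch

def ternarize(w: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    # Absmean quantization (per the BitNet b1.58 paper): scale weights by
    # their mean absolute value, then round and clip every entry to the
    # ternary set {-1, 0, +1}.
    gamma = w.abs().mean()
    return (w / (gamma + eps)).round().clamp(-1, 1)
```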
- I'm an Old Fart and AI Makes Me Sad
-
On building a semantic search engine
e5-mistral is essentially a distillation from GPT-4 to a smaller model. You can see at https://github.com/microsoft/unilm/blob/16da2f193b9c1dab0a69... that they actually have custom prompts for each dataset being tested.
The question is: if you haven't seen the task before, what is a good prompt to prepend for it?
IMO, e5-mistral is overfit to MTEB.
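As an illustration of what "custom prompts" means here: e5-mistral-style models prepend a task instruction to each query before embedding it. The `Instruct:/Query:` template below follows the e5-mistral-7b-instruct model card; the example instruction is just a guess at what you'd write for an unseen task:

```python
def build_query(task_description: str, query: str) -> str:
    # Template from the e5-mistral-7b-instruct model card: the task
    # instruction is prepended to the query before it is embedded.
    return f"Instruct: {task_description}\nQuery: {query}"

# For an unseen task you have to guess a reasonable instruction:
text = build_query(
    "Given a web search query, retrieve relevant passages that answer the query",
    "how does memory-assisted prompting work?",
)
```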
-
Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide
LayoutLM v1, v2, and v3 models [GitHub]; DocBERT [GitHub]
-
Microsoft Publishes LongNet: Scaling Transformers to 1,000,000,000 Tokens
The repository is available here: https://github.com/microsoft/unilm
-
Recommended open LLMs with image input modality?
It is missing Kosmos-2. I remember its image captioning was really good (the demo is currently down), and it's almost as fast as LLaVA and LaVIN.
-
LongNet: Scaling Transformers to 1,000,000,000 Tokens
Should be this: https://github.com/microsoft/unilm/
-
[R] LongNet: Scaling Transformers to 1,000,000,000 Tokens
This is from Microsoft Research (Asia). https://aka.ms/GeneralAI
What are some alternatives?
gpt-scrolls - A collaborative collection of open-source safe GPT-3 prompts that work well
transformers - 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.