GirlfriendGPT
ggml
Metric | GirlfriendGPT | ggml
---|---|---
Mentions | 18 | 69
Stars | 2,545 | 9,725
Growth | - | -
Activity | 7.9 | 9.8
Latest commit | about 1 month ago | 5 days ago
Language | Python | C
License | - | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
GirlfriendGPT
-
The gatekeepers trying to silence uncensored AI
While Poe from Quora is exempted from Stripe's restricted businesses list, startups like GirlfriendGPT get bullied! The functionality of both AI chat engines is the same.
-
Tutorials???
Geeesh, I’m kinda new to this whole coding, making-your-own-GPT type stuff lol, and if you ask me, so far so good: I can definitely follow along and plug in code and prompts to make a basic chatbot lol. But! Now I’m trying to make GirlfriendGPT, and I’ve found the code/template on GitHub https://github.com/EniasCailliau/GirlfriendGPT. The only thing is I don’t know how to code it without watching someone on YouTube walk me through the steps. Is there anyone or any way I can get this coded to create the AI girlfriend I want????
-
🤖💟 An open-source AI tool to create a virtual partner right from your description
BTW, here is the service:)))
-
Create your virtual partner with this open-source AI tool!
💻 Check GitHub to see the service. Would be glad to see your results! Maybe I’ll test it in the future, too:))
-
(2/2) May 2023
Girlfriend GPT is a Python project to build your own AI girlfriend using ChatGPT4.0 (https://github.com/EniasCailliau/GirlfriendGPT)
-
Thinking about creating an animated AI chatbot with Ryan Cohen where Apes pay $1 per minute to chat with their dad
Some guy has already made GirlfriendGPT, which allows you to define a personality, and it can also send you AI-generated selfies. You could probably define an RC personality and host this thing in the AWS free tier.
-
Artificial intelligence could lead to extinction, experts warn
Here you go: https://github.com/EniasCailliau/GirlfriendGPT
- [P] GirlfriendGPT - build your own AI girlfriend
- [D] GirlfriendGPT - Your personal AI companion
ggml
-
LLMs on your local Computer (Part 1)
git clone https://github.com/ggerganov/ggml
cd ggml
mkdir build
cd build
cmake ..
make -j4 gpt-j
../examples/gpt-j/download-ggml-model.sh 6B
-
GGUF, the Long Way Around
Cool. I was just learning about GGUF by creating my own parser for it based on the spec https://github.com/ggerganov/ggml/blob/master/docs/gguf.md (for educational purposes)
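For readers who want to try the same exercise, here is a minimal sketch of the first step of such a parser: reading the fixed-size header. It assumes the little-endian layout used by GGUF version 2 and later as described in the linked spec (4-byte magic, 32-bit version, 64-bit tensor count, 64-bit metadata key/value count). The function name `read_gguf_header` and the returned dictionary keys are made up for this illustration and are not part of any library.

```python
import struct

GGUF_MAGIC = b"GGUF"
HEADER_SIZE = 24  # 4 (magic) + 4 (version) + 8 (tensor count) + 8 (kv count)


def read_gguf_header(data: bytes) -> dict:
    """Parse the fixed-size header at the start of a GGUF file.

    Assumption: little-endian, version >= 2 layout with 64-bit counts.
    """
    if len(data) < HEADER_SIZE:
        raise ValueError("need at least 24 bytes for a GGUF header")
    magic = data[:4]
    if magic != GGUF_MAGIC:
        raise ValueError(f"not a GGUF file: magic={magic!r}")
    # "<" = little-endian without padding; I = uint32, Q = uint64.
    version, tensor_count, kv_count = struct.unpack_from("<IQQ", data, 4)
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }


# Build a synthetic header in memory instead of reading a real model file.
fake_header = GGUF_MAGIC + struct.pack("<IQQ", 3, 2, 5)
print(read_gguf_header(fake_header))
```

With a real file you would pass the first 24 bytes, for example `open(path, "rb").read(24)`. The metadata key/value pairs and tensor descriptions that follow the header need more parsing than this sketch covers.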
-
Ask HN: People who switched from GPT to their own models. How was it?
If you don't care about the details of how those model servers work, then something that abstracts out the whole process like LM Studio or Ollama is all you need.
However, if you want to get into the weeds of how this actually works, I recommend you look up model quantization and some libraries like ggml[1] that actually do that for you.
[1] https://github.com/ggerganov/ggml
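To give a feel for what model quantization means, here is a simplified sketch of block-wise 8-bit quantization: weights are split into fixed-size blocks, and each block stores one floating-point scale plus small integers. This is an illustration of the general idea only, not ggml's actual code or file layout. The block size of 32 and the function names are assumptions chosen for the example.

```python
import math
from typing import List, Tuple

BLOCK_SIZE = 32  # assumed block size for this illustration

Block = Tuple[float, List[int]]  # (scale, quantized integers in [-127, 127])


def quantize_blocks(values: List[float]) -> List[Block]:
    """Quantize floats to 8-bit integers with one scale per block."""
    blocks = []
    for start in range(0, len(values), BLOCK_SIZE):
        chunk = values[start:start + BLOCK_SIZE]
        amax = max(abs(v) for v in chunk)
        # Map the largest magnitude in the block to 127; avoid dividing by zero.
        scale = amax / 127.0 if amax > 0 else 1.0
        quantized = [max(-127, min(127, round(v / scale))) for v in chunk]
        blocks.append((scale, quantized))
    return blocks


def dequantize_blocks(blocks: List[Block]) -> List[float]:
    """Recover approximate floats by multiplying each integer by its block scale."""
    return [scale * q for scale, qs in blocks for q in qs]


# Synthetic "weights" for the demo; real models have millions of these.
weights = [math.sin(i) for i in range(64)]
restored = dequantize_blocks(quantize_blocks(weights))
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(f"max round-trip error: {max_error:.5f}")
```

The trade-off is that each weight takes about one byte plus a shared scale instead of two or four bytes, at the cost of a rounding error of at most half a scale step per value.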
- GGUF File Format
-
Google just shipped libggml from llama-cpp into its Android AICore
Because the library is called ggml, but it supports gguf.
-
Q-Transformer
Apparently this guy, like a bunch of others such as https://github.com/ggerganov/ggml, is implementing transformers from papers for people who want them. Pretty cool.
-
[P] Inference Vision Transformer (ViT) in plain C/C++ with ggml
You can access it here: https://github.com/staghado/vit.cpp It has been added to the ggml library on GitHub: https://github.com/ggerganov/ggml
-
Falcon 180B Released
https://github.com/ggerganov/ggml
One note is that prompt ingestion is extremely slow on CPU compared to GPU. So short prompts are fine (as tokens can be streamed once the prompt is ingested), but long prompts feel extremely sluggish.
-
Stable Diffusion in pure C/C++
I did a quick run under profiler and on my AVX2-laptop the slowest part (>50%) was matrix multiplication (sgemm).
In the current version of GGML, if OpenBLAS is enabled, they convert matrices to FP32 before running sgemm.
If OpenBLAS is disabled, on an AVX2 platform they convert FP16 to FP32 on every FMA operation, which is even worse (due to repetition). After that, both ggml_vec_dot_f16 and ggml_vec_dot_f32 took first place in the profiler.
Source: https://github.com/ggerganov/ggml/blob/master/src/ggml.c#L10...
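The "repetition" cost described above can be illustrated with a toy model that counts how many FP16-to-FP32 conversions each strategy performs in a matrix-vector product. This is a sketch under stated assumptions, not ggml's code: the `Converter` class and both `matvec_*` functions are hypothetical, and Python's `struct` decodes half floats into ordinary Python floats, so only the conversion counts are meaningful here, not the timing.

```python
import struct
from typing import List


def pack_fp16(values: List[float]) -> List[bytes]:
    """Store each value as 2 raw bytes in IEEE half-precision format."""
    return [struct.pack("<e", v) for v in values]


class Converter:
    """Decodes FP16 bytes to a float and counts how often it is called."""

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, raw: bytes) -> float:
        self.calls += 1
        return struct.unpack("<e", raw)[0]


def matvec_per_fma(rows: List[List[bytes]], vec: List[bytes], conv: Converter) -> List[float]:
    """Convert both operands again for every multiply-add."""
    return [sum(conv(x) * conv(y) for x, y in zip(row, vec)) for row in rows]


def matvec_convert_once(rows: List[List[bytes]], vec: List[bytes], conv: Converter) -> List[float]:
    """Convert the matrix and the vector up front, then multiply plain floats."""
    vec32 = [conv(x) for x in vec]
    rows32 = [[conv(x) for x in row] for row in rows]
    return [sum(x * y for x, y in zip(row, vec32)) for row in rows32]


# A 4x8 matrix and an 8-element vector with values exactly representable in FP16.
rows = [pack_fp16([0.5 * (i + j) for j in range(8)]) for i in range(4)]
vec = pack_fp16([float(j) for j in range(8)])

per_fma, once = Converter(), Converter()
result_a = matvec_per_fma(rows, vec, per_fma)
result_b = matvec_convert_once(rows, vec, once)
print("same result:", result_a == result_b)
print("conversions per FMA:", per_fma.calls, "vs up front:", once.calls)
```

In this toy model the vector is re-converted once per matrix row when converting on every multiply-add, so the gap between the two counts grows with the number of rows.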
-
Accessing Llama 2 from the command-line with the LLM-replicate plugin
For those getting started, the easiest one click installer I've used is Nomic.ai's gpt4all: https://gpt4all.io/
This runs with a simple GUI on Windows/Mac/Linux, leverages a fork of llama.cpp on the backend and supports GPU acceleration, and LLaMA, Falcon, MPT, and GPT-J models. It also has API/CLI bindings.
I just saw a slick new tool, https://ollama.ai/, that lets you install llama2-7b with a single `ollama run llama2` command and has a very simple 1-click installer for Apple Silicon Macs (you need to build from source for anything else atm). It looks like it only supports llamas OOTB, but it also seems to use llama.cpp (via a Go adapter) on the backend. It seemed to be CPU-only on my MBA, but I didn't poke too much and it's brand new, so we'll see.
For anyone on HN, they should probably be looking at https://github.com/ggerganov/llama.cpp and https://github.com/ggerganov/ggml directly. If you have a high-end Nvidia consumer card (3090/4090) I'd highly recommend looking into https://github.com/turboderp/exllama
For those generally confused, the r/LocalLLaMA wiki is a good place to start: https://www.reddit.com/r/LocalLLaMA/wiki/guide/
I've also been porting my own notes into a single location that tracks models, evals, and has guides focused on local models: https://llm-tracker.info/
What are some alternatives?
SillyTavern - LLM Frontend for Power Users.
llama.cpp - LLM inference in C/C++
tree-of-thoughts - Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM
gorilla - Gorilla: An API store for LLMs
alpaca-lora - Instruct-tune LLaMA on consumer hardware
gptqlora - GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
mlc-llm - Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
DB-GPT - AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
replika-research - Replika.ai Research Papers, Posters, Slides & Datasets
llm - An ecosystem of Rust libraries for working with large language models