vlite
gpt-2
vlite | gpt-2 | |
---|---|---|
7 | 64 | |
710 | 21,259 | |
- | 1.6% | |
9.3 | 2.5 | |
18 days ago | about 1 month ago | |
Python | Python | |
GNU Affero General Public License v3.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vlite
- Fastest vector database made in NumPy
- FLaNK Stack Weekly for 27 November 2023
- Vlite – simple vector database written in less than 200 lines of code
-
Build Personal ChatGPT Using Your Data
I am working on a simple vector db just with numpy: https://github.com/sdan/vlite
I think milvus, quickwit, and pinecone are geared more towards enterprise and are hard to use.
- Vlite: Simple Open-source project for vector embeddings
- Vlite: Simple Vector Database with NumPy
- VLite: Fast vector db written in NumPy
gpt-2
-
What are LLMs? An intro into AI, models, tokens, parameters, weights, quantization and more
Medium models: Roughly between 1B to 10B parameters. This is where Mistral 7B, Phi-3, Gemma from Google DeepMind, and wizardlm2 sit. Fun fact: GPT 2 was a medium sized model, much smaller than its latest versions.
- Sam Altman is still trying to return as OpenAI CEO
- Build Personal ChatGPT Using Your Data
-
Are the recent advancements in AI technology primarily driven by recent discoveries or the progress in hardware capabilities and the abundance of available data?
"Our model, called GPT-2 (a successor to GPT), was trained simply to predict the next word in 40GB of Internet text. Due to our concerns about malicious applications of the technology, we are not releasing the trained model. As an experiment in responsible disclosure, we are instead releasing a much smaller model for researchers to experiment with, as well as a technical paper. "
-
BING IS NOW THE DEFAULT SEARCH FOR CHATGPT
They did release GPT-2 under the MIT License.
-
Don Knuth Plays with ChatGPT
Did you arrive at this certainty through reading something other than what OpenAI has published? The document [0] that describes the training data for GPT-2 makes this assertion hilarious to me.
[0]: https://github.com/openai/gpt-2/blob/master/model_card.md#da...
- Was frustriert euch an der Nutzung oder der Diskussion um KI?
- The AI
-
Help with pet project to learn - Running ChatGPT-2 at home
I made a clone of https://github.com/openai/gpt-2 on my local laptop
- По поводу опасности ИИ и предложений остановить разработки на 6 месяцев.
What are some alternatives?
instructor-embedding - [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
dalle-mini - DALL·E Mini - Generate images from a text prompt
PdfGptIndexer - RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for rapid information retrieval and superior search accuracy.
minGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
OpenLLM - Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
openai-cookbook - Examples and guides for using the OpenAI API
gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
FLaNK-ContinuousSQL
sentencepiece - Unsupervised text tokenizer for Neural Network-based text generation.
private-gpt - Interact with your documents using the power of GPT, 100% privately, no data leaks
jukebox - Code for the paper "Jukebox: A Generative Model for Music"