VardaGPT
AgileRL
| | VardaGPT | AgileRL |
---|---|---|
Mentions | 7 | 12 |
Stars | 339 | 503 |
Growth | - | 4.6% |
Activity | 5.8 | 9.8 |
Last commit | 11 months ago | 4 days ago |
Language | Python | Python |
License | - | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
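The page does not publish the formulas behind these metrics, so the following is only a minimal sketch of how such numbers could be computed. The function names, the 30-day half-life, and the percentile-to-score mapping are all assumptions made for illustration; they are not the site's actual method.

```python
from bisect import bisect_left


def month_over_month_growth(stars_now, stars_month_ago):
    """Percentage change in stars over one month."""
    if stars_month_ago <= 0:
        raise ValueError("need a positive star count for the earlier month")
    return 100.0 * (stars_now - stars_month_ago) / stars_month_ago


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Raw activity: each commit's weight halves every `half_life_days`.

    The exponential decay and the 30-day half-life are assumptions; the
    source only says that recent commits weigh more than older ones.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def relative_activity(score, all_scores):
    """Map a raw score to a 0-10 scale by the share of projects scoring lower.

    Assumed mapping: a project ahead of 90% of tracked projects gets 9.0,
    which matches the "top 10%" reading given in the text.
    """
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # number of projects strictly below
    return round(10.0 * below / len(ranked), 1)
```

Under these assumptions, a project whose raw score beats 90 of 100 tracked projects would be reported with an activity of 9.0, and a project going from 100 to 110 stars in a month would show 10% growth.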
VardaGPT
- (1/2) May 2023
  VardaGPT is a memory-enhanced GPT-2 model powered by Hugging Face Transformers and FAISS (https://github.com/ixaxaar/VardaGPT)
- Using ChatGPT to Generate an end-to-end Software Project
- Using ChatGPT to generate a GPT project end-to-end
AgileRL
- [P] Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
- Introducing PPO and Rainbow DQN to our super fast evolutionary HPO reinforcement learning framework
- [P] Significant improvements for multi-agent reinforcement learning!
  Please check it out! https://github.com/AgileRL/AgileRL
- 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
- [P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
- (1/2) May 2023
  Deep Reinforcement Learning library focused on improving development by introducing RLOps - MLOps for reinforcement learning (https://github.com/AgileRL/AgileRL)
- [P] 10x faster reinforcement learning HPO - now for RLHF!
  https://github.com/AgileRL/AgileRL/blob/main/CONTRIBUTING.md has a link to our Discord too
- 10x faster reinforcement learning HPO - now with CNNs!
- [P] 10x faster reinforcement learning HPO - now with CNNs!
- [P] Reinforcement learning evolutionary hyperparameter optimization - 10x speed up
  GitHub: https://github.com/AgileRL/AgileRL
What are some alternatives?
promptfoo - Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
chat-ui - Open source codebase powering the HuggingChat app
RasaGPT - 💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram
RLeXplore - RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven exploration (RIDE).
whisper-writer - 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
loopquest - A Production Tool for Embodied AI
WizardLM - Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder and WizardMath
de-torch - Minimal PyTorch Library for Differential Evolution
My-Medium-Articles-Friendly-Links - Friendly link to all of my medium articles
Muzero - PyTorch implementation of MuZero for gym environments. It supports any Discrete, Box, and Box2D configuration for the action space and observation space.
easy-chat - A ChatGPT UI for young readers, written by ChatGPT
q-learning-algorithms - This repository aims to provide implementations of Q-learning algorithms (DQN, Double-DQN, ...) using PyTorch.