awesome-ml
AGiXT
awesome-ml | AGiXT | |
---|---|---|
27 | 26 | |
1,422 | 2,456 | |
- | - | |
8.8 | 9.9 | |
14 days ago | 3 days ago | |
Python | ||
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-ml
-
AI Infrastructure Landscape
I do something like that for open source:
https://github.com/underlines/awesome-ml
But it lost a bit of traction lately.
It needs re-work for the categories, or better, a tagging system, because these products and libraries can sit in more than one space.
Plus it either needs massive collaboration, or some form of automation (with an LLM and indexer), as I can't keep up with it.
-
OpenVoice: Versatile Instant Voice Cloning
This aera is barely new. Look at how old some of the projects are:
https://github.com/underlines/awesome-ml/blob/master/audio-a...
The thing that changes is the complexity to run it. I was training my wife's voice and my voice for fun and needed 15min of audio and trained on my 3080 for 40 minutes.
Now it's 2 Minutes.
-
Show HN: Floneum, a graph editor for local AI workflows
Thanks for your clarifications. I added it to my awesome list:
https://github.com/underlines/awesome-marketing-datascience/...
-
AI for AWS Documentation
RAG is very difficult to do right. I am experimenting with various RAG projects from [1]. The main problems are:
- Chunking can interfer with context boundaries
- Content vectors can differ vastly from question vectors, for this you have to use hypothetical embeddings (they generate artificial questions and store them)
- Instead of saving just one embedding per text-chuck you should store various (text chunk, hypothetical embedding questions, meta data)
- RAG will miserably fail with requests like "summarize the whole document"
- to my knowledge, openAI embeddings aren't performing well, use a embedding that is optimized for question answering or information retrieval and supports multi language. Also look into instructor embeddings: https://github.com/embeddings-benchmark/mteb
1 https://github.com/underlines/awesome-marketing-datascience/...
-
Explore and compare the parameters of top-performing LLMs
I do the same and with currently with 700+ github stars people seem to like it, but it's still curated/manual, because the hf search API is so limited and I don't have the time to create a scraper.
-
Vicuna v1.3 13B and 7B released, trained with twice the amount of ShareGPT data
Added to the list
-
Useful Links and Info
I keep mine fairly up to date as well, almost daily: https://github.com/underlines/awesome-marketing-datascience/blob/master/README.md
- How to keep track of all the LLMs out there?
-
Run and create custom ChatGPT-like bots with OpenChat
Disclaimer: I am curating LLM-tools on github [1]
A few thoughts:
* allow for custom endpoint URLs, this way people can use open source LLMs with a fake openAI API backend like basaran[2] or llama-api-server[3]
* look into better embedding methods for info-retrieval like InstructorEmbeddings or Document Summary Index
* Don't use a single embedding per content item, use multiple to increase retrieval quality
1 https://github.com/underlines/awesome-marketing-datascience/...
2 https://github.com/hyperonym/basaran
3 https://github.com/iaalm/llama-api-server
-
Seeking clarification about LLM's, Tools, etc.. for developers.
Oobabooga isn't a wrapper for llama.cpp, but it can act as such. A usual Oobabooga installation on windows will use a GPTQ wheel (binary) compiled for cuda/windows, or alternatively use llama.cpp's API and act as a GUI. On Linux you had the choice to use the triton or cuda branch for GPTQ, but I don't know if that is still the case. You can also go the route to use virtualized and hardware accelerated WSL2 Ubuntu on Windows and use anything similar to linux. See my guide
AGiXT
-
Conversational "memory loss"?
If you are more interested in AI assistants check out AGiXT. It has some really cool features but it is under heavy development. Not everything works jet and updates break sometimes already working functions. But it is still far better than babyAGI and other proof of concepts.
-
Microsoft Research proposes new framework, LongMem, allowing for unlimited context length along with reduced GPU memory usage and faster inference speed. Code will be open-sourced
That's exactly my goal right now too! I have been trying to figure out how to use AGiXT agents to read and write to an "Adventurer's Log" text file to try to mimic a long term memory but honestly I'm not good enough with any of this to get it working yet. The idea I've got rn is that there'd be a DM agent which takes your input and then there'd be "memory" agents which would check text files such as "Adventurer's Log" and "Character Interactions/Relationships" to keep a contiguous understanding of what each character has done, who they've met, what they've been told/haven't been told by certain characters about their motivations. I'm sure there's someone *much* more talented than me working on this already, at this point I've sort of given up on the idea and I'm just waiting for someone to come out with a Tavern style interface where I can paste in world details and character details and just get going!
- AGiXT: A local automation platform with memories and SmartGPT-like prompting. Works with Ooba/LCPP/GPT4All, and more
-
What are the best AI tools you've ACTUALLY used?
AGiXT: A Python package for AGI research.
-
?Best LLM service for a tiny home server
Even if my (for example, privateGPT) LLM is glacially slow I'd still love to be able to say "Mr Holmes, have Mrs Doubtfire verb the data object in order to verb a product for me, please." (eg: analyse the wikipedia article on the peace of westfalia in order to ELI5 a short summary of it). Hopefully she'd crunch away at the data, and at my convenience, I could have her brief me on her conclusions. I'm sure folks here would do something more clever using AGiXT, or having the old girl prepare lesson-plans for Mycroft to deliver (I just think that sort of thing is world-changing-bonkers for anyone wanting to learn anything, perhaps for kids one day), but I'd have to work up to that.
-
LlamaCPP and LangChain Agent Quality
Keep an eye on this project as well. https://github.com/Josh-XT/AGiXT
- Using the right prompt format makes responses so much better
-
How big of a jump is 13B Vicuna Uncensored vs 30B Vicuna Uncensored?
File upload and automatic agents. It exists it is just buggy. They are working at an insane pace building it. It is practically broke 90% of the time. Maybe it's working better right now. I had success with v1.1.31 as well. https://github.com/Josh-xt/AGiXT
-
Langchain, Langchain.js, vs AutoGPT for local agent development
Maybe you want to check out josh-xt/AGiXT it has its roots in langchain so you can see what the prompts look like and the code. They have made a lot of tools as well although you are going to have issues getting it to work. The newest version kinda works and version 1.1.31 I had the fast API backend working. Maybe you can help them out. They need more people to show them bugs. https://github.com/Josh-XT/AGiXT
-
Is there an alternative to AgentGPT that I can run on my CPU with 32 GB of RAM?
https://github.com/Josh-XT/AGiXT I have tested this one and it is pretty much the same as AgentGPT, supports many providers + many local models (you can even make it work with oobabooga api which is pretty easy), don’t wait for insane results, the problem right now is context length with the local models, probably going to be an old issue in a few weeks we hope ;)
What are some alternatives?
anything-llm - The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
AgentOoba - An autonomous AI agent extension for Oobabooga's web ui
OpenChat - LLMs custom-chatbots console ⚡
private-gpt - Interact with your documents using the power of GPT, 100% privately, no data leaks
llama-mps - Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2
AgentGPT - 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
mnotify - A matrix cli client
babyagi
mteb - MTEB: Massive Text Embedding Benchmark
vault-ai - OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.
simonwillisonblog - The source code behind my blog
guidance - A guidance language for controlling large language models. [Moved to: https://github.com/guidance-ai/guidance]