rubygpt
GPTCache
| | rubygpt | GPTCache |
|---|---|---|
| Mentions | 1 | 45 |
| Stars | 8 | 8,066 |
| Growth | - | 0.7% |
| Activity | 3.2 | 2.4 |
| Last commit | about 3 years ago | 11 months ago |
| Language | Ruby | Python |
| License | - | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
rubygpt
-
Google Releases Bard, Its Competitor in the Race to Create A.I. Chatbots
I've got an IRC channel full of ChatGPT bots that answer questions from different perspectives (using [1] if anyone wants something similar). This comment made me realize it'd probably be extremely interesting to also let them debate between themselves to come to a singular answer -- and it'd also be interesting to see which of the popular language models most often "yield" to each other in bot-vs-bot battles (rap or otherwise). Probably a lot of exciting meta-analytics on models to be done here.
[1] https://github.com/drusepth/rubygpt/blob/master/council.rb
GPTCache
-
What's semantic caching?
GPTCache — an open source library built specifically for caching LLM responses. Pretty flexible and worth looking at if you're rolling your own pipeline.
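To make the idea of a semantic cache concrete, here is a minimal toy sketch. It is not GPTCache's API: the names `embed`, `cosine`, and `ToySemanticCache` are hypothetical, and the bag-of-words "embedding" is an assumption standing in for the learned embedding model and vector store a real system would use. The point is only that lookups match on similarity rather than on exact string equality.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding (assumption): lowercase bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class ToySemanticCache:
    """Hypothetical cache that returns a stored response for similar prompts."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold
        self.entries = []  # list of (vector, response) pairs

    def put(self, prompt, response):
        self.entries.append((embed(prompt), response))

    def get(self, prompt):
        """Return the best-matching response, or None if nothing is close enough."""
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vector, response in self.entries:
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None
```

A prompt that differs only in casing or punctuation from a stored one scores close to 1.0 and is served from the cache, while an unrelated prompt falls below the threshold and would be sent to the model. The threshold value of 0.8 is illustrative and would need tuning for any real workload.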
-
Design Recipe: Observability Pyramid for LLM Infrastructure
Production Sampling: In high-load systems, evaluating 100% of traffic with GPT-4o is financial suicide. Use sampling (1–5%). Additionally, implement judge caching (GPTCache, LangChain cache, or vLLM prefix caching). This is especially effective when users ask similar questions: the same prompt+context may come up for evaluation multiple times, but you pay only once.
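The sampling-plus-judge-caching recipe above can be sketched roughly as follows. This is an illustration under stated assumptions, not the API of GPTCache, LangChain, or vLLM: `SampledJudge`, `judge_fn`, and `maybe_evaluate` are hypothetical names, the cache is a plain in-memory dict keyed on an exact hash of prompt and context, and `judge_fn` stands in for whatever call sends the pair to the judge model.

```python
import hashlib
import random


class SampledJudge:
    """Hypothetical wrapper: evaluate only a sample of traffic, cache verdicts."""

    def __init__(self, judge_fn, sample_rate=0.05, rng=None):
        if not 0.0 <= sample_rate <= 1.0:
            raise ValueError("sample_rate must be between 0 and 1")
        self.judge_fn = judge_fn          # assumed callable: (prompt, context) -> verdict
        self.sample_rate = sample_rate    # e.g. 0.01 to 0.05 for 1-5% of traffic
        self.rng = rng or random.Random()
        self.cache = {}                   # exact-match cache: key -> verdict
        self.judge_calls = 0              # number of paid judge invocations

    @staticmethod
    def _key(prompt, context):
        # A separator byte keeps ("ab", "c") distinct from ("a", "bc").
        raw = f"{prompt}\x00{context}".encode("utf-8")
        return hashlib.sha256(raw).hexdigest()

    def maybe_evaluate(self, prompt, context):
        """Return a verdict for sampled requests, or None when the request is skipped."""
        if self.rng.random() >= self.sample_rate:
            return None
        key = self._key(prompt, context)
        if key not in self.cache:
            self.judge_calls += 1
            self.cache[key] = self.judge_fn(prompt, context)
        return self.cache[key]
```

With this shape, a repeated prompt+context pair that is sampled again is answered from the dict, so the judge is paid once per distinct pair. An exact-hash key only catches identical inputs; catching merely similar questions would need a similarity-based cache instead.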
-
Ask HN: What are the drawbacks of caching LLM responses?
Just found this: https://github.com/zilliztech/GPTCache which seems to address this idea/issue.
-
Open Source Advent Fun Wraps Up!
21. GPTCache | Github | tutorial
- Semantic Cache
-
Show HN: Danswer – open-source question answering across all your docs
Check this out. Built on a vector database (https://github.com/milvus-io/milvus) and a semantic cache (https://github.com/zilliztech/GPTCache)
https://osschat.io/
- GPTCache
-
Ask HN: Is LLM Caching Necessary?
With the proliferation of large models, an increasing number of enterprises and individual developers are now developing applications based on these models. As such, it is worth considering whether large model caching is necessary during the development process.
Our project: https://github.com/zilliztech/GPTCache
-
Gorilla-CLI: LLMs for CLI including K8s/AWS/GCP/Azure/sed and 1500 APIs
Maybe [GPTCache](https://github.com/zilliztech/GPTCache) can make it more attractive, because similar queries can then be answered more cheaply and more quickly. Of course, the specific configuration needs to be based on real usage scenarios.
What are some alternatives?
awesome-chatgpt - 🧠 A curated list of awesome ChatGPT resources, including libraries, SDKs, APIs, and more. 🌟 Please consider supporting this project by giving it a star.
DB-GPT - open-source agentic AI data assistant for the next generation of AI + Data products.
ChatFiles - Document Chatbot — multiple files. Powered by GPT / Embedding.
geetanjali - RAG-powered ethical decision guidance from Bhagavad Geeta. Analyze dilemmas, get structured recommendations with verse citations.
ChatGPT - OpenAI API Free Reverse Proxy
shell_gpt - A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.