ollama
openrouter-runner
| ollama | openrouter-runner | |
|---|---|---|
| 750 | 58 | |
| 173,924 | 1,140 | |
| 2.0% | - | |
| 9.9 | 5.2 | |
| about 13 hours ago | 9 months ago | |
| Go | Python | |
| MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ollama
-
Set Up Your Own ChatGPT: Ollama + Open WebUI for Data That Never
Download: Go to https://ollama.com/ and click on the download link for your operating system.
-
I Built a Free, Fully Local AI Resume Builder — No Subscriptions, No Cloud, No Catch
Most AI resume tools call out to OpenAI or Anthropic and charge you for every request. Persona supports Ollama — which means you can run the AI model locally on your own hardware, with zero API costs and zero data leaving your machine.
-
Sovereign Synapse: The Local Brain
To solve these, we built a stack that prioritizes integrity over ease. The centerpiece is Ollama, running the mxbai-embed-large model locally. This is the engine that translates human thought into high-dimensional coordinates.
-
How I Built a Self-Funding AI Lab: From Hobby to Side Income in 6 Months
Ollama for model serving
-
Flat Chat Threads Suck for Reading Books. So I Built a Local-First AI Tree Companion.
Fully offline: Point it at Ollama or LM Studio. Zero cost, nothing leaves your network.
-
Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]
Recommended hardware: The RTX 3060 with 12 GB VRAM is the budget king here — all these models fit with room to spare for KV cache overhead, even Gemma 4:12B (which needs ~8.5–9 GB with overhead). An RTX 4060 Ti 16 GB gives you more headroom. On the Apple side, any M2 or M3 MacBook with 16 GB unified memory handles these models comfortably via Ollama's Metal backend.
-
Run Coding Agents on Local AI — Zero Cloud, Full Control
This guide shows how to swap out every cloud API with a local Ollama server running qwen3-coder:30b. Same tools, same workflows, no data leaving your network.
-
Running Brand-New Gemma 4 12B on an 8-Year-Old GTX 1080 Ti: Speed, 3 Gotchas, and Why Q8 Beat Q4 on My Own Field
Related: 35B MoE on 2× 1080 Ti · Ollama
-
Agent Skills in Microsoft Agent Framework
The sample is a tiny console app running entirely against a local Ollama model — no cloud keys, and every HTTP call is traced so I can see exactly what goes over the wire (complete sample code). There's a single skill on disk:
-
Quick and easy local AI RAG setup with JetBrains IDE integration and browser UI
irm https://ollama.com/install.ps1 | iex
openrouter-runner
-
HOW TO CREATE YOUR OWN SEARCH AGENT USING LANGCHAIN
from langchain_openai import ChatOpenAI from langchain_community.tools import DuckDuckGoSearchRun from langchain.tools import Tool from langchain.memory import ConversationBufferMemory from langchain.agents.agent_types import AgentType from langchain.agents import initialize_agent from dotenv import load_dotenv import os load_dotenv() def webSearchAgent(question): llm = ChatOpenAI( model = "deepseek/deepseek-chat-v3-0324:free", base_url = "https://openrouter.ai/api/v1", api_key = os.getenv("OPENROUTER_API_KEY"), ) search = DuckDuckGoSearchRun() tool = [ Tool ( name = "search", func = search.run, description = "When you want real time data use this", ) ] memory = ConversationBufferMemory(memory_key = "chat_history") agent = initialize_agent( llm = llm, tools = tool, agent = AgentType.CONVERSATIONAL_REACT_DESCRIPTION, memory = memory, verbose = True, handle_parsing_errors = True ) result = agent.run(question) print(f"Agent:\n\n {result}") webSearchAgent("Give me some latest news about Ai")
-
Cracking the Opus: Red Teaming Anthropic’s Giant with Promptfoo
OpenRouter API Key → Create an account at OpenRouter and grab your key
-
How I built a self-hosted AI automation stack without losing my mind
OpenRouter (multi-model API wrapper) https://openrouter.ai
- Reproducible LLM Benchmarking: GPT-5 vs Grok-4 with Promptfoo
-
Building an AI Development Environment with Claude Code Claude Router Open Router
{ "Providers": [ { "name": "openrouter", "api_base_url": "https://openrouter.ai/api/v1/chat/completions", "api_key": "sk-xxx", "models": [ "anthropic/claude-sonnet-4" ], "transformer": { "use": ["openrouter"] } } ], "Router": { "default": "openrouter,anthropic/claude-sonnet-4" }, "LOG": true, "HOST": "127.0.0.1", + "PORT": 23456 }
-
5 tools we wish were on the Awesome AI Tools list
OpenRouter - We love OpenRouter because it allows you to easily try out new models and load balance between models. We actually got an open source contribution for this one recently, so we should be supporting it ❤️
-
The Untold Misadventures of Red Teaming Kimi K2 with Promptfoo
Link: Promptfoo Open Source Tool for Evaluation and Red Teaming Link: OpenRouterFor Moonshot Kimi K2 APIs Link: Kimi K2 Model Model Page
-
How OpenRouter Unlocked Our Workshop Strategy
Then I discovered OpenRouter: "a unified API platform that provides access to a wide range of large language models with intelligent routing and automatic fallbacks." You can use whatever model you want with the same API key. But the feature I really needed was its provisionary API key system, which allowed me to generate one master key and programmatically:
-
Anthropic teams use Claude Code
What’s different is all the open weight models like Kimi-k2 or Qwen-3 Coder that are as good and, depending on the task, better than Anthropic’s Sonnet model for 80% less via openrouter [1] and other similar services.
You can use these models through Claude Code; I do it everyday.
Some developers are running smaller versions of these LLMs on their own hardware, paying no one.
So I don’t think Anthropic and the other companies can dramatically increase their prices without losing the customers that helped them go from $0 to $4 billion in revenue in 3 years.
Users can easily move between different AI platforms with no lock-in, which makes it harder to increase prices and proceed to enshitify their platforms.
[1]: https://openrouter.ai/
- Show HN: I built an LLM chat app because we shouldn't need 10 AI subscriptions
What are some alternatives?
koboldcpp - Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
typingmind - The most advanced Web UI for AI chat
SillyTavern - LLM Frontend for Power Users.
plandex - Open source AI coding agent. Designed for large projects and real world tasks.
textgen - Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
zed - Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.