SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python ollama Projects
-
Project mention: Quick and easy local AI RAG setup with JetBrains IDE integration and browser UI | dev.to | 2026-06-02
To get a local web UI (that is very similar to CharGPT) that supports Retrieval Augmented Generation (RAG), workflows and many other features, we'll use Open WebUI (https://github.com/open-webui/open-webui). Although it can be setup using locally installed Python, I've decided to try out their Docker image instead. Since I have an Nvidia card, I've used their Nvidia GPU supported docker image.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
ragflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Project mention: I Scanned 5 Popular Open-Source AI Projects for EU AI Act Compliance. Here's What I Found. | dev.to | 2026-03-31I ran AIR Blackbox (the scanner itself), Browser Use (79K+ stars), RAGFlow (76K+ stars), LiteLLM (23K+ stars), and Superlinked (15K+ stars) through the same compliance checks.
-
kirara-ai
🤖 可 DIY 的 多模态 AI 聊天机器人 | 🚀 快速接入 微信、 QQ、Telegram、等聊天平台 | 🦈支持DeepSeek、Grok、Claude、Ollama、Gemini、OpenAI | 工作流系统、网页搜索、AI画图、人设调教、虚拟女仆、语音对话 |
-
deepwiki-open
Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme
Project mention: Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab Repos | news.ycombinator.com | 2025-10-01 -
Project mention: Preview your coding agents' work without painful CIs or servers; just paste a prompt | dev.to | 2026-05-05
Trying to preview a WIP branch of LangBot :
-
ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
-
shell_gpt
A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.
Project mention: Supercharge Your Terminal: ShellGPT + ChromaDB + LangChain for Context-Aware Automation | dev.to | 2025-09-01🗃 To explore ShellGPT in depth, including installation instructions, usage examples, and advanced configuration options, head over to the official ShellGPT GitHub repository.
-
LEANN
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Project mention: First lightweight local semantic search MCP for Claude Code | news.ycombinator.com | 2025-08-15@Berkeley SkyLab, we’re the first to bring semantic search to Claude Code with a fully local index in a novel, lightweight structure — check it out at LEANN(https://github.com/yichuan-w/LEANN).
-
Code Examples
-
local-deep-research
~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google, ...). 10+ search engines - arXiv, PubMed, your private documents. Everything Local & Encrypted.
Project mention: Local Deep Research: Run Your Own AI Research Assistant, Fully Private | dev.to | 2026-05-06Local Deep Research is a self-hosted AI research assistant. You give it a question. It searches across multiple sources — web, arXiv, PubMed, Wikipedia, GitHub, your own local documents — iterates on what it finds, and produces a structured report with citations.
-
Open-LLM-VTuber
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
# 1. Install uv (Astral's Python package manager — Open-LLM-VTuber requires it) curl -LsSf https://astral.sh/uv/install.sh | sh # 2. Clone git clone https://github.com/Open-LLM-VTuber/Open-LLM-VTuber.git cd Open-LLM-VTuber # 3. Install Python deps (uv resolves everything via pyproject.toml) uv sync # 4. Install ffmpeg (TTS post-processing + audio resampling) brew install ffmpeg # macOS # sudo apt install ffmpeg # Debian/Ubuntu # 5. Pull an LLM ollama pull qwen2.5:7b # 6. Run uv run run_server.py
-
Project mention: Your Agent Doesn't Need That 10,000-Token API Response: Context Offloading with Strands | dev.to | 2026-06-09
Strands Agents just shipped the ContextOffloader plugin. It's available in both the TypeScript and Python SDKs. It prevents large tool results from consuming your agent's context window automatically. When a tool returns a result that exceeds a configurable token threshold, the plugin stores each content block individually in an external storage backend and replaces it in the conversation with a truncated preview plus per-block references. Each offloaded result includes inline guidance telling the agent to use its available tools to selectively access the data it needs.
-
agentops
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including CrewAI, Agno, OpenAI Agents SDK, Langchain, Autogen, AG2, and CamelAI
Project mention: Tune self-correct SQL agent with RL: AgentLightning+verl+vLLM+AgentOps+LangGraph | news.ycombinator.com | 2025-08-11- AgentOps for collecting training data (telemetry): https://github.com/AgentOps-AI/agentops
-
Kiln
Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.
Project mention: Kiln – a free app and open-source library to build better AI products | news.ycombinator.com | 2026-05-12 -
-
whichllm
Find the local LLM that actually runs and performs best on your hardware. Ranked by real, recency-aware benchmarks, not parameter count. One command, run it instantly.
Source: https://github.com/Andyyyy64/whichllm
-
cascadeflow
Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside the agent loop.
Project mention: Why One Model Is Never Enough: Routing Incident Analysis With cascadeflow | dev.to | 2026-05-22This is the story of how I built IncidentOS, an AI-powered operational memory system for SRE teams, and why cascadeflow became the piece that made the runtime actually usable.
-
elia
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
-
-
comfyui_LLM_party
LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces, such as o1,ollama, gemini, grok, qwen, GLM, deepseek, kimi,doubao. Adapted to local llms, vlm, gguf such as llama-3.3 Janus-Pro, Linkage graphRAG
-
openai-edge-tts
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
Project mention: Open source TTS by Resemble (claiming they are sota) | news.ycombinator.com | 2025-06-11It can definitely run on CPU — but I'm not sure if it can run on a machine without a GPU _entirely_.
To be honest, it uses a decently large amount of resources. If you had a GPU, you could expect about 4-5 gb memory usage. And given the optimizations for tensors on GPUs, I'm not sure how well thinks would work "CPU only".
If you try it, let me know. There are some "CPU" Docker builds in the repo you could look at for guidance.
If you want free TTS without using local resources, you could try edge-tts https://github.com/travisvn/openai-edge-tts
-
AudioMuse-AI
AudioMuse-AI uses sonic analysis to rediscover forgotten songs, uncover hidden connections in your music library, and generate intelligent playlists for Jellyfin, Navidrome, LMS, Lyrion, and Emby: no metadata or external services required.
Project mention: AudioMuse-AI: Local Sonic Analysis for Auto-Playlists on Jellyfin and Navidrome | news.ycombinator.com | 2025-12-14 -
Python ollama discussion
Python ollama related posts
-
OpenBrief Review: Local-First Video AI Summarizer 2026
-
Open WebUI: Jupyter code execution works despite ENABLE_CODE_EXECUTION=false
-
Thoth v3.20.0 – Full Linux Support, MiniMax Integration, Reliability Upgrades
-
Thoth – open-source Local-first AI Assistant
-
Stop paying for AI transcription! 🎙️ WritHer: 100% Local Voice Assistant for Windows. Privacy-first, Whisper + Ollama powered. Open Source on GitHub!
-
Open WebUI adds desktop app with task scheduling
-
Edster – An open-source local AI agent with swarm mode and a web UI
-
A note from our sponsor - SaaSHub
www.saashub.com | 9 Jun 2026
Index
What are some of the best open-source ollama projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | open-webui | 139,852 |
| 2 | ragflow | 81,919 |
| 3 | kirara-ai | 18,785 |
| 4 | deepwiki-open | 16,731 |
| 5 | LangBot | 16,193 |
| 6 | ChuanhuChatGPT | 15,318 |
| 7 | shell_gpt | 12,110 |
| 8 | LEANN | 11,827 |
| 9 | ollama-python | 10,118 |
| 10 | local-deep-research | 8,374 |
| 11 | Open-LLM-VTuber | 8,296 |
| 12 | harness-sdk | 6,020 |
| 13 | agentops | 5,605 |
| 14 | Kiln | 4,867 |
| 15 | Devon | 3,447 |
| 16 | whichllm | 2,716 |
| 17 | cascadeflow | 2,468 |
| 18 | elia | 2,437 |
| 19 | oterm | 2,388 |
| 20 | comfyui_LLM_party | 2,271 |
| 21 | openai-edge-tts | 1,914 |
| 22 | AudioMuse-AI | 1,840 |
| 23 | Alpaca | 1,563 |