It's true that other engines like vLLM are much faster and more optimized. I started with Ollama because its codebase is Go. In reality, Ollama doesn't even take full advantage of llama.cpp itself: it doesn't implement concurrency, and it adds latency by passing JSON through a CGO call. I discovered that while building the wasm plugin and was disappointed; solving it isn't on Ollama's priority list either, see https://github.com/ollama/ollama/issues/3170
Another advantage of Ollama is that it runs easily locally, and so does the wasm plugin, which accomplishes the goal of a local development environment using dreamland.
That's great feedback. I was thinking about fixing the concurrency issue myself, but creating a vLLM wasm plugin is a better idea. User code won't need to change as long as the plugin exports the same wasm host module.