codespin
ollama
codespin | ollama | |
---|---|---|
5 | 226 | |
57 | 72,781 | |
- | 14.0% | |
9.5 | 9.9 | |
6 days ago | 4 days ago | |
TypeScript | Go | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
codespin
-
GPT-4 Turbo with Vision is a step backwards for coding
Shameless plug. I have a VS Code extension that's very nearly ready.
Codespin CLI tools (ready to use): https://github.com/codespin-ai/codespin
VS Code extension for the CLI tool (soon): https://www.youtube.com/watch?v=2TJqosFmkao
I'll do a Show HN in a week or two.
-
LLMs and Programming in the first days of 2024
Shameless plug: https://github.com/codespin-ai/codespin-cli
It's similar to aider (which is a great tool btw) in goals, but with a different recipe.
-
Copying Angry Birds with nothing but AI
That AI is transformative for development is not in doubt any more. Just this past week, I've been able to build two medium sized services (a couple of thousand lines of code in python, a language I hadn't used for more than a decade!). What's truly impressive is that for the large part, it's better than the code I'd have written anyway. Want a nice README.md? Just provide the source code that contains routes/cli args/whatever, and it'll generate it for you. Want tests? Sure. Developers have never had it so easy.
Another thing to note is that for code generation, GPT4 runs circles around GPT3.5. GPT35 is alright at copying if you provide very tight examples, but GPT4 kinda "thinks".
Shameless plug: I have this open source app which automates a lot of grunt work in prompt generation - https://github.com/codespin-ai/codespin-cli
- An Open Source Node.JS-based CLI tool for Generating Code using GPT
- CodeSpin: Code generation framework and tools using OpenAI APIs
ollama
-
SpringAI, llama3 and pgvector: bRAGging rights!
To support the exploration, I've developed a simple Retrieval Augmented Generation (RAG) workflow that works completely locally on the laptop for free. If you're interested, you can find the code itself here. Basically, I've used Testcontainers to create a Postgres database container with the pgvector extension to store text embeddings and an open source LLM with which I send requests to: Meta's llama3 through ollama.
-
RAG with OLLAMA
Note: Before proceeding further you need to download and run Ollama, you can do so by clicking here.
-
Ollama 0.1.42
`file://*` URLs are now allowed => ollama works with simple html files now
https://github.com/ollama/ollama/commit/1a29e9a879433fc55cf1...
-
How to setup a free, self-hosted AI model for use with VS Code
This guide assumes you have a supported NVIDIA GPU and have installed Ubuntu 22.04 on the machine that will host the ollama docker image. AMD is now supported with ollama but this guide does not cover this type of setup.
-
beginner guide to fully local RAG on entry-level machines
Nowadays, running powerful LLMs locally is ridiculously easy when using tools such as ollama. Just follow the installation instructions for your #OS. From now on, we'll assume using bash on Ubuntu.
- Codestral: Mistral's Code Model
- AIM Weekly 27 May 2024
-
Devoxx Genie Plugin : an Update
I focused on supporting Ollama, GPT4All, and LMStudio, all of which run smoothly on a Mac computer. Many of these tools are user-friendly wrappers around Llama.cpp, allowing easy model downloads and providing a REST interface to query the available models. Last week, I also added "👋🏼 Jan" support because HuggingFace has endorsed this provider out-of-the-box.
- Ask HN: Are companies self hosting LLMs?
- Ollama v0.1.39 Pre-release. Support Phi-3 Medium
What are some alternatives?
matter-js - a 2D rigid body physics engine for the web ▲● ■
llama.cpp - LLM inference in C/C++
nitter - Alternative Twitter front-end
gpt4all - gpt4all: run open-source LLMs anywhere
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
private-gpt - Interact with your documents using the power of GPT, 100% privately, no data leaks
LocalAI - :robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
llama - Inference code for Llama models
koboldcpp - A simple one-file way to run various GGML and GGUF models with KoboldAI's UI
exllama - A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
text-generation-inference - Large Language Model Text Generation Inference
litellm - Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)