gpt_index VS sketch

Compare gpt_index and sketch to see how they differ.

gpt_index

LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLMs with external data. [Moved to: https://github.com/jerryjliu/llama_index] (by jerryjliu)
                gpt_index           sketch
Mentions        48                  20
Stars           7,332               2,198
Growth          -                   2.0%
Activity        9.8                 4.4
Last commit     about 1 year ago    3 months ago
Language        Python              Python
License         MIT License         MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

gpt_index

Posts with mentions or reviews of gpt_index. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-24.

sketch

Posts with mentions or reviews of sketch. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-05.
  • Ask HN: What have you built with LLMs?
    43 projects | news.ycombinator.com | 5 Feb 2024
    We've made a lot of data tooling things based on LLMs, and are in the process of rebranding and launching our main product.

    1. sketch (in notebook, ai for pandas) https://github.com/approximatelabs/sketch

    2. datadm (open source, "chat with data", with support for the open source LLMs) https://github.com/approximatelabs/datadm

    3. Our main product: julyp. https://julyp.com/ (currently under very active rebrand and cleanup) -- a "chat with data" style app, with a lot of specialized features. I'm also streaming myself using it (and sometimes building it) every weekday on twitch to solve misc data problems (https://www.twitch.tv/bluecoconut)

    For your next question, about the stack and deploy:

  • Pandas AI – The Future of Data Analysis
    7 projects | news.ycombinator.com | 17 May 2023
    This morning I added a "Related Projects" [3] Section to the Buckaroo docs. If Buckaroo doesn't solve your problem, look at one of the other linked projects (like Mito).

    [1] https://github.com/approximatelabs/sketch

    [2] https://github.com/paddymul/buckaroo

    [3] https://buckaroo-data.readthedocs.io/en/latest/FAQ.html

  • Ask HN: What's your favorite GPT powered tool?
    16 projects | news.ycombinator.com | 15 May 2023
    For GPT/Copilot style help for pandas, in a notebook REPL flow (without needing to install plugins), I built sketch. I genuinely use it every time I'm working on pandas dataframes for a quick one-off analysis. It just makes the iteration loop so much faster. (Specifically the `.sketch.howto`; anecdotally, I actually don't use `.sketch.ask` anymore.)

    https://github.com/approximatelabs/sketch

  • RasaGPT: First headless LLM chatbot built on top of Rasa, Langchain and FastAPI
    13 projects | news.ycombinator.com | 8 May 2023
    https://github.com/approximatelabs/lambdaprompt It has served all of my personal use-cases since making it, including powering `sketch` (copilot for pandas) https://github.com/approximatelabs/sketch

    Core things it does: Uses jinja templates, does sync and async, and most importantly treats LLM completion endpoints as "function calls", which you can compose and build structures around just with simple python. I also combined it with fastapi so you can just serve up any templates you want directly as rest endpoints. It also offers callback hooks so you can log & trace execution graphs.

    Altogether it's only ~600 lines of Python.

    I haven't had a chance to really push all the different examples out there, but most "complex behaviors", so there aren't many patterns to copy. But if you're comfortable in python, then I think it offers a pretty good interface.

    I hope to get back to it sometime in the next week to introduce local-mode (e.g. all the open source smaller models are now available, and I want to make those first-class)

  • [D] The best way to train an LLM on company data
    7 projects | /r/MachineLearning | 29 Mar 2023
    Please look at sketch and the langchain pandas/SQL plugins. I have seen excellent results with both of these approaches. Both will require you to send metadata to OpenAI.
  • Meet Sketch: An AI code Writing Assistant For Pandas
    1 project | /r/machinelearningnews | 1 Feb 2023
    👉 Understand your data through questions
    👉 Create code from plain text
    Quick Read: https://www.marktechpost.com/2023/02/01/meet-sketch-an-ai-code-writing-assistant-for-pandas/
    GitHub: https://github.com/approximatelabs/sketch
  • Replacing a SQL analyst with 26 recursive GPT prompts
    5 projects | news.ycombinator.com | 25 Jan 2023
    (3) Asking for re-writes of failed queries (happens occasionally) also helps

    The main challenge I think with a lot of these "look it works" tools for data applications, is how do you get an interface that actually will be easy to adopt. The chat-bot style shown here (discord and slack integration) I can see being really valuable, as I believe there has been some traction with these style integrations with data catalog systems recently. People like to ask data questions to other people in slack, adding a bot that tries to answer might short-circuit a lot of this!

    We built a prototype where we applied similar techniques to the pandas-code-writing part of the stack, trying to help keep data scientists / data analysts "in flow", integrating the code answers in notebooks (similar to how co-pilot puts suggestions in-line) -- and released https://github.com/approximatelabs/sketch a little while ago.

  • FLiP Stack Weekly for 21 Jan 2023
    19 projects | dev.to | 23 Jan 2023
    Python AI Helper https://github.com/approximatelabs/sketch
  • LangChain: Build AI apps with LLMs through composability
    8 projects | news.ycombinator.com | 17 Jan 2023
  • Show HN: Sketch – AI code-writing assistant that understands data content
    1 project | /r/patient_hackernews | 16 Jan 2023
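
The lambdaprompt comment in the posts above describes treating LLM completion endpoints as "function calls" that can be composed with plain Python, offered in sync and async forms, with callback hooks for logging. Below is a minimal sketch of that general idea only. It is not lambdaprompt's API: `fake_complete`, `prompt_function`, and `to_async` are hypothetical names, the standard library's `string.Template` stands in for Jinja templates, and the "endpoint" is an offline stub that upper-cases its prompt.

```python
import asyncio
from string import Template
from typing import Callable, Optional

# Hypothetical stand-in for an LLM completion endpoint. It upper-cases the
# prompt so the example runs offline; a real endpoint would call a model.
def fake_complete(prompt: str) -> str:
    return prompt.upper()

def prompt_function(
    template: str,
    complete: Callable[[str], str] = fake_complete,
    on_call: Optional[Callable[[str, str], None]] = None,
) -> Callable[..., str]:
    """Turn a prompt template into an ordinary Python function."""
    compiled = Template(template)  # assumption: string.Template instead of Jinja

    def call(**variables: str) -> str:
        prompt = compiled.substitute(**variables)
        result = complete(prompt)
        if on_call is not None:
            on_call(prompt, result)  # callback hook, e.g. for logging or tracing
        return result

    return call

def to_async(fn: Callable[..., str]):
    """Wrap a sync prompt function so it can be awaited."""
    async def call(**variables: str) -> str:
        return await asyncio.to_thread(fn, **variables)
    return call

trace = []  # filled by the callback hook below
summarize = prompt_function(
    "Summarize: $text", on_call=lambda prompt, result: trace.append(prompt)
)
translate = prompt_function("Translate to $language: $text")

# Composition is plain function composition.
def summarize_then_translate(text: str, language: str) -> str:
    return translate(language=language, text=summarize(text=text))
```

Because each prompt is just a function, chaining two prompts needs no framework: `summarize_then_translate("hello", "French")` feeds the first result into the second template, and `asyncio.run(to_async(summarize)(text="hi"))` runs the same function from async code.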

What are some alternatives?

When comparing gpt_index and sketch you can also consider the following projects:

langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]

RasaGPT - 💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram

llama - Inference code for Llama models

lmql - A language for constraint-guided and efficient LLM programming.

awesome-chatgpt-prompts - This repo includes ChatGPT prompt curation to use ChatGPT better.

pandas-ai - Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

nanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs.

rasa - 💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

finetuner - 🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

viper - Simple, expressive pipeline syntax to transform and manipulate data with ease