Show HN: SiteGPT – Create ChatGPT-like chatbots trained on your website content

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • langchain

    Discontinued ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain] (by hwchase17)

  • Not to go full "Dropbox in a weekend", but if you're technical enough to self-host, this is something you can build for yourself

    Everyone is going straight to embeddings, but it'd be easy enough to use old school NLP summarization from NLTK (https://www.nltk.org/)

    Hook that up a web scraping library like https://scrapy.org/ and get a summary of each page.

    Then embed a site map in your system prompt and use langchain (https://github.com/hwchase17/langchain) to allow GPT to query for a specific page's summary.

    -

    The point of this isn't to say that's how OP did it, but there might be people seeing stuff like this and wondering how on earth to get into it: This is something you could build in a weekend with pretty much no understanding of AI

  • NLTK

    NLTK Source

  • Not to go full "Dropbox in a weekend", but if you're technical enough to self-host, this is something you can build for yourself

    Everyone is going straight to embeddings, but it'd be easy enough to use old school NLP summarization from NLTK (https://www.nltk.org/)

    Hook that up a web scraping library like https://scrapy.org/ and get a summary of each page.

    Then embed a site map in your system prompt and use langchain (https://github.com/hwchase17/langchain) to allow GPT to query for a specific page's summary.

    -

    The point of this isn't to say that's how OP did it, but there might be people seeing stuff like this and wondering how on earth to get into it: This is something you could build in a weekend with pretty much no understanding of AI

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

  • Not to go full "Dropbox in a weekend", but if you're technical enough to self-host, this is something you can build for yourself

    Everyone is going straight to embeddings, but it'd be easy enough to use old school NLP summarization from NLTK (https://www.nltk.org/)

    Hook that up a web scraping library like https://scrapy.org/ and get a summary of each page.

    Then embed a site map in your system prompt and use langchain (https://github.com/hwchase17/langchain) to allow GPT to query for a specific page's summary.

    -

    The point of this isn't to say that's how OP did it, but there might be people seeing stuff like this and wondering how on earth to get into it: This is something you could build in a weekend with pretty much no understanding of AI

  • marqo

    Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

  • You can get pretty far with this https://github.com/marqo-ai/marqo. Choose your LLM of choice to pair with it. Examples https://github.com/marqo-ai/marqo/blob/mainline/examples/GPT...

  • AIQA

    Question and Answer in nodejs using langchain and chromadb and the OpenAI API for GPT3

  • You could combine this https://github.com/realrasengan/AIQA

    with

    this https://github.com/realrasengan/gpt4all-wrapper-js

    And do it locally on your computer with just a little mod.

  • gpt4all-wrapper-js

    just a wrapper for gpt4all https://github.com/nomic-ai/gpt4all/ for nodejs

  • You could combine this https://github.com/realrasengan/AIQA

    with

    this https://github.com/realrasengan/gpt4all-wrapper-js

    And do it locally on your computer with just a little mod.

  • ask-your-stack

    Ask your stack demo

  • Even better, just ask your stack

    https://ask-your-stack.vercel.app/

    (It uses official docs to provide answers with context)

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts