Today, if someone wants to know what the HN community knows/thinks about a topic, they can either:
A) Search past HN comments on hn.algolia.com, or
B) Post a new 'Ask HN'.
LLMs could provide a new way to find answers within a corpus. These have been described elsewhere, e.g.
- https://github.com/openai/openai-cookbook/blob/main/examples/Question_answering_using_embeddings.ipynb
- https://news.ycombinator.com/item?id=34477543
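Concretely, the recipes in those links boil down to: embed every comment, embed the question, and hand the nearest comments to an LLM as context. A toy sketch of just the retrieval step in plain Python (in a real system the vectors would come from an embedding model, not be hand-written):

```python
import math

def cosine(a, b):
    # cosine similarity between two equal-length vectors
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def top_k(query_vec, corpus, k=2):
    # corpus: list of (comment_text, embedding_vector) pairs
    scored = sorted(corpus, key=lambda tv: cosine(query_vec, tv[1]), reverse=True)
    return [text for text, _ in scored[:k]]
```

The selected comments would then be pasted into the LLM prompt along with the question, exactly as the OpenAI cookbook notebook does.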
I keep expecting someone (maybe minimaxir or simonw?) to post a 'Show HN: Get your question answered by the collective wisdom of HN', but no one has so far (unless I missed the submission?).
Is someone already working on this?
The things I've seen all use hosted language models. For example https://github.com/jerryjliu/gpt_index depends on LangChain, which wraps APIs from hosted LLMs: https://langchain.readthedocs.io/en/latest/reference/modules...
AFAIK there's no GPT-3-like LLM that's easy to run at home, because the number of parameters is so large. Your gaming PC's GPU won't have enough RAM to hold the model.
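The back-of-the-envelope arithmetic: at fp16 each parameter takes 2 bytes, so a GPT-3-scale model (175B parameters) needs roughly 326 GB just for the weights, versus the 8-24 GB on a consumer GPU. A quick sketch (activations and KV cache would add more on top):

```python
def model_ram_gb(n_params, bytes_per_param=2):
    # fp16 weights: 2 bytes per parameter; ignores activations/KV cache
    return n_params * bytes_per_param / 1024**3

# GPT-3 scale, 175B params in fp16 -> ~326 GB
# a 7B-param model in fp16 -> ~13 GB, which is why smaller models are the
# usual target for local inference
```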
Apparently there are two ways to access it on GCP:
https://github.com/ashish01/hn-data-dumps
HN has an API[0]; with a bit of effort you can build one yourself.
[0]https://github.com/HackerNews/API
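For illustration, the official API[0] serves plain JSON over Firebase-style endpoints, so fetching an item is a couple of lines. The URL scheme and item fields below follow the linked docs; an actual fetch would wrap `item_url` with e.g. `urllib.request.urlopen`:

```python
import json

API_BASE = "https://hacker-news.firebaseio.com/v0"  # official HN API base

def item_url(item_id):
    # stories and comments share one /item/<id>.json endpoint
    return f"{API_BASE}/item/{item_id}.json"

def parse_item(raw_json):
    # items look like {"id": ..., "type": "comment", "text": ..., "kids": [...]}
    item = json.loads(raw_json)
    return item.get("type"), item.get("text", "")
```

Walking a whole thread means recursively fetching each id in an item's `kids` list, which is the "bit of effort" part.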