Chain-of-Thought Hub: Measuring LLMs' Reasoning Performance

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • pyllms

    Minimal Python library to connect to LLMs (OpenAI, Anthropic, AI21, Cohere, Aleph Alpha, HuggingfaceHub, Google PaLM2), with a built-in model performance benchmark.

  • For anyone reading this, these are the actual prompts being used to assess the models.

    https://github.com/kagisearch/pyllms/blob/ca9ad4d4bfdd9d58fe...

  • chain-of-thought-hub

    Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Create an AI prototyping environment using Jupyter Lab IDE with Typescript, LangChain.js and Ollama for rapid AI prototyping

    4 projects | dev.to | 2 May 2024
  • Show HN: FileKitty – Combine and label text files for LLM prompt contexts

    4 projects | news.ycombinator.com | 1 May 2024
  • Effortlessly Create an AI Dungeon Master Bot Using Julep and Chainlit

    1 project | dev.to | 1 May 2024
  • An Exploration of Software-defined networks in video streaming, Part Three: Performance of a streaming system over a SDN

    1 project | dev.to | 1 May 2024
  • Image classifier with a convolutional neural network (CNN)

    2 projects | dev.to | 1 May 2024