'A-Team' of Math Proves a Critical Link Between Addition and Sets

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

LeanDojoChatGPT

2 98 5.3 Python

ChatGPT plugin for theorem proving in Lean

Check out this paper:
https://leandojo.org/
People have already trained models to assist suggestion tactics. They then linked it up to ChatGPT to interactively solve proofs.
In this scenario, ChatGPT asks the model for tactic suggestions, applies it to the proof and uses the feedback from Lean to then proceed with the next step.
FYI, The programmatic interface to Lean was written by an OpenAI employee who was on the Lean team a few years ago.
Also, check out Lean’s roadmap. They aspire to position Lean to becoming a target for LLMs because it has been designed for verification from the ground up.
As math and compsci nerds contribute to mathlib, all of those proofs are also building up a huge corpus that will likely be leveraged for both verification and optimization.
If AI can make verification a lot easier, then we’re likely going to see verification change programming similarly to the way it changed electronics.

linc

1 45 4.4 Jupyter Notebook

🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]

Recent work on combining LLMs with theorem provers with promising initial results:
https://paperswithcode.com/paper/linc-a-neurosymbolic-approa...
> Logical reasoning, i.e., deductively inferring the truth value of a conclusion from a set of premises, is an important task for artificial intelligence with wide potential impacts on science, mathematics, and society. While many prompting-based strategies have been proposed to enable Large Language Models (LLMs) to do such reasoning more effectively, they still appear unsatisfactory, often failing in subtle and unpredictable ways. In this work, we investigate the validity of instead reformulating such tasks as modular neurosymbolic programming, which we call LINC: Logical Inference via Neurosymbolic Computation. In LINC, the LLM acts as a semantic parser, translating premises and conclusions from natural language to expressions in first-order logic. These expressions are then offloaded to an external theorem prover, which symbolically performs deductive inference. Leveraging this approach, we observe significant performance gains on FOLIO and a balanced subset of ProofWriter for three different models in nearly all experimental conditions we evaluate. On ProofWriter, augmenting the comparatively small open-source StarCoder+ (15.5B parameters) with LINC even outperforms GPT-3.5 and GPT-4 with Chain-of-Thought (CoT) prompting by an absolute 38% and 10%, respectively. When used with GPT-4, LINC scores 26% higher than CoT on ProofWriter while performing comparatively on FOLIO. Further analysis reveals that although both methods on average succeed roughly equally often on this dataset, they exhibit distinct and complementary failure modes. We thus provide promising evidence for how logical reasoning over natural language can be tackled through jointly leveraging LLMs alongside symbolic provers. All corresponding code is publicly available at https://github.com/benlipkin/linc

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

FinGPT

1 project | news.ycombinator.com | 13 Dec 2023
GPT but for the Finance Industry

1 project | /r/aipromptprogramming | 16 Jun 2023
GPT but for the Finance Industry

1 project | /r/AutoGPT | 16 Jun 2023
GPT but for the Finance Industry

1 project | /r/AI_Agents | 16 Jun 2023
FinGPT: Open-Source Financial Large Language Models

1 project | /r/LocalLLaMA | 13 Jun 2023

'A-Team' of Math Proves a Critical Link Between Addition and Sets

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
chatgpt chatgpt-plugin large-language-models Lean Machine Learning
Post date: 8 Dec 2023

LeanDojoChatGPT

linc

InfluxDB

Related posts

FinGPT

GPT but for the Finance Industry

GPT but for the Finance Industry

GPT but for the Finance Industry

FinGPT: Open-Source Financial Large Language Models

'A-Team' of Math Proves a Critical Link Between Addition and Sets

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com chatgpt chatgpt-plugin large-language-models Lean Machine Learning Post date: 8 Dec 2023

LeanDojoChatGPT

linc

InfluxDB

Related posts

FinGPT

GPT but for the Finance Industry

GPT but for the Finance Industry

GPT but for the Finance Industry

FinGPT: Open-Source Financial Large Language Models

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
chatgpt chatgpt-plugin large-language-models Lean Machine Learning
Post date: 8 Dec 2023