Evaluating ChatGPT's Information Extraction Capabilities: An Assessment

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

ChatGPT_for_IE

1 133 8.1 Python

Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

Abstract: The capability of Large Language Models (LLMs) like ChatGPT to comprehend user intent and provide reasonable responses has made them extremely popular lately. In this paper, we focus on assessing the overall ability of ChatGPT using 7 fine-grained information extraction (IE) tasks. Specially, we present the systematically analysis by measuring ChatGPT's performance, explainability, calibration, and faithfulness, and resulting in 15 keys from either the ChatGPT or domain experts. Our findings reveal that ChatGPT's performance in Standard-IE setting is poor, but it surprisingly exhibits excellent performance in the OpenIE setting, as evidenced by human evaluation. In addition, our research indicates that ChatGPT provides high-quality and trustworthy explanations for its decisions. However, there is an issue of ChatGPT being overconfident in its predictions, which resulting in low calibration. Furthermore, ChatGPT demonstrates a high level of faithfulness to the original text in the majority of cases. We manually annotate and release the test sets of 7 fine-grained IE tasks contains 14 datasets to further promote the research. The datasets and code are available at this https URL - https://github.com/pkuserc/ChatGPT_for_IE

Scout Monitoring

www.scoutapm.com featured

Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Show HN: AiDAPal – An IDA Pro plugin and fine-tuned LLM for reverse engineering

1 project | news.ycombinator.com | 10 Jun 2024
Nvidia-patch: removes restriction simultaneous video encoding sessions

1 project | news.ycombinator.com | 10 Jun 2024
A macOS and Windows open source alternative to ChatGPT

1 project | news.ycombinator.com | 10 Jun 2024
AIM Weekly for 10 June 2024

23 projects | dev.to | 10 Jun 2024
How To Build a Simple GitHub Action To Deploy a Django Application to the Cloud

3 projects | dev.to | 10 Jun 2024

Evaluating ChatGPT's Information Extraction Capabilities: An Assessment

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Post date: 25 Apr 2023

ChatGPT_for_IE

Scout Monitoring

Related posts

Show HN: AiDAPal – An IDA Pro plugin and fine-tuned LLM for reverse engineering

Nvidia-patch: removes restriction simultaneous video encoding sessions

A macOS and Windows open source alternative to ChatGPT

AIM Weekly for 10 June 2024

How To Build a Simple GitHub Action To Deploy a Django Application to the Cloud