Document Type

Article

Publication Date

2023

Abstract

Although chatbots such as ChatGPT can facilitate cost-effective text generation and editing, factually incorrect responses (hallucinations) limit their utility. This study evaluates one particular type of hallucination: fabricated bibliographic citations that do not represent actual scholarly works. We used ChatGPT-3.5 and ChatGPT-4 to produce short literature reviews on 42 multidisciplinary topics, compiling data on the 636 bibliographic citations (references) found in the 84 papers. We then searched multiple databases and websites to determine the prevalence of fabricated citations, to identify errors in the citations to non-fabricated papers, and to evaluate adherence to APA citation format. Within this set of documents, 55% of the GPT-3.5 citations but just 18% of the GPT-4 citations are fabricated. Likewise, 43% of the real (non-fabricated) GPT-3.5 citations but just 24% of the real GPT-4 citations include substantive citation errors. Although GPT-4 is a major improvement over GPT-3.5, problems remain.

Comments

This article was originally published in Scientific Reports, available at https://doi.org/10.1038/s41598-023-41032-5

This work is published under a Creative Commons Attribution 4.0 International License.

Download

Included in

Artificial Intelligence and Robotics Commons, Scholarly Communication Commons

COinS

Publications and Research

Fabrication and errors in the bibliographic citations generated by ChatGPT

Document Type

Publication Date

Abstract

Comments

Included in

Browse

Author Corner

Search

Links

Publications and Research

Fabrication and errors in the bibliographic citations generated by ChatGPT

Authors

Document Type

Publication Date

Abstract

Comments

Included in

Share

Browse

Author Corner

Search

Links