Show simple item record

dc.contributor.authorDhingra, S
dc.contributor.authorSingh, M
dc.contributor.authorVaisakh, SB
dc.contributor.authorMalviya, N
dc.contributor.authorGill, SS
dc.date.accessioned2023-09-07T14:50:19Z
dc.date.available2023-08-28
dc.date.available2023-09-07T14:50:19Z
dc.date.issued2023-09-06
dc.identifier.issn2772-4859
dc.identifier.other100139
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/90567
dc.description.abstractCognitive psychology delves on understanding perception, attention, memory, language, problem-solving, decision-making, and reasoning. Large Language Models (LLMs) are emerging as potent tools increasingly capable of performing human-level tasks. The recent development in the form of Generative Pre-trained Transformer 4 (GPT-4) and its demonstrated success in tasks complex to humans exam and complex problems has led to an increased confidence in the LLMs to become perfect instruments of intelligence. Although GPT-4 report has shown performance on some cognitive psychology tasks, a comprehensive assessment of GPT-4, via the existing well-established datasets is required. In this study, we focus on the evaluation of GPT-4’s performance on a set of cognitive psychology datasets such as CommonsenseQA, SuperGLUE, MATH and HANS. In doing so, we understand how GPT-4 processes and integrates cognitive psychology with contextual information, providing insight into the underlying cognitive processes that enable its ability to generate the responses. We show that GPT-4 exhibits a high level of accuracy in cognitive psychology tasks relative to the prior state-of-the-art models. Our results strengthen the already available assessments and confidence on GPT-4’s cognitive psychology abilities. It has significant potential to revolutionise the field of Artificial Intelligence (AI), by enabling machines to bridge the gap between human and machine reasoning.en_US
dc.format.extent100139 - 100139
dc.publisherElsevieren_US
dc.relation.ispartofBenchCouncil Transactions on Benchmarks Standards and Evaluations
dc.rightsAttribution 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/us/*
dc.titleMind meets machine: Unravelling GPT-4’s cognitive psychologyen_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.tbench.2023.100139
pubs.author-urlhttps://arxiv.org/abs/2303.11436en_US
pubs.issue3en_US
pubs.notesNot knownen_US
pubs.publication-statusPublisheden_US
pubs.publisher-urlhttps://www.sciencedirect.com/science/article/pii/S277248592300056Xen_US
pubs.volume3en_US
dcterms.dateAccepted2023-08-28


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution 3.0 United States
Except where otherwise noted, this item's license is described as Attribution 3.0 United States