Beautiful Virgin Islands

Wednesday, Dec 10, 2025

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."
Researchers found students to have fared better at accounting exams than ChatGPT, OpenAI's chatbot product.

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."

The researchers from Brigham Young University (BYU), US, and 186 other universities wanted to know how OpenAI's technology would fare on accounting exams. They have published their findings in the journal Issues in Accounting Education.

In the researchers' accounting exam, students scored an overall average of 76.7 per cent, compared to ChatGPT's score of 47.4 per cent.

While in 11.3 per cent of the questions, ChatGPT was found to score higher than the student average, doing particularly well on accounting information systems (AIS) and auditing, the AI bot was found to perform worse on tax, financial, and managerial assessments. Researchers think this could possibly be because ChatGPT struggled with the mathematical processes required for the latter type.

The AI bot, which uses machine learning to generate natural language text, was further found to do better on true/false questions (68.7 per cent correct) and multiple-choice questions (59.5 per cent), but struggled with short-answer questions (between 28.7 and 39.1 per cent).

In general, the researchers said that higher-order questions were harder for ChatGPT to answer. In fact, sometimes ChatGPT was found to provide authoritative written descriptions for incorrect answers, or answer the same question different ways.

They also found that ChatGPT often provided explanations for its answers, even if they were incorrect. Other times, it went on to select the wrong multiple-choice answer, despite providing accurate descriptions.

Researchers importantly noted that ChatGPT sometimes made up facts. For example, when providing a reference, it generated a real-looking reference that was completely fabricated. The work and sometimes the authors did not even exist.

The bot was seen to also make nonsensical mathematical errors such as adding two numbers in a subtraction problem, or dividing numbers incorrectly.

Wanting to add to the intense ongoing debate about how how models like ChatGPT should factor into education, lead study author David Wood, a BYU professor of accounting, decided to recruit as many professors as possible to see how the AI fared against actual university accounting students.

His co-author recruiting pitch on social media exploded: 327 co-authors from 186 educational institutions in 14 countries participated in the research, contributing 25,181 classroom accounting exam questions.

They also recruited undergraduate BYU students to feed another 2,268 textbook test bank questions to ChatGPT. The questions covered AIS, auditing, financial accounting, managerial accounting and tax, and varied in difficulty and type (true/false, multiple choice, short answer).
Newsletter

Related Articles

Beautiful Virgin Islands
0:00
0:00
Close
UK Warns of Escalating Cyber Assault Linked to Putin’s State-Backed Operations
UK Consumer Spending Falters in November as Households Hold Back Ahead of Budget
UK Orders Fresh Review of Prince Harry’s Security Status After Formal Request
U.S. Authorises Nvidia to Sell H200 AI Chips to China Under Security Controls
Trump in Direct Assault: European Leaders Are Weak, Immigration a Disaster. Russia Is Strong and Big — and Will Win
"App recommendation" or disguised advertisement? ChatGPT Premium users are furious
"The Great Filtering": Australia Blocks Hundreds of Thousands of Minors From Social Networks
Mark Zuckerberg Pulls Back From Metaverse After $70 Billion Loss as Meta Shifts Priorities to AI
Nvidia CEO Says U.S. Data-Center Builds Take Years while China ‘Builds a Hospital in a Weekend’
Indian Airports in Turmoil as IndiGo Cancels Over a Thousand Flights, Stranding Thousands
Hollywood Industry on Edge as Netflix Secures Near-$60 Bln Loan for Warner Bros Takeover
Drugs and Assassinations: The Connection Between the Italian Mafia and Football Ultras
Hollywood megadeal: Netflix acquires Warner Bros. Discovery for 83 billion dollars
The Disregard for a Europe ‘in Danger of Erasure,’ the Shift Toward Russia: Trump’s Strategic Policy Document
Two and a Half Weeks After the Major Outage: A Cloudflare Malfunction Brings Down Multiple Sites
UK data-regulator demands urgent clarity on racial bias in police facial-recognition systems
Labour Uses Biscuits to Explain UK Debt — MPs Lean Into Social Media to Reach New Audiences
German President Lays Wreath at Coventry as UK-Germany Reaffirm Unity Against Russia’s Threat
UK Inquiry Finds Putin ‘Morally Responsible’ for 2018 Novichok Death — London Imposes Broad Sanctions on GRU
India backs down on plan to mandate government “Sanchar Saathi” app on all smartphones
King Charles Welcomes German President Steinmeier to UK in First State Visit by Berlin in 27 Years
UK Plans Major Cutback to Jury Trials as Crown Court Backlog Nears 80,000
UK Government to Significantly Limit Jury Trials in England and Wales
U.S. and U.K. Seal Drug-Pricing Deal: Britain Agrees to Pay More, U.S. Lifts Tariffs
UK Postpones Decision Yet Again on China’s Proposed Mega-Embassy in London
Head of UK Budget Watchdog Resigns After Premature Leak of Reeves’ Budget Report
Car-sharing giant Zipcar to exit UK market by end of 2025
Reports of Widespread Drone Deployment Raise Privacy and Security Questions in the UK
UK Signals Security Concerns Over China While Pursuing Stronger Trade Links
Google warns of AI “irrationality” just as Gemini 3 launch rattles markets
Top Consultancies Freeze Starting Salaries as AI Threatens ‘Pyramid’ Model
Macron Says Washington Pressuring EU to Delay Enforcement of Digital-Regulation Probes Against Meta, TikTok and X
UK’s DragonFire Laser Downs High-Speed Drones as £316m Deal Speeds Naval Deployment
UK Chancellor Rejects Claims She Misled Public on Fiscal Outlook Ahead of Budget
Starmer Defends Autumn Budget as Finance Chief Faces Accusations of Misleading Public Finances
EU Firms Struggle with 3,000-Hour Paperwork Load — While Automakers Fear De Facto 2030 Petrol Car Ban
White House launches ‘Hall of Shame’ site to publicly condemn media outlets for alleged bias
UK Budget’s New EV Mileage Tax Undercuts Case for Plug-In Hybrids
UK Government Launches National Inquiry into ‘Grooming Gangs’ After US Warning and Rising Public Outcry
Taylor Swift Extends U.K. Chart Reign as ‘The Fate of Ophelia’ Hits Six Weeks at No. 1
250 Still Missing in the Massive Fire, 94 Killed. One Day After the Disaster: Survivor Rescued on the 16th Floor
Trump: National Guard Soldier Who Was Shot in Washington Has Died; Second Soldier Fighting for His Life
UK Chancellor Reeves Defends Tax Rises as Essential to Reduce Child Poverty and Stabilise Public Finances
No Evidence Found for Claim That UK Schools Are Shifting to Teaching American English
European Powers Urge Israel to Halt West Bank Settler Violence Amid Surge in Attacks
"I Would Have Given Her a Kidney": She Lent Bezos’s Ex-Wife $1,000 — and Received Millions in Return
European States Approve First-ever Military-Grade Surveillance Network via ESA
UK to Slash Key Pension Tax Perk, Targeting High Earners Under New Budget
UK Government Announces £150 Annual Cut to Household Energy Bills Through Levy Reforms
UK Court Hears Challenge to Ban on Palestine Action as Critics Decry Heavy-Handed Measures
×