OpenAI’s ChatGPT answered about 52 percent of software engineering questions incorrectly, according to a study, raising questions about the popular language model’s accuracy.
Despite ChatGPT’s popularity, there hasn’t been a thorough investigation into the quality and usability of its responses to software engineering queries, said researchers from Purdue University in the US.
To address this gap, the team undertook a comprehensive analysis of ChatGPT’s replies to 517 questions from Stack Overflow (SO).
“Our examination revealed that 52 percent of ChatGPT’s answers contain inaccuracies and 77 percent are verbose,” the researchers wrote in the paper, which has not been peer-reviewed and was published on a pre-print site.
Importantly, the team found that 54 percent of the time, errors arose because ChatGPT did not understand the concepts behind the questions.
Even when it could understand the question, it failed to show an understanding of how to solve the problem, contributing to a high number of conceptual errors, they said.
Further, the researchers observed limitations in ChatGPT’s reasoning.
“In many cases, we saw ChatGPT give a solution, code, or formula without foresight or thinking about the outcome,” they said.
“Prompt engineering and human-in-the-loop fine-tuning can be helpful in probing ChatGPT to understand a problem to some extent, but they are still insufficient when it comes to injecting reasoning into the LLM. Hence it is essential to understand the factors behind conceptual errors as well as fix the errors originating from the limitations of reasoning,” they added.
Moreover, ChatGPT also suffers from other quality issues such as verbosity and inconsistency. The in-depth manual analysis pointed to a large number of conceptual and logical errors in ChatGPT’s answers, while the linguistic analysis showed that its answers are very formal and rarely portray negative sentiment.
Nevertheless, users still preferred ChatGPT’s responses 39.34 percent of the time, owing to their comprehensiveness and articulate language style.
“These findings underscore the need for meticulous error correction in ChatGPT while also raising awareness among users about the potential risks associated with seemingly accurate answers,” the researchers said.
Bijay Pokharel