Comparing Large Language Models and Human Programmers for Generating Programming Code

Comparing Large Language Models and Human Programmers for Generating Programming Code

Abstract The performance of seven large language models (LLMs) in generating programming code using various prompt strategies, programming languages, and task difficulties is systematically evaluated. GPT‐4 substantially outperforms other LLMs, including Gemini Ultra and Claude 2. The coding perform...

Full description

Saved in:

Bibliographic Details
Main Authors:	Wenpin Hou, Zhicheng Ji
Format:	Article
Language:	English
Published:	Wiley 2025-02-01
Series:	Advanced Science
Subjects:	artificial intelligence computer programming human‐computer interaction large language models
Online Access:	https://doi.org/10.1002/advs.202412279
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Characteristics and perceived suitability of artificial intelligence-driven sports coaches: a pilot study on psychological and perceptual factors
by: Carlo Dindorf, et al.
Published: (2025-05-01)

Large language models and the future of gastroenterology: dissecting the biopolitics of data in a global health ecosystem
by: Bilal Irfan, et al.
Published: (2025-08-01)

Radiology-GPT: A large language model for radiology
by: Zhengliang Liu, et al.
Published: (2025-06-01)

Are Large Language Models Intelligent? Are Humans?
by: Olle Häggström
Published: (2023-08-01)

NeuralConstraints: integrating a neural generative model with constraint-based composition
by: Juan S. Vassallo, et al.
Published: (2025-04-01)

The potential of large language models to advance precision oncology
by: Shufan Liang, et al.
Published: (2025-05-01)

Generative AI and Large Language Models in Industry 5.0: Shaping Smarter Sustainable Cities
by: Giulio Salierno, et al.
Published: (2025-02-01)

Dual retrieving and ranking medical large language model with retrieval augmented generation
by: Qimin Yang, et al.
Published: (2025-05-01)

Worst-Case Input Generation for Concurrent Programs under Non-Monotone Resource Metrics
by: Long Pham, et al.
Published: (2024-12-01)

The application of large language models in ophthalmology
by: ZHANG Wencheng, et al.
Published: (2025-03-01)

Assessing the Accuracy of Diagnostic Capabilities of Large Language Models
by: Andrada Elena Urda-Cîmpean, et al.
Published: (2025-06-01)

Checkpoint-based rollback recovery in session programming
by: Claudio Antares Mezzina, et al.
Published: (2025-01-01)

D3: A Small Language Model for Drug-Drug Interaction prediction and comparison with Large Language Models
by: Ahmed Ibrahim, et al.
Published: (2025-06-01)

Chain-of-programming (CoP): empowering large language models for geospatial code generation task
by: Shuyang Hou, et al.
Published: (2025-08-01)

Computer programming in BASIC /
by: Myers, David L.
Published: (1991)

Fundamentals of COBOL programming.
by: Feingold, Carl
Published: (1969)

Comparative Evaluation of Teaching Plans on Prostate Cancer Generated by Various Large Language Models and a Human Expert
by: Rong Wang, et al.
Published: (2025-08-01)

Evaluating the potential risks of employing large language models in peer review
by: Lingxuan Zhu, et al.
Published: (2025-08-01)

Introduction to programming Java : with a problem solving approach /
by: Dean, John
Published: (2013)

An Introduction to Object-Oriented Programming with Java /
by: Thomas, Wu C.
Published: (2006)

Generative AI/LLMs for Plain Language Medical Information for Patients, Caregivers and General Public: Opportunities, Risks and Ethics
by: Pal A, et al.
Published: (2025-07-01)

Understanding Social Biases in Large Language Models
by: Ojasvi Gupta, et al.
Published: (2025-05-01)

MAGECODE: Machine-Generated Code Detection Method Using Large Language Models
by: Hung Pham, et al.
Published: (2024-01-01)

Impact of large language models and vision deep learning models in predicting neoadjuvant rectal score for rectal cancer treated with neoadjuvant chemoradiation
by: Hyun Bin Kim, et al.
Published: (2025-07-01)

Comparison of physician and large language model chatbot responses to online ear, nose, and throat inquiries
by: Masaomi Motegi, et al.
Published: (2025-07-01)

Applying Neutrosophic Natural Language Processing to Analyze Complex Phenomena in Interdisciplinary Contexts
by: Diego Fernando Coka Flores, et al.
Published: (2024-12-01)

Evaluating Reasoning in Large Language Models with a Modified Think-a-Number Game: Case Study
by: Petr Hoza
Published: (2025-07-01)

Impact of retrieval augmented generation and large language model complexity on undergraduate exams created and taken by AI agents
by: Erick Tyndall, et al.
Published: (2025-01-01)

Detecting Fake News in Urdu Language Using Machine Learning, Deep Learning, and Large Language Model-Based Approaches
by: Muhammad Shoaib Farooq, et al.
Published: (2025-07-01)

Fundamentals of Pascal : understanding programming and program solving /
by: Nance, Douglas W.
Published: (1990)

Large Language Models in Transportation: A Comprehensive Bibliometric Analysis of Emerging Trends, Challenges, and Future Research
by: Mahbub Hassan, et al.
Published: (2025-01-01)

Consensus on the Potential of Large Language Models in Healthcare: Insights from a Delphi Survey in Korea
by: Ah-Ram Sul, et al.
Published: (2025-04-01)

Empowering Individuals With Visual Impairment: A Digital Braille Solution for Learning the Urdu Language
by: Farzana Jabeen, et al.
Published: (2025-01-01)

The Multi-Agentization of a Dual-Arm Nursing Robot Based on Large Language Models
by: Chuanhong Fang, et al.
Published: (2025-04-01)

Rooted in and beyond interaction: A systematic review of interactive affordances of chatbots for language learning amidst the rise of large language models
by: Yunfei Du, et al.
Published: (2025-09-01)

Artificial intelligence technologies and applications for language learning and teaching
by: Son Jeong-Bae, et al.
Published: (2023-09-01)

Large language models in clinical nutrition: an overview of its applications, capabilities, limitations, and potential future prospects
by: Jamal Belkhouribchia, et al.
Published: (2025-08-01)

Artificial Intelligence and the Human–Computer Interaction in Occupational Therapy: A Scoping Review
by: Ioannis Kansizoglou, et al.
Published: (2025-05-01)

Potentials and Challenges of Large Language Models (LLMs) in the Context of Administrative Decision-Making
by: Paulina Jo Pesch, et al.
Published: (2025-03-01)

Reply: evaluating Microsoft Bing with ChatGPT-4 for the assessment of abdominal computed tomography and magnetic resonance images
by: Alperen Elek, et al.
Published: (2025-07-01)