Kattis vs ChatGPT: Assessment and Evaluation of Programming Tasks in the Age of Artificial Intelligence

Dunder, Nora; Lundborg, Saga; Wong, Jacqueline; Viberg, Olga

doi:https://doi.org/10.1145/3636555.3636882

Kattis vs ChatGPT: Assessment and Evaluation of Programming Tasks in the Age of Artificial Intelligence

Files

3636555.3636882.pdf (691.6 KB)

Publication date

2024-03-18

Authors

Dunder, Nora

Lundborg, Saga

Wong, Jacqueline

Viberg, Olga

DOI

https://doi.org/10.1145/3636555.3636882

Document Type

Part of book

Metadata

Show full item record

Collections

Utrecht University Repository

License

cc_by_nc

Abstract

AI-powered education technologies can support students and teachers in computer science education. However, with the recent developments in generative AI, and especially the increasingly emerging popularity of ChatGPT, the effectiveness of using large language models for solving programming tasks has been underexplored. The present study examines ChatGPT’s ability to generate code solutions at different difficulty levels for introductory programming courses. We conducted an experiment where ChatGPT was tested on 127 randomly selected programming problems provided by Kattis, an automatic software grading tool for computer science programs, often used in higher education. The results showed that ChatGPT independently could solve 19 out of 127 programming tasks generated and assessed by Kattis. Further, ChatGPT was found to be able to generate accurate code solutions for simple problems but encountered difficulties with more complex programming tasks. The results contribute to the ongoing debate on the utility of AI-powered tools in programming education.

Keywords

Academic Integrity, Automated Grading, ChatGPT, Programming Education, Software, Human-Computer Interaction, Computer Vision and Pattern Recognition, Computer Networks and Communications

Citation

Dunder, N, Lundborg, S, Wong, J & Viberg, O 2024, Kattis vs ChatGPT : Assessment and Evaluation of Programming Tasks in the Age of Artificial Intelligence. in LAK '24: Proceedings of the 14th Learning Analytics and Knowledge Conference. Association for Computing Machinery, pp. 821-827. https://doi.org/10.1145/3636555.3636882

URI

https://dspace.library.uu.nl/handle/1874/451788

Kattis vs ChatGPT: Assessment and Evaluation of Programming Tasks in the Age of Artificial Intelligence

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI