Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions.

Bibliographic Details
Main Authors: Hung-Yi Chen, Po-Chou Shih, Yunsen Wang
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2025-01-01
Series: PLoS ONE
Online Access: https://doi.org/10.1371/journal.pone.0309050
collection DOAJ
description This study uses Oracle SQL certification exam questions to explore the design of automatic classifiers for exam questions that contain code snippets. Question classification assigns each question a class label drawn from the exam topics. With these labels, questions can be selected from the test bank according to the testing scope to assemble a more suitable test paper. Classifying questions that contain code snippets is more challenging than classifying questions made up of general text descriptions. We use factorial experiments to identify how two factors, the feature representation scheme and the machine learning method, affect the performance of the question classifiers. The experimental results show that the classifier combining the TF-IDF scheme with a Logistic Regression model performed best on the weighted macro-average AUC and F1 performance indices, while the classifier combining TF-IDF with a Support Vector Machine performed best on weighted macro-average Precision. Across all performance indices, the feature representation scheme was the main factor affecting classifier performance, followed by the machine learning method.
id doaj-art-45b0dd1e5be44d84af91c5f1fc833bc3
institution Kabale University
issn 1932-6203
spelling doaj-art-45b0dd1e5be44d84af91c5f1fc833bc3 | 2025-01-17T05:31:26Z | eng | Public Library of Science (PLoS) | PLoS ONE | ISSN 1932-6203 | 2025-01-01 | vol. 20, no. 1, e0309050 | DOI 10.1371/journal.pone.0309050 | Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions. | Hung-Yi Chen; Po-Chou Shih; Yunsen Wang | https://doi.org/10.1371/journal.pone.0309050