Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions.
This study uses the Oracle SQL certification exam questions to explore the design of automatic classifiers for exam questions containing code snippets. SQL's question classification assigns a class label in the exam topics to a question. With this classification, questions can be selected from...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2025-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0309050 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841533152924794880 |
---|---|
author | Hung-Yi Chen Po-Chou Shih Yunsen Wang |
author_facet | Hung-Yi Chen Po-Chou Shih Yunsen Wang |
author_sort | Hung-Yi Chen |
collection | DOAJ |
description | This study uses the Oracle SQL certification exam questions to explore the design of automatic classifiers for exam questions containing code snippets. SQL's question classification assigns a class label in the exam topics to a question. With this classification, questions can be selected from the test bank according to the testing scope to assemble a more suitable test paper. Classifying questions containing code snippets is more challenging than classifying questions with general text descriptions. In this study, we use factorial experiments to identify the effects of the factors of the feature representation scheme and the machine learning method on the performance of the question classifiers. Our experiment results showed the classifier with the TF-IDF scheme and Logistics Regression model performed best in the weighted macro-average AUC and F1 performance indices. The classifier with TF-IDF and Support Vector Machine performed best in weighted macro-average Precision. Moreover, the feature representation scheme was the main factor affecting the classifier's performance, followed by the machine learning method, over all the performance indices. |
format | Article |
id | doaj-art-45b0dd1e5be44d84af91c5f1fc833bc3 |
institution | Kabale University |
issn | 1932-6203 |
language | English |
publishDate | 2025-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj-art-45b0dd1e5be44d84af91c5f1fc833bc32025-01-17T05:31:26ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01201e030905010.1371/journal.pone.0309050Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions.Hung-Yi ChenPo-Chou ShihYunsen WangThis study uses the Oracle SQL certification exam questions to explore the design of automatic classifiers for exam questions containing code snippets. SQL's question classification assigns a class label in the exam topics to a question. With this classification, questions can be selected from the test bank according to the testing scope to assemble a more suitable test paper. Classifying questions containing code snippets is more challenging than classifying questions with general text descriptions. In this study, we use factorial experiments to identify the effects of the factors of the feature representation scheme and the machine learning method on the performance of the question classifiers. Our experiment results showed the classifier with the TF-IDF scheme and Logistics Regression model performed best in the weighted macro-average AUC and F1 performance indices. The classifier with TF-IDF and Support Vector Machine performed best in weighted macro-average Precision. Moreover, the feature representation scheme was the main factor affecting the classifier's performance, followed by the machine learning method, over all the performance indices.https://doi.org/10.1371/journal.pone.0309050 |
spellingShingle | Hung-Yi Chen Po-Chou Shih Yunsen Wang Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions. PLoS ONE |
title | Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions. |
title_full | Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions. |
title_fullStr | Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions. |
title_full_unstemmed | Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions. |
title_short | Exploration of designing an automatic classifier for questions containing code snippets-A case study of Oracle SQL certification exam questions. |
title_sort | exploration of designing an automatic classifier for questions containing code snippets a case study of oracle sql certification exam questions |
url | https://doi.org/10.1371/journal.pone.0309050 |
work_keys_str_mv | AT hungyichen explorationofdesigninganautomaticclassifierforquestionscontainingcodesnippetsacasestudyoforaclesqlcertificationexamquestions AT pochoushih explorationofdesigninganautomaticclassifierforquestionscontainingcodesnippetsacasestudyoforaclesqlcertificationexamquestions AT yunsenwang explorationofdesigninganautomaticclassifierforquestionscontainingcodesnippetsacasestudyoforaclesqlcertificationexamquestions |