USING LLM IN DISTANCE LEARNING: ADAPTING ENGLISH-LANGUAGE ALGEBRAIC TASKS
DOI:
https://doi.org/10.31471/1993-9965-2024-1(56)-68-78Keywords:
artificial intelligence, distance education, question generation, knowledge assessment.Abstract
The COVID-19 pandemic and the full-scale aggression against Ukraine have underscored the importance of remote learning. Consequently, the demands on both the quality of software systems and the educators who prepare lecture courses and knowledge assessments have increased. The market has already seen the emergence of many free and commercial solutions based on large language models (LLMs) such as ChatGPT, Claude, Google Gemini, etc., which are already being used by students to prepare their assignments. The integration of LLM/AI into the educational process, particularly in distance education, where the variety of tasks, their explanations, and interpretations are crucial, presents a new challenge that requires thoughtful consideration and practical implementation. One potential approach is the adaptation of well-known English-language task sets into domestic distance learning systems for Ukrainian-speaking students. However, this process will only be meaningful if the AI is capable of generating translations (adaptations) of sufficient quality to meet educational requirements. Addi-tionally, students are already attempting to use AI as an assistant in solving tasks in exact sciences, but the accuracy of AI solutions for tasks in the Ukrainian language remains underexplored. The aim of this article is to analyze the current possibilities for distance education in the context of the active proliferation of LLM-based systems regarding the adaptation of English-language school-level algebra tasks. The objectives of the article are as follows: 1) to analyze the ability of two popular LLM models to generate translations (adaptations) of algebraic tasks according to specified rules; 2) to analyze the ability of LLM models to solve tasks in their original form; 3) to analyze the ability of LLM models to solve tasks in their adapted form; 4) to analyze the ability of LLM to explain the solution process of tasks in detail in Ukrainian. The primary source for verification was the GSM8K task set (https://github.com/openai/grade-school-math, a collection of school-level math problems created by a professional teachers). Subsequently, ChatGPT v4 / v4 Omni and Gemini 1.5 PRO Preview adapted 200 tasks from this set into Ukrainian, replacing English names with Ukrainian ones and converting units of measurement (without actual recalculation, simply for convenience and understanding by students, e.g., "gallon" became "liter" and, US dollar — hryvnia, etc.) To assess the quality of the Ukrainian adaptation, the Open Source software, called LanguageTool (https://languagetool.org/) was used. Additionally, a human conducted a visual check for the correctness of the adaptation of selected questions through random sampling from the translated data set. The adapted tasks had to be understood not only in English but also in Ukrainian and solved by the LLM models. The results were also compared with the reference answers from the set. The findings were as follows: 1) both LLM models demonstrated almost identical grammatical quality of the Ukrainian translation — about 70%. 2) LLMs demonstrated solution accuracy of the original tasks at the level of 85% (ChatGPT v4), 93.5% (ChatGPT v4 Omni), and 75% (Gemini) respectively. There is a clear dependence not only on the structure of the prompt for the system, but also on the volume of output and the level of detail required by the system. If the LLM performs several different tasks (translation/solution) within one request, the results of mathematical calculations will be much worse than when the LLM is only required to show the detailed path to the final result. ChatGPT on the English variant showed 93.5% (85%) of solved tasks, which correlates well with the results of other studies in this field; 3) for the Ukrainian variant (ChatGPT adaptation), the accuracy differed by 5% from the original formulation (after analyzing typical errors and correcting the adaptation). For Gemini, the English variant showed ~25%, while the Ukrainian (own adaptation and ChatGPT adaptation) showed ~50% errors; 4) both systems demonstrated the ability to explain the solution path of the task, but require external control of the results. The development of LLMs shows enormous potential for this technology in distance learning systems in the Ukrainian language. Significant time savings in preparing numerous uniform tasks from English sources. AI demonstrates the ability to analyze tasks, and the quality of the solution does not significantly depend on the language of the original task. Right now, human control over the results of calculations proposed by AI is still required and should be considered mandatory.
Downloads
References
Leontieva I. V. ChatGPT v osvitnomu protsesi vyshchoi shkoly: zaboronyty ne mozhna vykorystovuvaty. Osvita ta pedahohichna nauka. 2023. No 1 (183). P. 13–23. URL: http://eps.luguniv.edu.ua/index.php/eps/issue/view/17 (date of access: 30.04.2024) [in Ukrainian]
Balyk N. R., Shmyher H. P. Vprova-dzhennia shtuchnoho intelektu v osvitu shliakhom vykorystannia ChatGPT. Aktualni aspekty rozvytku STEAM-osvity v umovakh yevrointehratsii : zb. materialiv Mizhnar. nauk.-prakt. internet-konf., m. Kropyvnytskyi, 21 kvit. 2013 r. 2023. P. 147–149. URL: http://dspace.tnpu.edu.ua/handle/123456789/28824 (date of access: 30.04.2024) [in Ukrainian]
Ivashkevych I. Zastosuvannia movnoi modeli ChatGPT dlia perekladu: perevahy ta nedo-liky. Dyskurs profesiinoi i tvorchoi komunikatsii: linhvokulturnyi, kohnityvnyi, perekladatskyi ta metodychnyi aspekty : zb. materialiv VIII Mizhnar. nauk.prakt. konf., m. Kyiv, 18–19 trav. 2023 r. Kyiv, 2023. P. 110–113. URL: https://ktppnm.kpi.ua/sites/default/files/2023-11/DPTK_blok_tezy_18_05_23.pdf (date of access: 30.04.2024) [in Ukrainian]
Taranovskyi A. O., Samoilov V. D. ChatGPT i mozhlyvist yoho vykorystannia dlia bezekspertnoho stvorennia testiv. Elektronnoe Modelirovanie. 2023. T. 45, No 2. P. 44–60. DOI: https://doi.org/10.15407/emodel.45.02.044 (date of access: 30.04.2024) [in Ukrainian]
Shuliak I. M. Chat-GPT u teorii y praktytsi perekladu: vyklyky y perspektyvy. Mizhkulturna komunikatsiia i perekladoznavstvo: tochky dotyku ta perspektyvy rozvytku: materialy VI Mizhnar. nauk.-prakt. konf.: zb. nauk. pr., m. Pereiaslav, 12–13 trav. 2023 r. Pereiaslav, 2023. P. 101–104. URL: http://dspace.tnpu.edu.ua/handle/123456789/31746 (date of access: 30.04.2024[in Ukrainian]
Odukha V. S. Sposib vyznachennia yakosti avtomatyzovanoho perekladu z vykorystanniam neironnoi merezhi : kvalifikatsiina robota baka-lavra: 122. Khmelnytskyi, 2023. 83 p. URL: https://elar.khmnu.edu.ua/handle/123456789/13796 (date of access: 30.04.2024) [in Ukrainian]
Huangetal J. Large Language Models Cannot Self-Correct Reasoning Yet / URL: https://arxiv.org/abs/2310.01798 (date of access: 03.06.2024).
Cobbeetal K. Training Verifiersto Solve Math Word Problems / URL: https://arxiv.org/abs/2110.14168 (date of access: 30.04.2024).
Fedorovych I. Protses onovlennia slovnyka ukrainskoi movy dlia morfolohichnoho analizatora Pymorphy2. VI Mizhnarodna studentska naukovo - tekhnichna konferentsiia "Pryrodnychi ta huma-nitarni nauky. Aktualni pytannia": materialy Mizhnar. student. naukovo-tekhn. konf., m. Kyiv, 28–29 kvit. 2022 r. Kyiv, 2022. P. 187–188. URL: https://elartu.tntu.edu.ua/bitstream/lib/41264/2/187-188.pdf (date of access: 03.06.2024) [in Ukrainian]
Minhalova Yu. Ohliad on-lain servisiv, shcho varto vykorystovuvaty dlia samoredahuvannia naukovoho tekstu. Aktualni pytannia suchasnoi informatyky. 2019. P. 122–126. URL: http://eprints.zu.edu.ua/id/eprint/28104 (date of access: 03.06.2024) [in Ukrainian]
Pobudova slovnyka predmetnoi oblasti na osnovi avtomatyzovanoho analizu tekstiv ukrainskoiu movoiu / O. Kunhurtsev ta in. Tekhnichni nauky ta tekhnolohii. 2016. No 3 (5). P. 164–174. URL: http://nbuv.gov.ua/UJRN/tnt_2016_3_21 (date of access: 03.06.2024) [in Ukrainian]
Automatic correction system of english-ukrainian computer translation for technical texts in the field of automation of technological processes / A. O. Stopakevych ta in. Informatics and mathematical methods in simulation. 2023. Vol. 13, No 3-4. P. 322–334. DOI: https://doi.org/10.15276/imms.v13.no3-4.322 (date of access: 03.06.2024).
ChatGPT v4o (12 chervnia 2024 r.). Pokazhy detalnyi rozviazok zadachi: Shchodnia Liudmyla hoduie kozhnu zi svoikh kurei troma skliankamy zmishanoho kuriachoho kormu, shcho mistyt nasinnia, khrobakiv ta ovochi, shchob zberehty yikh zdorovia. Vona daie kuriam korm trychi na den. Vrantsi vona daie svoiemu stadu kurei 15 sklianok kormu. Vden vona daie svoim kuriam shche 25 sklianok kormu. Skilky sklianok kormu vona povynna daty svoim kuriam u ostannii pryiom yizhi na den, yakshcho rozmir stada Liudmyly stanovyt 20 kurei? Dzherelo: https://chatgpt.com/share/922ef0ca-5aba-41a0-9620-5fddbc79cf3d [in Ukrainian]
Downloads
Published
How to Cite
Issue
Section
License
Авторські права....