MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Aljagthami, Aamer, Banabila, Mohammed, Alshehri, Musab, Kabini, Mohammed, Alahmadi, Mohammad D.
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Software Engineering
Accesso online:	https://arxiv.org/abs/2509.12973
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866909791276761088
author	Aljagthami, Aamer Banabila, Mohammed Alshehri, Musab Kabini, Mohammed Alahmadi, Mohammad D.
author_facet	Aljagthami, Aamer Banabila, Mohammed Alshehri, Musab Kabini, Mohammed Alahmadi, Mohammad D.
contents	Large language models (LLMs) have shown promise for automated source-code translation, a capability critical to software migration, maintenance, and interoperability. Yet comparative evidence on how model choice, prompt design, and prompt language shape translation quality across multiple programming languages remains limited. This study conducts a systematic empirical assessment of state-of-the-art LLMs for code translation among C++, Java, Python, and C#, alongside a traditional baseline (TransCoder). Using BLEU and CodeBLEU, we quantify syntactic fidelity and structural correctness under two prompt styles (concise instruction and detailed specification) and two prompt languages (English and Arabic), with direction-aware evaluation across language pairs. Experiments show that detailed prompts deliver consistent gains across models and translation directions, and English prompts outperform Arabic by 13-15%. The top-performing model attains the highest CodeBLEU on challenging pairs such as Java to C# and Python to C++. Our evaluation shows that each LLM outperforms TransCoder across the benchmark. These results demonstrate the value of careful prompt engineering and prompt language choice, and provide practical guidance for software modernization and cross-language interoperability.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_12973
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Evaluating Large Language Models for Code Translation: Effects of Prompt Language and Prompt Design Aljagthami, Aamer Banabila, Mohammed Alshehri, Musab Kabini, Mohammed Alahmadi, Mohammad D. Software Engineering Large language models (LLMs) have shown promise for automated source-code translation, a capability critical to software migration, maintenance, and interoperability. Yet comparative evidence on how model choice, prompt design, and prompt language shape translation quality across multiple programming languages remains limited. This study conducts a systematic empirical assessment of state-of-the-art LLMs for code translation among C++, Java, Python, and C#, alongside a traditional baseline (TransCoder). Using BLEU and CodeBLEU, we quantify syntactic fidelity and structural correctness under two prompt styles (concise instruction and detailed specification) and two prompt languages (English and Arabic), with direction-aware evaluation across language pairs. Experiments show that detailed prompts deliver consistent gains across models and translation directions, and English prompts outperform Arabic by 13-15%. The top-performing model attains the highest CodeBLEU on challenging pairs such as Java to C# and Python to C++. Our evaluation shows that each LLM outperforms TransCoder across the benchmark. These results demonstrate the value of careful prompt engineering and prompt language choice, and provide practical guidance for software modernization and cross-language interoperability.
title	Evaluating Large Language Models for Code Translation: Effects of Prompt Language and Prompt Design
topic	Software Engineering
url	https://arxiv.org/abs/2509.12973

Documenti analoghi