Evaluation of Machine Translation Systems: A Literature Review on ChatGPT and Google Translate
DOI:
https://doi.org/10.24256/ideas.v13i1.6236Keywords:
: Machine Translation, ChatGPT, Google Translate, Comparative Analysis, FluencyAbstract
Abstract: This is a literature review discussing 15 selected papers about ChatGPT and Google Translate study results based on keyword analysis and publication year. We applied descriptive data analysis technique to analyze the data. We selected Studies on translation performance of natural language processing tools were chosen due to their increasing prominence and diverse applications, ranging from literary to technical translations. The data for this literature review was retrieved from Scopus and Google Scholar. The search was limited to the last five years to ensure the inclusion of recent advancements, particularly those reflecting improvements in ChatGPT’s GPT-4 engine and updates in Google Translate’s neural machine translation capabilities. The results showed that ChatGPT excels in fluency and contextual understanding, particularly in literary and poetic translations, outperforming Google Translate in maintaining stylistic elements and complex language structures. Both systems demonstrated strengths in specialized translations, with ChatGPT showing notable proficiency in medical literature and technical texts. However, challenges remained in low-resource languages and specialized domains, requiring further training and development. Despite technological advancements, human translators are essential for achieving culturally nuanced translations. This study has some implications for future implementing for enhancement contextual understanding, improving accuracy for low-resource languages, and addressing specific error patterns through ongoing research and collaborative efforts between human translators and machine translation tools. These recommendations aim to optimize the performance of ChatGPT and Google Translate, thereby ensuring more accurate and contextually appropriate translations across various fields.
References
Abdulmohsen Alosaimi, B., & Abdulaziz Alawad, N. (2024). Evaluation of the Translation of Separable Phrasal Verbs Generated by ChatGPT. Arab World English Journal, 1(1), 282–291. https://doi.org/10.24093/awej/chatgpt.19
Akula, B., Barrault, L., Gonzalez, G. M., Hansanti, P., & Hoffman, J. (2020). No Language Left Behind: Scaling Human-Centered Machine Translation—Meta Research.
Ali, A., & Pandya, S. (2021). A four-stage framework for the development of a research problem statement in doctoral dissertations. International Journal of Doctoral Studies, 16, 469–485. https://doi.org/10.28945/4839
Almahasees, Z., & Mahmoud, S. (2022). Evaluation of Google Image Translate in Rendering Arabic Signage into English. World Journal of English Language, 12(1), 185–197. https://doi.org/10.5430/wjel.v12n1p185
Bani, M., & Masruddin, M. (2021). Development of an Android-based harmonic oscillation pocket book for senior high school students. JOTSE: Journal of Technology and Science Education, 11(1), 93-103.
Bonyadi, A. (2020). Exploring Linguistic Modifications of Machine-Translated Literary Articles: The Case of Google Translate. Journal of Foreign Language Teaching and Translation Studies, 5(3), 93–106. https://doi.org/10.22034/efl.2020.250576.1057
Cerf, V. G. (2023). Large Language Models. Communications of the ACM, 66(8), 7. https://doi.org/10.1145/3606337
Deb, D., Dey, R., & Balas, V. E. (2019). Literature review and technical reading. Intelligent Systems Reference Library, 153, 9–21. https://doi.org/10.1007/978-981-13-2947-0_2
Gabashvili, I. S. (2023). The impact and applications of ChatGPT: A Systematic Review of Literature Reviews. Aurametrix, April 2023. https://doi.org/10.17605/OSF.IO/87U6Q.Keywords
Gao, R., Lin, Y., Zhao, N., & Cai, Z. G. (2024). Machine translation of Chinese classical poetry: a comparison among ChatGPT, Google Translate, and DeepL Translator. Humanities and Social Sciences Communications, 11(1), 1–10. https://doi.org/10.1057/s41599-024-03363-0
Gao, Y., Wang, R., & Hou, F. (2023). How to Design Translation Prompts for ChatGPT: An Empirical Study.
Garg, A., & Agarwal, M. (2018). Machine Translation: A Literature Review.
GOOD, G. (2015). 済無No Title No Title No Title. Angewandte Chemie International Edition, 6(11), 951–952., 1(April).
Hariri, W. (2023). Unlocking the Potential of ChatGPT: A Comprehensive Exploration of its Applications, Advantages, Limitations, and Future Directions in Natural Language Processing.
Hendy, A., Abdelrehim, M., Sharaf, A., Raunak, V., Gabr, M., Matsushita, H., Kim, Y. J., Afify, M., & Awadalla, H. H. (2023). How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation.
Ismayanti, D., Said, Y. R., Usman, N., & Nur, M. I. (2024). The Students Ability in Translating Newspaper Headlines into English: A Case Study. IDEAS: Journal on English Language Teaching and Learning, Linguistics and Literature, 12(1), 108-131.
Jiao, W., Huang, J. T., Wang, W., He, Z., Liang, T., Wang, X., Shi, S., & Tu, Z. (2023). ParroT: Translating during Chat using Large Language Models Tuned with Human Translation and Feedback. Findings of the Association for Computational Linguistics: EMNLP 2023, 15009–15020. https://doi.org/10.18653/v1/2023.findings-emnlp.1001
Jiao, W., Wang, W., Huang, J., Wang, X., Shi, S., & Tu, Z. (2023). Is ChatGPT a Good Translator? Yes, with GPT-4 as the engine.
Kadaoui, K., Magdy, S. M., Waheed, A., Khondaker, M. T. I., El-Shangiti, A. O., Nagoudi, E. M. B., & Abdul-Mageed, M. (2023). TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties. ArabicNLP 2023: 1st Arabic Natural Language Processing Conference, Proceedings, ArabicNLP, 52–75. https://doi.org/10.18653/v1/2023.arabicnlp-1.6
Khan, N. A., Osmonaliev, K., & Sarwar, M. Z. (2023). Pushing the Boundaries of Scientific Research with the Use of Artificial Intelligence Tools: Navigating Risks and Unleashing Possibilities. Nepal Journal of Epidemiology, 13(1), 1258–1263. https://doi.org/10.3126/nje.v13i1.53721
KOÇER GÜLDAL, B., & İŞİSAĞ, K. U. (2019). A comparative study on Google Translate: An error analysis of Turkish-to-English translations in terms of the text typology of Katherina Reiss. RumeliDE Dil ve Edebiyat Araştırmaları Dergisi, 5, 367–376. https://doi.org/10.29000/rumelide.606217
Li, H., Graesser, A. C., & Cai, Z. (2014). Comparison of Google translation with human translation. Proceedings of the 27th International Florida Artificial Intelligence Research Society Conference, FLAIRS 2014, 190–195.
Liu, Y., Han, T., Ma, S., Zhang, J., Yang, Y., Tian, J., He, H., Li, A., He, M., Liu, Z., Wu, Z., Zhao, L., Zhu, D., Li, X., Qiang, N., Shen, D., Liu, T., & Ge, B. (2023). Summary of ChatGPT-related research and perspective towards the future of large language models. Meta-Radiology, 1(2), 100017. https://doi.org/10.1016/j.metrad.2023.100017
Masruddin, M. (2019). Efficacy Of Using Spelling Bee Game In Teaching Vocabulary To Indonesian English As Foreign Language (Efl) Students, The Asian Efl Journal.
Masruddin, Hartina, S., Arifin, M. A., & Langaji, A. (2024). Flipped learning: facilitating student engagement through repeated instruction and direct feedback. Cogent Education, 11(1), 2412500.
Nila, Firda, S., & Susanto, T. (2017). Google Translate impacts students’ translation of economic text: accuracy and acceptability. 6th ELTLT International Conference Proceedings, October, 487–491.
Noviarini, T. (2021). the Translation Results of Google Translate From Indonesian To English. Jurnal Smart, 7(1), 21–26. https://doi.org/10.52657/js.v7i1.1335
Peng, K., Ding, L., Zhong, Q., Shen, L., Liu, X., Zhang, M., Ouyang, Y., & Tao, D. (2023). Towards Making the Most of ChatGPT for Machine Translation. Findings of the Association for Computational Linguistics: EMNLP 2023, 5622–5633. https://doi.org/10.2139/ssrn.4390455
Sepesy Maučec, M., & Donaj, G. (2020). Machine Translation and the Evaluation of Its Quality. Recent Trends in Computational Intelligence, 1–20. https://doi.org/10.5772/intechopen.89063
Siu, S. C. (2023). ChatGPT and GPT-4 for Professional Translators: Exploring the Potential of Large Language Models in Translation. SSRN Electronic Journal, 1–36. https://doi.org/10.2139/ssrn.4448091
Stevanović, I., & Radičević, L. (2020). Comparative Analysis of Machine Translation Systems. International Journal of Computer Applications, 12(2), 5–8.
Temsah, O., Khan, S. A., Chaiah, Y., Senjab, A., Alhasan, K., Jamal, A., Aljamaan, F., Malki, K. H., Halwani, R., Al-Tawfiq, J. A., Temsah, M.-H., & Al-Eyadhy, A. (2023). Overview of Early ChatGPT’s Presence in Medical Literature: Insights From a Hybrid Literature Review by ChatGPT and Human Experts. Cureus, 15(4). https://doi.org/10.7759/cureus.37281
Wang, L., Lyu, C., Ji, T., Zhang, Z., Yu, D., Shi, S., & Tu, Z. (2023). Document-Level Machine Translation with Large Language Models. EMNLP 2023—2023 Conference on Empirical Methods in Natural Language Processing, Proceedings, March, 16646–16661. https://doi.org/10.18653/v1/2023 .emnlp-main.1036
Yahya, A., Husnaini, H., & Putri, N. I. W. (2024). Developing Common Expressions Book in Indonesian Traditional Market in Three Languages (English-Indonesian-Mandarin). Language Circle: Journal of Language and Literature, 18(2), 288-295.
Downloads
Published
Issue
Section
Citation Check
License
Copyright (c) 2025 Ohod Faisal Ahmed, Ida Kusuma Dewi, Mohammad Yunus Anis

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under an Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See the Effect of Open Access)