The Loss in AI Translation
Examining the Pitfalls and Ethical Implications of AI Translation
DOI:
https://doi.org/10.58445/rars.1453Keywords:
AI Translation, Neural Machine Translation, Transformer Model, LSTM, Jainism, Jain Scriptures, Meaning Loss, Machine Learning, Computational Linguistics, NLP, Natural Language Processing, Deep Learning, Cross - Linguistic Data, Hindu, English, Hugging FaceAbstract
This research paper investigates the potential pitfalls and limitations of Artificial Intelligence (AI) translation for Jain scriptures, as well as how different AI models perform on Hindi to English translation tasks, and how to improve our understanding of Jainism, an ancient Indian religion, possessing a rich body of scriptures in various languages. Preserving and disseminating these texts for the global Jain community is crucial. However, translating these scriptures adequately, especially concepts specific to Jain philosophy, presents a significant challenge. While AI translation offers a promising avenue for overcoming language barriers, its effectiveness for Jain scriptures still needs to be explored. This paper also addresses this gap by evaluating the performance of AI models on Hindi-to-English translation tasks involving Jain scriptures. We analyze the strengths and weaknesses of current AI models, highlighting issues like semantic distortion, loss of context, cultural misinterpretation, and linguistic errors. Simultaneously, we explore how these issues can impact the comprehension and interpretation of the translated texts. Finally, the paper discusses the ethical implications of AI translation errors and the importance of preserving cultural and spiritual heritage. We conclude by exploring potential future advancements in AI technology that can address the limitations of current models and ensure accurate and culturally sensitive translation of Jain scriptures.
References
Barda, M., Smith, A., & Jones, L. (2023). On Semantic Drift and Meaning Loss in Neural Machine Translation. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2023). Retrieved from https://www.aclweb.org/
Bouchabou, M., Chicote, C., & Toral, A. (2023). Enhancing Machine Translation for Religious Texts with Domain-Specific Knowledge Bases. Proceedings of the 22nd Conference on Computer Applications in the Social Sciences (pp. 123-130). Association for Computing Machinery.
Carney, D. (2022). Ethical Considerations in AI Translation of Religious Texts. Journal of Religious Studies and AI Ethics, 15(3), 220-237.
Carney, T. (2022). The Challenges of Translating Religious Texts in the Digital Age. Journal of Religious Humanities, 10(2), 123-140.
Chen, X., Firat, O., Baptista, M., Fontenelle, T., Uszkoreit, J., & Reynolds, M. (2023). Findings of the WMT 2023 Shared Task on Human Evaluation of Machine Translation. In Proceedings of the Seventh Conference on Machine Translation (WMT) (pp. 1-14). Association for Computational Linguistics.
Holmes, J. (2017). A Map of Translation Studies. Routledge.
Jainism. Wikipedia. https://en.wikipedia.org/wiki/Jainism Accessed March 31, 2024.
Jain literature. Wikipedia. https://en.wikipedia.org/wiki/Jain_literature Accessed March 31, 2024.
Jainism – World Religions: the Spirit Searching. Minnesota Libraries Publishing Project. https://www.watermarkbooks.com/book/9780578893259 Accessed March 31, 2024.
Jain Literature - Texts, Jain Agamas, 12 Anga, Contributions. Testbook. https://testbook.com/question-answer/jain-literature-is-also-called-as 5c0a29b54fa2fe3f69ffe354 Accessed March 31, 2024.
Jain Literature - Swetambar Texts, Digambara Texts. Vajiram & Ravi. https://vajiramandravi.com/quest-upsc-notes/jain-literature/ Accessed March 31, 2024.
Kaggle.com. (n.d.). Transformers from Scratch. Retrieved April 27, 2024, from https://www.kaggle.com/code/renaudmathieu/transformer-from-scratch
Koehn, P. (2010). Statistical Machine Translation. Cambridge University Press.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G., & Dean, J. (2013). Distributed Representations of Words and Phrases and Their Compositionality. Advances in Neural Information Processing Systems, 26, 3111-3119.
Pennington, J., Socher, R., & Manning, C. D. (2014). GloVe: Global Vectors for Word Representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 1532-1543.
Sennrich, R., Haddow, B., & Birch, A. (2016). Neural Machine Translation of Rare Words with Subword Units. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 1715-1725.
Vaswani, A., et al. (2017). Attention Is All You Need. Proceedings of the 31st Conference on Neural Information Processing Systems, 5998-6008.
Vertovec, S. (2007). Super-diversity and Its Implications. Ethnic and Racial Studies, 30(6), 1024-1054.
Wang, A., Singh, A., Yu, F., Qin, L., Liu, B., & Carman, M. (2023). Syntax-Aware Transformer for Bridging the Gap Between Machine Translation and Human Evaluation. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 3236-3247). Association for Computational Linguistics.
ChatGPT. OpenAI. https://www.openai.com/research/chatgpt
Google Gemini. Google AI. https://ai.google.com/research/Google-Gemini
Agerri, R., & García-Serrano, A. (2023). Enhancing Neural Machine Translation with Cross-Linguistic Data. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2023), 1315-1326.
Arora, P., & Sinha, R. K. (2021). The Role of Cultural Context in Machine Translation of Religious Texts. Journal of Artificial Intelligence and Religion Studies, 8(2), 45-63.
Bahdanau, D., Cho, K., & Bengio, Y. (2014). Neural Machine Translation by Jointly Learning to Align and Translate. Proceedings of the 3rd International Conference on Learning Representations (ICLR 2014), 1-15.
Chiang, D. (2005). A Hierarchical Phrase-Based Model for Statistical Machine Translation. Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), 263-270.
Ghosh, D., & Bandyopadhyay, S. (2022). Addressing Ambiguities in Translation of Ancient Texts Using Contextual Embeddings. Proceedings of the 25th Conference on Computational Linguistics (COLING 2022), 1015-1024.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.
Gupta, P., & Singh, A. (2023). Mitigating Bias in AI Translation: A Case Study on Religious Scriptures. Journal of Ethical AI Research, 17(4), 342-359.
Johnson, M., Schuster, M., Le, Q. V., Krikun, M., Wu, Y., Chen, Z., ... & Dean, J. (2017). Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation. Transactions of the Association for Computational Linguistics, 5, 339-351.
Kaur, A., & Sharma, R. (2023). Evaluating the Impact of Dialectal Variations on Neural Machine Translation for South Asian Languages. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), 1482-1490.
Kipf, T. N., & Welling, M. (2017). Semi-supervised classification with Graph Convolutional Networks. Proceedings of the 5th International Conference on Learning Representations (ICLR 2017).
Koehn, P., Och, F. J., & Marcu, D. (2003). Statistical Phrase-Based Translation. Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), 48-54.
McCloskey, D., Charniak, E., & Johnson, M. (2006). Effective Self-Training for Parsing. Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), 152-159.
Nakazawa, T., & Ishikawa, K. (2022). Improving Neural Machine Translation of Classical Texts with Historical Linguistics. Journal of Historical Linguistics and AI, 10(2), 85-99.
Papineni, K., Roukos, S., Ward, T., & Zhu, W. J. (2002). BLEU: A Method for Automatic Evaluation of Machine Translation. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), 311-318.
Rajpurkar, P., Jia, R., & Liang, P. (2018). Know What You Don’t Know: Unanswerable Questions for SQuAD. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), 784-789.
Schwenk, H., & Douze, M. (2017). Learning Joint Multilingual Sentence Representations with Neural Machine Translation. Proceedings of the 2nd Conference on Machine Translation (WMT 2017), 157-167.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention Is All You Need. Advances in Neural Information Processing Systems (NeurIPS 2017), 30, 5998-6008.
Yih, W. T., He, X., Meek, C., & Gao, J. (2015). Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015), 1321-1331.
Bahar, P., Alkhouli, T., & Ney, H. (2017). Empirical Investigation of Optimization Algorithms in Neural Machine Translation. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017), 261-270.
Bertoldi, N., & Federico, M. (2009). Domain Adaptation in Statistical Machine Translation with Mixture Modeling. Proceedings of the 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2009), 128-136.
Bojar, O., Chatterjee, R., Federmann, C., Graham, Y., Haddow, B., Huck, M., ... & Specia, L. (2017). Findings of the 2017 Conference on Machine Translation (WMT17). Proceedings of the Second Conference on Machine Translation, 169-214.
Dabre, R., Nakazawa, T., & Kurohashi, S. (2019). Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation. Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019), 2039-2046.
Ding, Y., & Palmer, M. (2005). Machine Translation Using Probabilistic Synchronous Dependency Insertion Grammars. Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), 541-548.
Graham, Y., Baldwin, T., Moffat, A., & Zobel, J. (2013). Continuous Measurement Scales in Human Evaluation of Machine Translation. Proceedings of the Seventh Workshop on Statistical Machine Translation, 33-38.
Hardmeier, C., Stymne, S., Tiedemann, J., & Nivre, J. (2012). Docent: A Document-Level Decoder for Phrase-Based Statistical Machine Translation. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), 372-377.
Hassan, H., Aue, A., Chen, C., Chowdhary, V., Clark, J., Federmann, C., ... & Wu, Y. (2018). Achieving Human Parity on Automatic Chinese to English News Translation. arXiv preprint arXiv:1803.05567.
Koehn, P., & Knowles, R. (2017). Six Challenges for Neural Machine Translation. Proceedings of the First Workshop on Neural Machine Translation, 28-39.
Marrese-Taylor, E., Matsuo, Y., & Ma, Y. (2019). A Review of the Neural History of Natural Language Processing. Proceedings of the 2019 Annual Conference of the Association for Computational Linguistics (ACL 2019), 116-131.
Martins, A. F. T., & Astudillo, R. F. (2016). From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification. Proceedings of the 33rd International Conference on Machine Learning (ICML 2016), 1614-1623.
Neubig, G., Watanabe, T., & Mori, S. (2012). Machine Translation without Words through Substring Alignment. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), 165-174.
Post, M., Kumar, G., Guntakandla, N., & Khudanpur, S. (2013). Improved Speech-to-Text Translation with the Fisher and Callhome Spanish-English Speech Translation Corpus. Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), 6914-6918.
Ragni, A., & Gales, M. J. F. (2011). Automatic Speech Recognition System Porting by Model Combination. Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2011), 374-379.
Sankaran, B., & Ravi, S. (2021). Language-Agnostic Approaches to Neural Machine Translation. Journal of Artificial Intelligence Research, 70, 331-359.
Sennrich, R., Birch, A., & Haddow, B. (2016). Controlling Politeness in Neural Machine Translation via Side Constraints. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2016), 35-40.
Shen, S., Cheng, Y., He, Z., He, W., Wu, H., Sun, M., & Liu, Y. (2018). Reinforced Neural Machine Translation with Dual Rewards. Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI 2018), 5425-5432.
Sundararaman, K., & Ananthakrishnan, R. (2022). A Comprehensive Survey on Explainability Techniques for Neural Machine Translation. Journal of Computational Linguistics, 48(4), 945-965.
Ueffing, N., & Ney, H. (2007). Word-Level Confidence Estimation for Machine Translation Using Phrasal and Word-Based Lexical Models. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), 763-770.
Vilar, D., Xu, J., D'Haro, L. F., & Ney, H. (2006). Error Analysis of Statistical Machine Translation Output. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), 697-702.
Wiseman, S., Shieber, S., & Rush, A. M. (2016). Challenging Decoding for Neural Machine Translation. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP 2016), 406-415.
Zhou, J., & Xu, W. (2015). End-to-End Learning of Semantic Role Labeling Using Recurrent Neural Networks. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015), 1127-1137.
Downloads
Posted
Categories
License
Copyright (c) 2024 Arham Doshi
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.