Volume no :
9 |Issue no :
1Article Type :
Scholarly ArticleAuthor :
Arul Selvan MPublished Date :
June, 2025Publisher :
Journal of Artificial Intelligence and Cyber Security (JAICS)
Page No: 1 - 11
Abstract : The rapid advancement of large language models (LLMs) has opened new avenues for addressing the challenges faced by low-resource and endangered languages in the realms of translation, understanding, and preservation. Despite significant progress in natural language processing, most existing models predominantly focus on high-resource languages with abundant digital data, leaving many minority and endangered languages underserved. This work explores the potential of multilingual LLMs to bridge this gap by leveraging transfer learning, cross-lingual embeddings, and few-shot learning techniques to enhance the translation quality and linguistic understanding of low-resource languages. By training on diverse multilingual corpora, these models can capture shared linguistic features and semantic patterns that facilitate knowledge transfer from resource-rich to resource-scarce languages. Furthermore, we investigate methodologies to incorporate linguistic typology, phonetics, and cultural context into LLM architectures, thereby improving their capacity to represent unique language structures and idiomatic expressions that are often lost in traditional translation pipelines. Beyond translation, the models can assist in language documentation efforts by generating annotated corpora, supporting automated transcription of oral histories, and enabling interactive language learning tools tailored for endangered language communities. We also emphasize ethical considerations surrounding data collection, model biases, and community involvement, advocating for collaborative frameworks that empower native speakers and respect cultural sensitivities. Experimental evaluations demonstrate promising improvements in translation accuracy and semantic comprehension for selected low-resource languages, while case studies highlight the models’ utility in revitalizing languages at risk of extinction through digital content creation and accessible educational resources. Challenges such as data scarcity, domain mismatch, and preservation of linguistic diversity remain, prompting ongoing research into adaptive fine-tuning, active learning, and multimodal integration including speech and text. This work contributes to the broader agenda of digital inclusivity by promoting technological tools that not only facilitate communication across linguistic boundaries but also foster the preservation and revival of cultural heritage embedded in endangered languages. Ultimately, the integration of multilingual LLMs in low-resource language contexts offers a scalable and sustainable approach to maintaining global linguistic diversity, ensuring that minority voices continue to thrive in an increasingly interconnected digital world.
Keyword multilingual large language models, low-resource languages, endangered language preservation, machine translation, cross-lingual transfer learning, language documentation
Reference:
- Jeyaprabha, B., & Sundar, C. (2021). The mediating effect of e-satisfaction on e-service quality and e-loyalty link in securities brokerage industry. Revista Geintec-gestao Inovacao E Tecnologias, 11(2), 931-940.
- Jeyaprabha, B., & Sunder, C. What Influences Online Stock Traders’ Online Loyalty Intention? The Moderating Role of Website Familiarity. Journal of Tianjin University Science and Technology.
- Jeyaprabha, B., Catherine, S., & Vijayakumar, M. (2024). Unveiling the Economic Tapestry: Statistical Insights Into India’s Thriving Travel and Tourism Sector. In Managing Tourism and Hospitality Sectors for Sustainable Global Transformation(pp. 249-259). IGI Global.
- JEYAPRABHA, B., & SUNDAR, C. (2022). The Psychological Dimensions Of Stock Trader Satisfaction With The E-Broking Service Provider. Journal of Positive School Psychology, 3787-3795.
- Nadaf, A. B., Sharma, S., & Trivedi, K. K. (2024). CONTEMPORARY SOCIAL MEDIA AND IOT BASED PANDEMIC CONTROL: A ANALYTICAL APPROACH. Weser Books, 73.
- Trivedi, K. K. (2022). A Framework of Legal Education towards Litigation-Free India. Issue 3 Indian JL & Legal Rsch., 4, 1.
- Trivedi, K. K. (2022). HISTORICAL AND CONCEPTUAL DEVELOPMENT OF PARLIAMENTARY PRIVILEGES IN INDIA.
- Himanshu Gupta, H. G., & Trivedi, K. K. (2017). International water clashes and India (a study of Indian river-water treaties with Bangladesh and Pakistan).
- Nair, S. S., Lakshmikanthan, G., Kendyala, S. H., & Dhaduvai, V. S. (2024, October). Safeguarding Tomorrow-Fortifying Child Safety in Digital Landscape. In 2024 International Conference on Computing, Sciences and Communications (ICCSC)(pp. 1-6). IEEE.
- Lakshmikanthan, G., Nair, S. S., Sarathy, J. P., Singh, S., Santiago, S., & Jegajothi, B. (2024, December). Mitigating IoT Botnet Attacks: Machine Learning Techniques for Securing Connected Devices. In 2024 International Conference on Emerging Research in Computational Science (ICERCS)(pp. 1-6). IEEE.
- Nair, S. S. (2023). Digital Warfare: Cybersecurity Implications of the Russia-Ukraine Conflict. International Journal of Emerging Trends in Computer Science and Information Technology, 4(4), 31-40.
- Mahendran, G., Kumar, S. M., Uvaraja, V. C., & Anand, H. (2025). Effect of wheat husk biogenic ceramic Si3N4 addition on mechanical, wear and flammability behaviour of castor sheath fibre-reinforced epoxy composite. Journal of the Australian Ceramic Society, 1-10.
- Mahendran, G., Mageswari, M., Kakaravada, I., & Rao, P. K. V. (2024). Characterization of polyester composite developed using silane-treated rubber seed cellulose toughened acrylonitrile butadiene styrene honey comb core and sunn hemp fiber. Polymer Bulletin, 81(17), 15955-15973.
- Mahendran, G., Gift, M. M., Kakaravada, I., & Raja, V. L. (2024). Load bearing investigations on lightweight rubber seed husk cellulose–ABS 3D-printed core and sunn hemp fiber-polyester composite skin building material. Macromolecular Research, 32(10), 947-958.
- Chunara, F., Dehankar, S. P., Sonawane, A. A., Kulkarni, V., Bhatti, E., Samal, D., & Kashwani, R. (2024). Advancements In Biocompatible Polymer-Based Nanomaterials For Restorative Dentistry: Exploring Innovations And Clinical Applications: A Literature Review. African Journal of Biomedical Research, 27(3S), 2254-2262.
- Prova, Nuzhat Noor Islam. “Healthcare Fraud Detection Using Machine Learning.” 2024 Second International Conference on Intelligent Cyber Physical Systems and Internet of Things (ICoICI). IEEE, 2024.
- Prova, N. N. I. (2024, August). Garbage Intelligence: Utilizing Vision Transformer for Smart Waste Sorting. In 2024 Second International Conference on Intelligent Cyber Physical Systems and Internet of Things (ICoICI)(pp. 1213-1219). IEEE.
- Prova, N. N. I. (2024, August). Advanced Machine Learning Techniques for Predictive Analysis of Health Insurance. In 2024 Second International Conference on Intelligent Cyber Physical Systems and Internet of Things (ICoICI)(pp. 1166-1170). IEEE.
- Vijayalakshmi, K., Amuthakkannan, R., Ramachandran, K., & Rajkavin, S. A. (2024). Federated Learning-Based Futuristic Fault Diagnosis and Standardization in Rotating Machinery. SSRG International Journal of Electronics and Communication Engineering, 11(9), 223-236.
- Devi, K., & Indoria, D. (2021). Digital Payment Service In India: A Review On Unified Payment Interface. J. of Aquatic Science, 12(3), 1960-1966.
- Kumar, G. H., Raja, D. K., Varun, H. D., & Nandikol, S. (2024, November). Optimizing Spatial Efficiency Through Velocity-Responsive Controller in Vehicle Platooning. In 2024 8th International Conference on Computational System and Information Technology for Sustainable Solutions (CSITSS)(pp. 1-5). IEEE.
- Vidhyasagar, B. S., Harshagnan, K., Diviya, M., & Kalimuthu, S. (2023, October). Prediction of Tomato Leaf Disease Plying Transfer Learning Models. In IFIP International Internet of Things Conference(pp. 293-305). Cham: Springer Nature Switzerland.
- Sivakumar, K., Perumal, T., Yaakob, R., & Marlisah, E. (2024, March). Unobstructive human activity recognition: Probabilistic feature extraction with optimized convolutional neural network for classification. In AIP Conference Proceedings(Vol. 2816, No. 1). AIP Publishing.
- Kalimuthu, S., Perumal, T., Yaakob, R., Marlisah, E., & Raghavan, S. (2024, March). Multiple human activity recognition using iot sensors and machine learning in device-free environment: Feature extraction, classification, and challenges: A comprehensive review. In AIP Conference Proceedings(Vol. 2816, No. 1). AIP Publishing.
- Bs, V., Madamanchi, S. C., & Kalimuthu, S. (2024, February). Early Detection of Down Syndrome Through Ultrasound Imaging Using Deep Learning Strategies—A Review. In 2024 Second International Conference on Emerging Trends in Information Technology and Engineering (ICETITE)(pp. 1-6). IEEE.
- Kalimuthu, S., Ponkoodanlingam, K., Jeremiah, P., Eaganathan, U., & Juslen, A. S. A. (2016). A comprehensive analysis on current botnet weaknesses and improving the security performance on botnet monitoring and detection in peer-to-peer botnet. Iarjset, 3(5), 120-127.
- Kumar, T. V. (2023). REAL-TIME DATA STREAM PROCESSING WITH KAFKA-DRIVEN AI MODELS.
- Kumar, T. V. (2023). Efficient Message Queue Prioritization in Kafka for Critical Systems.
- Kumar, T. V. (2022). AI-Powered Fraud Detection in Real-Time Financial Transactions.
- Kumar, T. V. (2021). NATURAL LANGUAGE UNDERSTANDING MODELS FOR PERSONALIZED FINANCIAL SERVICES.
- Kumar, T. V. (2020). Generative AI Applications in Customizing User Experiences in Banking Apps.
- Kumar, T. V. (2020). FEDERATED LEARNING TECHNIQUES FOR SECURE AI MODEL TRAINING IN FINTECH.
- Kumar, T. V. (2015). CLOUD-NATIVE MODEL DEPLOYMENT FOR FINANCIAL APPLICATIONS.
- Kumar, T. V. (2018). REAL-TIME COMPLIANCE MONITORING IN BANKING OPERATIONS USING AI.
- Raju, P., Arun, R., Turlapati, V. R., Veeran, L., & Rajesh, S. (2024). Next-Generation Management on Exploring AI-Driven Decision Support in Business. In Optimizing Intelligent Systems for Cross-Industry Application(pp. 61-78). IGI Global.
- Turlapati, V. R., Thirunavukkarasu, T., Aiswarya, G., Thoti, K. K., Swaroop, K. R., & Mythily, R. (2024, November). The Impact of Influencer Marketing on Consumer Purchasing Decisions in the Digital Age Based on Prophet ARIMA-LSTM Model. In 2024 International Conference on Integrated Intelligence and Communication Systems (ICIICS)(pp. 1-6). IEEE.
- Sreekanthaswamy, N., Anitha, S., Singh, A., Jayadeva, S. M., Gupta, S., Manjunath, T. C., & Selvakumar, P. (2025). Digital Tools and Methods. Enhancing School Counseling With Technology and Case Studies, 25.
- Sreekanthaswamy, N., & Hubballi, R. B. (2024). Innovative Approaches To Fmcg Customer Journey Mapping: The Role Of Block Chain And Artificial Intelligence In Analyzing Consumer Behavior And Decision-Making. Library of Progress-Library Science, Information Technology & Computer, 44(3).
- Deshmukh, M. C., Ghadle, K. P., & Jadhav, O. S. (2020). Optimal solution of fully fuzzy LPP with symmetric HFNs. In Computing in Engineering and Technology: Proceedings of ICCET 2019(pp. 387-395). Springer Singapore.
- Kalluri, V. S. Optimizing Supply Chain Management in Boiler Manufacturing through AI-enhanced CRM and ERP Integration. International Journal of Innovative Science and Research Technology (IJISRT).
- Kalluri, V. S. Impact of AI-Driven CRM on Customer Relationship Management and Business Growth in the Manufacturing Sector. International Journal of Innovative Science and Research Technology (IJISRT).
- Sameera, K., & MVR, S. A. R. (2014). Improved power factor and reduction of harmonics by using dual boost converter for PMBLDC motor drive. Int J Electr Electron Eng Res, 4(5), 43-51.
- Sidharth, S. (2017). Real-Time Malware Detection Using Machine Learning Algorithms.
- Sidharth, S. (2017). Access Control Frameworks for Secure Hybrid Cloud Deployments.
- Sidharth, S. (2016). Establishing Ethical and Accountability Frameworks for Responsible AI Systems.
- Sidharth, S. (2015). AI-Driven Detection and Mitigation of Misinformation Spread in Generated Content.
- Sidharth, S. (2015). Privacy-Preserving Generative AI for Secure Healthcare Synthetic Data Generation.
- Sidharth, S. (2018). Post-Quantum Cryptography: Readying Security for the Quantum Computing Revolution.
- Sidharth, S. (2019). DATA LOSS PREVENTION (DLP) STRATEGIES IN CLOUD-HOSTED APPLICATIONS.
- Sidharth, S. (2017). Cybersecurity Approaches for IoT Devices in Smart City Infrastructures.