Self-attention encoder-decoder with model adaptation for transliteration and translation tasks in regional language

Shanthala Nagaraja, Kiran Y. Chandappa

Abstract


The recent advancements in natural language processing (NLP) have highlighted the significance of integrating machine transliteration with translation for enhanced language services, particularly in the context of regional languages. This paper introduces a novel neural network architecture that leverages a self-attention mechanism to create an autoencoder without the need for iterative or convolutional processes. The selfattention mechanism operates on projection matrices, feature matrices, and target queries, utilizing the Softmax function for optimization. The introduction of the self-attention encoder-decoder with model adaptation (SAEDM) represents a breakthrough, marking a substantial enhancement in transliteration and translation accuracy over previous methodologies. This innovative approach employs both student and teacher models, with the student model's loss calculated through the probabilities and prediction labels via the negative log entropy function. The proposed architecture is distinctively designed at the character level, incorporating a word-to-word embedding framework, a beam search algorithm for sentence generation, and a binary classifier within the encoder-decoder structure to ensure the uniqueness of the content. The effectiveness of the proposed model is validated through comprehensive evaluations using transliteration and translation datasets in Kannada and Hindi languages, demonstrating its superior performance compared to existing models.

Keywords


Auto-encoder; Natural language processing; Neural network; Self-attention encoder-decoder with model adaptation; Self-attention mechanism

Full Text:

PDF


DOI: http://doi.org/10.11591/ijres.v14.i1.pp243-253

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Reconfigurable and Embedded Systems (IJRES)
p-ISSN 2089-4864, e-ISSN 2722-2608
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

Web Analytics Made Easy - Statcounter View IJRES Stats