Analysing Unified Embedding with Morphological Insight for Multilingual Text Representation

Govinda Rajulu G; L.Sharmila; L.Sharmila; D. Venkatesan; D. Venkatesan; Jayapraksah   Chinnadurai; Jayapraksah   Chinnadurai

doi:10.5269/bspm.82246

Analysing Unified Embedding with Morphological Insight for Multilingual Text Representation

Autores/as

Govinda Rajulu G Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology
L.Sharmila Sri Venkateswara College of Technology, Chennai.
L.Sharmila Sri Venkateswara College of Technology, Chennai.
D. Venkatesan St. Martin’s Engineering College
D. Venkatesan St. Martin’s Engineering College
Jayapraksah Chinnadurai GALGOTIAS UNIVERSITY
Jayapraksah Chinnadurai GALGOTIAS UNIVERSITY

DOI:

https://doi.org/10.5269/bspm.82246

Resumen

The increasing complexity of multilingual text representation challenges the design of embedding techniques that can effectively capture linguistic nuances across different languages. In this paper, we propose a unified embedding model that integrates morphological insights for multilingual text representation, specifically focusing on a dataset that spans English, Marathi, and Konkani languages. We explore various embedding methods, such as Fused Embeddings, Character-Level Embeddings, Word-Level Embeddings, and Contextualized Multilingual Embeddings, and demonstrate how these methods can be combined with morphological information for enhanced language understanding. Our approach is illustrated with a real-world dataset, and we evaluate its performance on downstream tasks like text summarization, highlighting the gains from fused embeddings. The comparative analysis of different embedding methods, along with a detailed workflow and mathematical modeling, provides insights into the strengths and weaknesses of each approach

Descargas

PDF (Inglés)

Publicado

2026-06-19

Número

Vol. 44 Núm. 17 (2026): Recent Trends in Mathematical Sciences and Technological Applications

Sección

Conf. Issue: Applications of Mathematics in Modern Science and Technology

Licencia

Derechos de autor 2026 Boletim da Sociedade Paranaense de Matemática

Esta obra está bajo una licencia internacional Creative Commons Atribución 4.0.

When the manuscript is accepted for publication, the authors agree automatically to transfer the copyright to the (SPM).

The journal utilize the Creative Common Attribution (CC-BY 4.0).

Cómo citar

G, G. R., L.Sharmila, L.Sharmila, D. Venkatesan, D. Venkatesan, Chinnadurai , J. . ., & Chinnadurai , J. . . (2026). Analysing Unified Embedding with Morphological Insight for Multilingual Text Representation. Boletim Da Sociedade Paranaense De Matemática, 44(17), 1-13. https://doi.org/10.5269/bspm.82246