Analysing Unified Embedding with Morphological Insight for Multilingual Text Representation

Govinda Rajulu G; L.Sharmila; L.Sharmila; D. Venkatesan; D. Venkatesan; Jayapraksah   Chinnadurai; Jayapraksah   Chinnadurai

doi:10.5269/bspm.82246

Analysing Unified Embedding with Morphological Insight for Multilingual Text Representation

Auteurs-es

Govinda Rajulu G Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology
L.Sharmila Sri Venkateswara College of Technology, Chennai.
L.Sharmila Sri Venkateswara College of Technology, Chennai.
D. Venkatesan St. Martin’s Engineering College
D. Venkatesan St. Martin’s Engineering College
Jayapraksah Chinnadurai GALGOTIAS UNIVERSITY
Jayapraksah Chinnadurai GALGOTIAS UNIVERSITY

DOI :

https://doi.org/10.5269/bspm.82246

Résumé

The increasing complexity of multilingual text representation challenges the design of embedding techniques that can effectively capture linguistic nuances across different languages. In this paper, we propose a unified embedding model that integrates morphological insights for multilingual text representation, specifically focusing on a dataset that spans English, Marathi, and Konkani languages. We explore various embedding methods, such as Fused Embeddings, Character-Level Embeddings, Word-Level Embeddings, and Contextualized Multilingual Embeddings, and demonstrate how these methods can be combined with morphological information for enhanced language understanding. Our approach is illustrated with a real-world dataset, and we evaluate its performance on downstream tasks like text summarization, highlighting the gains from fused embeddings. The comparative analysis of different embedding methods, along with a detailed workflow and mathematical modeling, provides insights into the strengths and weaknesses of each approach

Téléchargements

PDF (Anglais)

Publié

2026-06-19

Numéro

Vol. 44 No. 17 (2026): Recent Trends in Mathematical Sciences and Technological Applications

Rubrique

Conf. Issue: Applications of Mathematics in Modern Science and Technology

Licence

Cette œuvre est sous licence Creative Commons Attribution 4.0 International.

When the manuscript is accepted for publication, the authors agree automatically to transfer the copyright to the (SPM).

The journal utilize the Creative Common Attribution (CC-BY 4.0).

Comment citer

G, G. R., L.Sharmila, L.Sharmila, D. Venkatesan, D. Venkatesan, Chinnadurai , J. . ., & Chinnadurai , J. . . (2026). Analysing Unified Embedding with Morphological Insight for Multilingual Text Representation. Boletim Da Sociedade Paranaense De Matemática, 44(17), 1-13. https://doi.org/10.5269/bspm.82246