Analysing Unified Embedding with Morphological Insight for Multilingual Text Representation

Auteurs-es

  • Govinda Rajulu G Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology
  • L.Sharmila Sri Venkateswara College of Technology, Chennai.
  • L.Sharmila Sri Venkateswara College of Technology, Chennai.
  • D. Venkatesan St. Martin’s Engineering College
  • D. Venkatesan St. Martin’s Engineering College
  • Jayapraksah Chinnadurai GALGOTIAS UNIVERSITY
  • Jayapraksah Chinnadurai GALGOTIAS UNIVERSITY

DOI :

https://doi.org/10.5269/bspm.82246

Résumé

The increasing complexity of multilingual text representation challenges the design of embedding techniques that can effectively capture linguistic nuances across different languages. In this paper, we propose a unified embedding model that integrates morphological insights for multilingual text representation, specifically focusing on a dataset that spans English, Marathi, and Konkani languages. We explore various embedding methods, such as Fused Embeddings, Character-Level Embeddings, Word-Level Embeddings, and Contextualized Multilingual Embeddings, and demonstrate how these methods can be combined with morphological information for enhanced language understanding. Our approach is illustrated with a real-world dataset, and we evaluate its performance on downstream tasks like text summarization, highlighting the gains from fused embeddings. The comparative analysis of different embedding methods, along with a detailed workflow and mathematical modeling, provides insights into the strengths and weaknesses of each approach

Téléchargements

Publié

2026-06-19

Numéro

Rubrique

Conf. Issue: Applications of Mathematics in Modern Science and Technology

Comment citer

G, G. R., L.Sharmila, L.Sharmila, D. Venkatesan, D. Venkatesan, Chinnadurai , J. . ., & Chinnadurai , J. . . (2026). Analysing Unified Embedding with Morphological Insight for Multilingual Text Representation. Boletim Da Sociedade Paranaense De Matemática, 44(17), 1-13. https://doi.org/10.5269/bspm.82246