Computational Modeling of Morphologically Rich Languages – The Case of Nouns in Albanian Language

Abstract

The Albanian language, characterized as synthetic-analytical, presents a rich system of inflection that poses significant challenges in developing computational models for morphological analysis. This study aims to construct a computational morphological model for the nominal system in Albanian, with a focus on nouns, which exhibit various grammatical categories and forms including number (singular and plural), gender (masculine, feminine, neuter), case (nominative, genitive, dative, accusative, and ablative), and definite and indefinite forms.
The initial phase involves a thorough analysis of the morphological structure of the Albanian nominal system, identifying its grammatical categories, forms, and construction mechanisms, such as endings, inflectional suffixes, stem alternations, suppletion, and combinations. A precise methodology was employed to address these complexities, developing formulas based on different noun stems to encompass all possible forms for each grammatical variation. These formulas are crucial for generating different forms, aiming to minimize manual intervention and streamline the automatic completion of nominal forms. The next phase entails evaluating the developed models by comparing their results to manually constructed forms, thereby enhancing accuracy and efficiency. The effectiveness of the models is validated through real applications, such as Albanian spell checking. These models are indispensable for developing applications for spelling and grammar in Albanian, as well as other NLP applications, thereby advancing natural language processing tools for the Albanian language.



Author Information
Anila Çepani, University of Tirana, Albania
Adelina Çerpja, Academy of Sciences of Albania, Albania

Paper Information
Conference: BCE2024
Stream: Design

This paper is part of the BCE2024 Conference Proceedings (View)
Full Paper
View / Download the full paper in a new tab/window


To cite this article:
Çepani A., & Çerpja A. (2025) Computational Modeling of Morphologically Rich Languages – The Case of Nouns in Albanian Language ISSN: 2435-9467 – The Barcelona Conference on Education 2024: Official Conference Proceedings (pp. 545-557) https://doi.org/10.22492/issn.2435-9467.2024.47
To link to this article: https://doi.org/10.22492/issn.2435-9467.2024.47


Virtual Presentation


Comments & Feedback

Place a comment using your LinkedIn profile

Comments

Share on activity feed

Powered by WP LinkPress

Share this Research

Posted by James Alexander Gordon