Computational Modeling of Morphologically Rich Languages – The Case of Nouns in Albanian Language



Author Information

Anila Çepani, University of Tirana, Albania
Adelina Çerpja, Academy of Sciences of Albania, Albania

Abstract

The Albanian language, characterized as synthetic-analytical, presents a rich system of inflection that poses significant challenges in developing computational models for morphological analysis. This study aims to construct a computational morphological model for the nominal system in Albanian, with a focus on nouns, which exhibit various grammatical categories and forms including number (singular and plural), gender (masculine, feminine, neuter), case (nominative, genitive, dative, accusative, and ablative), and definite and indefinite forms.
The initial phase involves a thorough analysis of the morphological structure of the Albanian nominal system, identifying its grammatical categories, forms, and construction mechanisms, such as endings, inflectional suffixes, stem alternations, suppletion, and combinations. A precise methodology was employed to address these complexities, developing formulas based on different noun stems to encompass all possible forms for each grammatical variation. These formulas are crucial for generating different forms, aiming to minimize manual intervention and streamline the automatic completion of nominal forms. The next phase entails evaluating the developed models by comparing their results to manually constructed forms, thereby enhancing accuracy and efficiency. The effectiveness of the models is validated through real applications, such as Albanian spell checking. These models are indispensable for developing applications for spelling and grammar in Albanian, as well as other NLP applications, thereby advancing natural language processing tools for the Albanian language.


Paper Information

Conference: BCE2024
Stream: Design

This paper is part of the BCE2024 Conference Proceedings (View)
Full Paper
View / Download the full paper in a new tab/window


To cite this article:
Çepani A., & Çerpja A. (2025) Computational Modeling of Morphologically Rich Languages – The Case of Nouns in Albanian Language ISSN: 2435-9467 – The Barcelona Conference on Education 2024: Official Conference Proceedings (pp. 545-557) https://doi.org/10.22492/issn.2435-9467.2024.47
To link to this article: https://doi.org/10.22492/issn.2435-9467.2024.47


Virtual Presentation


Comments & Feedback

Place a comment using your LinkedIn profile

Comments

    Share on activity feed

    Powered by WP LinkPress

    Share this Research

    Posted by James Alexander Gordon