Morphological typology of languages for IR
This paper presents a morphological classification of languages from the IR perspective. Linguistic typology research has shown that the morphological complexity of every language in the world can be described by two variables, index of synthesis and index of fusion. These variables provide a theoretical basis for IR research handling morphological issues. A common theoretical framework is needed in particular because of the increasing significance of cross‐language retrieval research and CLIR systems processing different languages. The paper elaborates the linguistic morphological typology for the purposes of IR research. It studies how the indexes of synthesis and fusion could be used as practical tools in mono‐ and cross‐lingual IR research. The need for semantic and syntactic typologies is discussed. The paper also reviews studies made in different languages on the effects of morphology and stemming in IR.
Year of publication: |
2001
|
---|---|
Authors: | Pirkola, Ari |
Published in: |
Journal of Documentation. - MCB UP Ltd, ISSN 1758-7379, ZDB-ID 1479864-5. - Vol. 57.2001, 3, p. 330-348
|
Publisher: |
MCB UP Ltd |
Subject: | Foreign languages | Text retrieval | Electronic publishing |
Saved in:
Online Resource
Saved in favorites
Similar items by subject
-
Application of probabilistic methods to Chinese
Huang, Xiangji, (1997)
-
Understanding inverse document frequency: on theoretical arguments for IDF
Robertson, Stephen, (2004)
-
Subject retrieval of scholarly monographs via electronic databases
East, John W., (2006)
- More ...