A Special Issue of the Computational Linguistics Journal
Parsing Morphologically Rich Languages

In the context of computational linguistics, parsing is the task of automatically analyzing the syntactic structure of sentences in natural language, providing information that is crucial for further semantic processing and downstream applications. Although the performance of parsing systems has in general improved tremendously in recent years, there is increasing evidence that performance is highly sensitive to typological differences between languages. Thus, statistical models for phrase structure parsing developed for English often exhibit a drastic drop in performance when applied to languages such as German, Arabic, French and Hebrew. Similarly, multilingual evaluation campaigns for statistical dependency parsers have shown considerable variation in accuracy across languages, which seems to be related at least partly to typological characteristics. In both cases, it appears that the greatest challenges are posed by morphologically rich languages (MRLs), where significant information concerning syntactic structure is expressed at the word level, where each word can have a very high number of possible forms, and where word order is weakly constrained by syntactic structure.

The challenges exhibited by MRLs transcend language boundaries, and emerging insights are often relevant across different theoretical frameworks and methodological traditions. Considering parsing research from the point of view of MRLs therefore sheds light on the generality and adequacy of currently available state-of-the-art parsing methods for dealing with complex linguistic phenomena, vis-à-vis morphosyntactic interactions. This special issue aims to provide a focal point for studies of large-scale, broad-coverage parsing models that can successfully cope with the challenges exhibited by MRLs, from both the formal and the statistical points of view. It sets out to provide an overview of state-of-the-art solutions, shared insights across languages and frameworks, and lessons relevant to downstream applications (such as machine translation of MRLs).


We solicit novel contributions describing completed work on broad-coverage parsing of morphologically rich languages, from formal or statistical points of view, in a single framework or in multiple frameworks. We encourage contributions that emphasize how particular methods respond to the challenges associated with parsing MRLs and morphosyntactic phenomena, and that go beyond the idiosyncrasies associated with individual languages. The range of topics to be covered in the special issue includes, but is not limited to:


In order to provide wide exposure to the state of the art in the field, allowing us to cover multiple frameworks as well as multiple languages that exhibit different structures and characteristics, the extended editorial board of this special issue will use a new format with multiple short papers of up to 25 pages each (excluding references). Submitted papers must follow the CL formatting guidelines. Submissions should be made through the CL electronic submission system.


