Morphosyntactic Tagging of Slovene Legal Language. Morphosyntactic Tagging of Slovene Legal Language.

Morphosyntactic Tagging of Slovene Legal Language‪.‬

Informatica 2006, Dec, 30, 4

    • 79,00 Kč
    • 79,00 Kč

Publisher Description

Part-of-speech tagging or, more accurately, morphosyntactic tagging, is a procedure that assigns to each word token appearing in a text its morphosyntactic description, e.g. "masculine singular common noun in the genitive case". Morphosyntactic tagging is an important component of many language technology applications, such as machine translation, speech synthesis, or information extraction. In the paper we report on an experiment on morphosyntactic tagging of Slovene, on a sample of Slovene legal language. We evaluate the accuracy of the TnT tagger, which had been trained on the MULTEXT-East language resources for Slovene. The test data come from the freely available parallel English-Slovene corpus SVEZ-IJS, which contains the Slovene translation European Union legal acts. Presented are the details of the manually corrected test corpus and an analysis of the tagging errors. The paper also discusses a simple transformation-based program that fixes some of the more common errors, and concludes with some directions for future work. Povzetek: V prispevku je opisan poskus oblikoslovnega oznacevanja na vzorcu slovenskih pravnih besedil.

GENRE
Business & Personal Finance
RELEASED
2006
1 December
LANGUAGE
EN
English
LENGTH
19
Pages
PUBLISHER
Slovenian Society Informatika
SIZE
256.4
KB

More Books by Informatica

On the Crossing Number of Almost Planar Graphs. On the Crossing Number of Almost Planar Graphs.
2006
Named Entity Recognition Using Appropriate Unlabeled Data, Post-Processing and Voting (Technical Report) Named Entity Recognition Using Appropriate Unlabeled Data, Post-Processing and Voting (Technical Report)
2010
Efficient Morphological Parsing with a Weighted Finite State Transducer (Report) Efficient Morphological Parsing with a Weighted Finite State Transducer (Report)
2009
The Modelling of Manpower by Markov Chains--a Case Study of the Slovenian Armed Forces (Report) The Modelling of Manpower by Markov Chains--a Case Study of the Slovenian Armed Forces (Report)
2008
Survey of Egovernment Services in Serbia (Report) Survey of Egovernment Services in Serbia (Report)
2007
Statistical Dependency Parsing of Four Treebanks. Statistical Dependency Parsing of Four Treebanks.
2006