Corpus de sintaxis superficial del finés en formato conll
A 2,000-sentence corpus of weather-related sentences of Finnish has been annotated, using surface syntactic relations according to the Meaning-Text Theory. This corpus will be used to obtain resources that will help to improve the PESCaDO system and that will be adaptable to future projects. More in detail, such annotation will be used for i) training a parser, for extracting new data from Finnish webpages, and ii) automatically obtaining deeper levels of annotation (Deep Syntax and Semantics); this will allow for training a statistical generator which can be integrated to the Linguistic Generation module. The annotation is presented in standard CoNLL (one word per line) format.