ISST-TANL Corpus

ISST-TANL Corpus is a manually annotated corpus, encoded in the standard CoNLL format and including PoS tagging and syntactic dependency annotation. Jointly developed by Cnr-Istituto di Linguistica Computazionale “Antonio Zampolli” (CNR-ILC) and University of Pisa, it exemplifies the general use of the language and consists of articles extracted from newspapers and periodicals, selected to cover a high variety of topics. This corpus was used for training and testing in the shared activity “Domain Adaptation for Dependency Analysis” of EVALITA 2011.