next up previous contents
Next: Tools Up: About the documents of Previous: Corpus encoding

Linguistic annotation

Morphosyntactic annotation is addressed as part of the linguistic annotation task (EAGLES, 1996c). This report constitutes a proposal which results from intensive collaboration with the EAGLES Computational Lexicons Working Group.

Recommendations for three different degrees of standardisation are discussed and recommendations for morphosyntactic categories are presented. An intermediate tagset with obligatory, recommended and optional attributes or values is described in the body of the report. An Appendix containing an English and an Italian tagset mapped onto the intermediate tagset for morphosyntactic annotation is included.

The document on syntactic annotation (EAGLES, 1996d) is structured in 5 sections plus an Appendix in which the annotation scheme proposed is illustrated with small text samples from Dutch, English, Spanish and German.

After an introductory section in which definitions, goals and several conceptual distinctions are discussed, section 3 addresses the different layers of annotation. The guidelines are found in section 4, where obligatory and recommended annotations and optional categories are proposed; this is complemented by the discussion of some of the issues in the practical application of the scheme in section 5.