next up previous contents
Next: The content of the Up: Short summaries of the Previous: The report on tagset

The report on tagset mapping (TSM)

This report describes the experiments carried out in view of a mapping between the English version of the EAGLES morphosyntax proposals and existing corpus tagsets (EAGLES, 1996h). The tagsets chosen as examples are:

The first part contains a concise description of the objectives of the mapping exercise, of the preparation of an ELM specification for English (cf. the description of the ELM reports), the technical realisation of the mapping procedures and the use and output of the mapping from the LIQUY tool.

The second part contains a draft ELM specification for English, which mainly follows the organisation of the other ELM reports. It is however less worked-out, because it has not been used for manual tagging. Thus, the parts which support the operational use (delimitation tables) are not available in as much detail as they are for German, French and Italian.

The annex contains the Prolog specification of ELM-EN, the compiled feature hierarchy and lists of the BNC and UPenn tagsets.

The mapping is exemplified by a list of all possible queries in ELM-EN and all results produced on BNC and on UPenn.