Preliminary Recommendations


Application to French (Corpus)

The IBMF tagset gives punctuation marks their own tags. They are treated as morphological units in their own right, and they can help predict other tags.

The tagset distinguishes three classes: weak punctuation (AAAA), strong punctuation (YAAA), and sentence boundary (ZTRM).
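
As a rough illustration, the three punctuation classes could be represented as a lookup table. This is only a sketch: the tag codes come from the text, but the assignment of individual marks to each class, and the names PUNCT_TAGS, punct_tag and tag_punctuation, are assumptions made for the example and are not taken from the IBMF tagset.

```python
from typing import List, Optional, Tuple

# Tag codes as given in the text.
WEAK = "AAAA"
STRONG = "YAAA"
SENTENCE_BOUNDARY = "ZTRM"

# ASSUMPTION: which marks fall into which class is illustrative only;
# the source does not list the members of each class.
PUNCT_TAGS = {
    ",": WEAK,
    ";": STRONG,
    ":": STRONG,
    ".": SENTENCE_BOUNDARY,
    "?": SENTENCE_BOUNDARY,
    "!": SENTENCE_BOUNDARY,
}


def punct_tag(token: str) -> Optional[str]:
    """Return the punctuation tag for a token, or None if it is not in the table."""
    return PUNCT_TAGS.get(token)


def tag_punctuation(tokens: List[str]) -> List[Tuple[str, Optional[str]]]:
    """Pair each token with its punctuation tag (None for non-punctuation)."""
    return [(tok, punct_tag(tok)) for tok in tokens]
```

Called on a tokenized sentence such as ["Oui", ",", "merci", "."], tag_punctuation leaves the words untagged and labels only the comma and the full stop, so that a later tagging step could use those labels as context.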