ILC will contribute substantially to the project esp. in the area of the Bio-Lexicon design and population, drawing from its outstanding knowledge about lexicon design and standard-conformant lexicon specifications.
In particular, ILC will be involved in the following activities:
design of a representational model for a terminological lexicon (Bio-Lexicon) which will be re-usable, extendable and easy to link to a generic lexicon; the model will integrate terminological, ontological and lexical information and be compatible with existing and upcoming HLT standards in order to support advanced text mining applications in biology;
implementation of the model: the container with all the lexical objects and data categories required will be provided, together with the means to fill and inspect the container itself and allowing communication with instances of the model via the web;
analysis and automatic generation of equivalent classes of terminological variants;
acquisition of linguistic information about terms from text corpora and term inventories and population of the Bio-Lexicon with extracted terms; additional detailed linguistic information about terms will be extracted to enhance the utility of the Bio-Lexicon;
acquisition of bio-events from biomedical text corpora and population of the Bio-Lexicon with verbs and bio-event nouns; bio-event information will be represented in terms of event frames according to the representation model;
Mapping Terminological Resources to an Ontology;
HLT Infrastructure and Tool Integration;
Multilingual Access.
Deliverable
BioLexicon Data Base - Version 1
BioLexicon Data Base - Version 2
BioLexicon Data Base - Version 3
BioLexicon Data Base - Release Note
I documenti sono protetti con password; per eventuali problemi si prega di contattare Riccardo Del Gratta.
@
BioLexicon
Complete BioLexicon
BioLexicon Only CHEBI
DDL
DDL Data Base New Version
I documenti sono protetti con password; per eventuali problemi si prega di contattare Riccardo Del Gratta.