Knowledge model of regulatory networks involved in Arabidopsis seed development for information extraction and integration from text - HAL open archive
Conference paper, Year: 2015

Knowledge model of regulatory networks involved in Arabidopsis seed development for information extraction and integration from text

Bertrand Dubreucq
Loic Lepiniec

Abstract

A comprehensive understanding of the molecular network underlying the regulation of seed development remains a major scientific challenge with important potential impact for fundamental research, agriculture and industry. Seed development requires the coordinated growth of different tissues, which involves complex genetic and environmental regulation. Most of this knowledge is scattered across thousands of articles. Integrating knowledge conveyed by text with knowledge derived from experimental data requires a common model with a view to modeling in systems biology. In this work, we focused on a model organism, Arabidopsis thaliana. We designed a knowledge model that meets the needs of text mining (i.e., manual annotation of texts and automatic information extraction), experimental data indexing and retrieval, and reuse for other plant systems. The model defines 6 sets of entity types that cover 16 types of biological entities and external factors. The first two sets are central to regulation description: genetic process entities and their products, such as molecular entities or biochemical processes. The last four, Genotypes, Tissues, Development phase and Environmental factor, define the conditions under which the regulation is observed. Events represent binary and n-ary relations between entities: three event types describe regulations involving gene expression, metabolic pathways, regulatory networks and molecular accumulation, which is critical in seed development. Additional qualifiers specify involvement, activation, inhibition or requirement. Molecular interactions are represented by three biological events: Binds_to, Encodes and Interacts_with. Comparison, Belongs_To, Found_In and Found_During events are used to describe similarity, membership, spatial and temporal relations among entities. All events have Negation and Speculation modalities. This model was used by biology experts to manually annotate a reference corpus and to train our Alvis methods, which have shown its relevance.
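As an illustration only, the event layer described in the abstract could be sketched as a small data structure. This is a minimal sketch under stated assumptions, not the authors' schema or the Alvis format: the class names `Entity`, `Event` and `EventType`, the entity-type labels "Gene", "Protein" and "Tissue", and the entity names "GeneX" and "ProteinX" are all hypothetical. Only the seven event names and the Negation and Speculation modalities come from the abstract. The three regulation event types and their qualifiers are left out because the abstract does not name them.

```python
from dataclasses import dataclass
from enum import Enum


class EventType(Enum):
    """Event names quoted in the abstract (regulation events are unnamed there)."""
    BINDS_TO = "Binds_to"
    ENCODES = "Encodes"
    INTERACTS_WITH = "Interacts_with"
    COMPARISON = "Comparison"
    BELONGS_TO = "Belongs_To"
    FOUND_IN = "Found_In"
    FOUND_DURING = "Found_During"


@dataclass(frozen=True)
class Entity:
    """A text mention with an entity type (type labels here are assumptions)."""
    text: str
    entity_type: str


@dataclass
class Event:
    """A binary or n-ary relation between entities, with the two modalities."""
    event_type: EventType
    arguments: tuple
    negation: bool = False
    speculation: bool = False

    def __post_init__(self):
        # Events relate entities, so at least two arguments are required.
        if len(self.arguments) < 2:
            raise ValueError("an event needs at least two entity arguments")


# Hypothetical annotations: "GeneX encodes ProteinX",
# and "ProteinX may be found in the embryo" (speculative).
gene = Entity("GeneX", "Gene")
protein = Entity("ProteinX", "Protein")
tissue = Entity("embryo", "Tissue")

events = [
    Event(EventType.ENCODES, (gene, protein)),
    Event(EventType.FOUND_IN, (protein, tissue), speculation=True),
]
```

Keeping Negation and Speculation as flags on every event mirrors the statement that all events carry both modalities. A fuller sketch would add the regulation events and their involvement, activation, inhibition and requirement qualifiers once their exact names are taken from the paper itself.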
Main file
Biocreative2015_knowledge_model_Arabidopsis_ECVfinaleV4_1.pdf (3.18 MB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-01512197, version 1 (02-06-2020)

Identifiers

  • HAL Id: hal-01512197, version 1
  • PRODINRA: 344677

Cite

Estelle Chaix, Bertrand Dubreucq, Dialekti Valsamou, Abdelhak Fatihi, Robert Bossy, et al. Knowledge model of regulatory networks involved in Arabidopsis seed development for information extraction and integration from text. BioCreative 5, Sep 2015, Madrid, Spain. ⟨hal-01512197⟩