Partial n-Ary relation instances on food packaging composition and permeability extracted from scientific publication tables - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue (Data Paper) Data in Brief Année : 2022

Partial n-Ary relation instances on food packaging composition and permeability extracted from scientific publication tables

Résumé

This dataset is dedicated to text mining and is composed of partial n-Ary relation instances concerning food packaging composition and gas permeability. It was created from 31 tables derived from 10 English-language scientific articles in html format from several international journals hosted on the ScienceDirect website. This dataset includes two sets of data: manual table annotation results and automatic data extraction results. The tables were first annotated by one annotator and cross-curated by three different annotators. The annotation task aimed to identify all table data dealing with packaging permeability measurements and compositions. An Ontological and Terminological Resource (OTR) was used for the annotation process. The annotation guidelines were drawn up through a collective iterative approach involving the annotators, and they may be accessed alongside the data. This dataset of n-Ary relations can be used in natural language processing (NLP) approaches implemented in experimental fields, especially for n-Ary relation extraction research. It can also be useful for training or evaluation of methods for the extraction of experimental data from tables and text in scientific documents, especially in experimental domains such as food packaging.
Fichier principal
Vignette du fichier
1-s2.0-S2352340922002116-main.pdf (982.25 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03610433 , version 1 (16-03-2022)

Licence

Paternité

Identifiants

Citer

Martin Lentschat, Patrice Buche, Luc Menut, Romane Guari, Mathieu Roche. Partial n-Ary relation instances on food packaging composition and permeability extracted from scientific publication tables. Data in Brief, 2022, 41, pp.108000. ⟨10.1016/j.dib.2022.108000⟩. ⟨hal-03610433⟩
457 Consultations
46 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More