Core Scientific Dataset Model: A lightweight and portable model and file format for multi-dimensional scientific data

Abstract : The Core Scientific Dataset (CSD) model with JavaScript Object Notation (JSON) serialization is presented as a lightweight, portable, and versatile standard for intra- and interdisciplinary scientific data exchange. This model supports datasets with a $p$-component dependent variable, $\{U_0, \ldots, U_q, \ldots, U_{p-1}\}$, discretely sampled at $M$ unique points in a $d$-dimensional independent variable $(X_0, \ldots, X_k, \ldots, X_{d-1})$ space. Moreover, this sampling is over an orthogonal grid, regular or rectilinear, where the principal coordinate axes of the grid are the independent variables. It can also hold correlated datasets assuming the different physical quantities (dependent variables) are sampled on the same orthogonal grid of independent variables. The model encapsulates the dependent variables' sampled data values and the minimum metadata needed to accurately represent this data in an appropriate coordinate system of independent variables. The CSD model can serve as a re-usable building block in the development of more sophisticated portable scientific dataset file standards.
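
The structure described in the abstract can be illustrated with a minimal Python sketch. It is not the published CSD schema: every key name used here (`dimensions`, `kind`, `start`, `step`, `coordinates`, `dependent_variables`, `components`) and every helper function is a hypothetical choice made for illustration only. The sketch assumes d = 2 independent variables (one regular axis, one rectilinear axis) and a single dependent variable with p = 2 components, each stored as a flat list of M values, and it round-trips the result through JSON.

```python
import json
from math import prod


def axis_coordinates(dim):
    """Return the coordinates along one independent-variable axis."""
    if dim["kind"] == "regular":
        # Regular grid: evenly spaced points, start + i * step.
        return [dim["start"] + i * dim["step"] for i in range(dim["count"])]
    if dim["kind"] == "rectilinear":
        # Rectilinear grid: explicitly listed, possibly uneven coordinates.
        return list(dim["coordinates"])
    raise ValueError(f"unknown axis kind: {dim['kind']!r}")


def grid_size(dataset):
    """Return M, the number of grid points (product of the axis lengths)."""
    return prod(len(axis_coordinates(dim)) for dim in dataset["dimensions"])


def validate(dataset):
    """Check that every component of every dependent variable holds M values."""
    m = grid_size(dataset)
    for variable in dataset["dependent_variables"]:
        for component in variable["components"]:
            if len(component) != m:
                raise ValueError(f"expected {m} samples, found {len(component)}")
    return m


# Hypothetical layout, NOT the official CSD file format.
dataset = {
    "dimensions": [
        {"kind": "regular", "count": 4, "start": 0.0, "step": 0.5},  # X_0
        {"kind": "rectilinear", "coordinates": [1.0, 2.0, 4.0]},     # X_1
    ],
    "dependent_variables": [
        {
            # p = 2 components (U_0, U_1), each sampled at all M grid points.
            "components": [
                [float(i) for i in range(12)],
                [float(2 * i) for i in range(12)],
            ],
        }
    ],
}

m = validate(dataset)

# JSON serialization and a round trip back to Python objects.
text = json.dumps(dataset)
restored = json.loads(text)
```

Storing the axes as metadata (start and step, or an explicit coordinate list) rather than as per-point coordinates reflects the orthogonal-grid assumption in the abstract: the M sample positions are implied by the d axes, so only the dependent-variable values need to be stored point by point.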

https://hal.archives-ouvertes.fr/hal-02413965
Contributor : Dominique Massiot
Submitted on : Monday, December 16, 2019 - 2:03:49 PM
Last modification on : Tuesday, January 14, 2020 - 11:21:14 AM

File

CSDM-ManuscriptPLOS.pdf
Publisher files allowed on an open archive

Citation

Deepansh Srivastava, Thomas Vosegaard, Dominique Massiot, Philip Grandinetti. Core Scientific Dataset Model: A lightweight and portable model and file format for multi-dimensional scientific data. PLoS ONE, Public Library of Science, 2020, 15, pp.e0225953. ⟨10.1371/journal.pone.0225953⟩. ⟨hal-02413965⟩
