A. We, O. Constantin, R. Corcho, A. Filgueira, A. Hardisty et al., We also thank those who helped us identify ten-year achievements Atkinson was supported by the EU H2020 project ENVRI plus No. 654182 20 and by SKA- Link 21 . Gesing was supported by the Science Gateways Community Institute NSF Award Number ACI-1547611 22 , by the IMLS project " Planning a Research Data Software Preservation Quality Tool Award Id LG-72-16-0122-16 23 and by the Center for Research Computing at the The core Triana workflow system (Taylor) was supported by PPARC (GridOneD and Geo 600), Montagnat was partly funded by the French National Research Agency (ANR) through the " Investments for the Future " Program reference ANR-11-LABX OMII-UK (WHIP) and the Sintero 28

C. S. Liew, M. P. Atkinson, M. Galea, T. F. Ang, P. Martin et al., Scientific Workflows, ACM Computing Surveys, vol.49, issue.4, pp.1-6639, 2017.
DOI : 10.1109/CCGrid.2011.74

S. Davidson and J. Freire, Provenance and scientific workflows, Proceedings of the 2008 ACM SIGMOD international conference on Management of data , SIGMOD '08, pp.1-6, 2008.
DOI : 10.1145/1376616.1376772

S. Gómez-pérez and . Bechhofer, Using a suite of ontologies for preserving workflow-centric research objects, Web Semantics: Science, Services and Agents on the World Wide Web, vol.32, pp.16-42, 2015.

N. Cerezo, J. Montagnat, and M. Blay-fornarino, Computer-Assisted Scientific Workflow Design, Journal of Grid Computing, vol.3, issue.2, pp.585-610, 2013.
DOI : 10.1145/2110497.2110509

URL : https://hal.archives-ouvertes.fr/hal-00833692

D. Garijo, Y. Gil, O. Corcho, . Abstract, . Link et al., Exploit: An end-to-end framework for workflow sharing, Future Gener. Comput. Syst. This special issue (N), p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.008

I. S. Pérez, R. Ferreira-da-silva, M. Rynge, E. Deelman, M. S. Pérez-hernández et al., Reproducibility of execution environments in computational science using semantics and clouds, Future Gener. Comput. Syst, pp.67-354, 2017.

R. Filgueira, R. Ferreira-da-silva, A. Krause, E. Deelman, and M. P. Atkinson, Asterism: Pegasus and Dispel4py Hybrid Workflows for Data-Intensive Science, 2016 Seventh International Workshop on Data-Intensive Computing in the Clouds (DataCloud), pp.1-8, 2016.
DOI : 10.1109/DataCloud.2016.004

M. D. Wilkinson, M. Dumontier, I. J. Aalbersberg, G. Appleton, M. Axton et al., The FAIR Guiding Principles for scientific data management and stewardship, Scientific Data, vol.22, issue.2012, 2016.
DOI : 10.7717/peerj-cs.1

URL : http://doi.org/10.1038/sdata.2016.18

K. Plankensteiner, R. Prodan, M. Janetschek, T. Fahringer, J. Montagnat et al., Fine-Grain Interoperability of Scientific Workflows in Distributed Computing Infrastructures, Fine-Grain Interoperability of Scientific Workflows in Distributed Computing Infrastructures, pp.429-456, 2013.
DOI : 10.1145/997140.997145

URL : https://hal.archives-ouvertes.fr/hal-00832214

G. Terstyanszky, T. Kukla, T. Kiss, P. Kacsuk, A. Balasko et al., Enabling scientific workflow sharing through coarse-grained interoperability, Future Generation Computer Systems, vol.37, issue.0, pp.46-59
DOI : 10.1016/j.future.2014.02.016

J. Arshad, G. Terstyánszky, T. Kiss, N. Weingarten, and G. Taffoni, A Formal Approach to Support Interoperability in Scientific Meta-workflows, Journal of Grid Computing, vol.76, issue.1, pp.655-671, 2016.
DOI : 10.1016/j.jcss.2009.11.009

J. Goecks, A. Nekrutenko, J. Taylor, and T. G. Team, Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences, Genome Biology, vol.11, issue.8, p.86, 2010.
DOI : 10.1186/gb-2010-11-8-r86

M. Abouelhoda, S. A. Issa, and M. Ghanem, Tavaxy: Integrating Taverna and Galaxy workflows with cloud computing support, BMC Bioinformatics, vol.13, issue.1, pp.1471-2105
DOI : 10.1101/gr.5969107

URL : http://doi.org/10.1186/1471-2105-13-77

R. Ferreira-da-silva, R. Filgueira, I. Pietri, M. Jiang, R. Sakellariou et al., A characterization of workflow management systems for extreme-scale applications, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.02.026

T. Glatard, Software architectures to integrate workflow engines in science gateways, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.005

URL : http://doi.org/10.1016/j.future.2017.01.005

R. Sethi and Y. Gil, Scientific workflows in data analysis: Bridging expertise across multiple domains, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.001

S. Cohen-boulakia, Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.012

URL : https://hal.archives-ouvertes.fr/hal-01516082

S. Woodman, H. Hiden, and P. Watson, Applications of provenance in performance prediction and data storage optimisation, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.003

P. Alper, K. Belhajjame, and C. A. Goble, Static analysis of Taverna workflows to predict provenance patterns, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.004

C. Pradal, C. Fournier, P. Valduriez, and S. Cohen-boulakia, OpenAlea, Proceedings of the 27th International Conference on Scientific and Statistical Database Management, SSDBM '15, pp.1-11, 2015.
DOI : 10.1007/978-3-642-13818-8_33

URL : https://hal.archives-ouvertes.fr/hal-00831785

M. Janetschek, R. Prodan, and S. Benedict, A Workflow Runtime Environment for Manycore Parallel Architectures, Future Gener. Comput. Syst. This special issue (N), p.pp?pp, 2017.
DOI : 10.1145/2822332.2822333

V. Arabnejad, K. Bubendorfer, and B. Ng, Scheduling deadline constrained scientific workflows on dynamically provisioned cloud resources, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.002

J. Wang, A. Taal, P. Martin, Y. Hu, H. Zhou et al., Planning virtual infrastructures for time critical applications with multiple deadline constraints, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.02.001

A. M. Chirkin, A. S. Belloum, S. V. Kovalchuk, M. X. Makkes, M. A. Melnik et al., Execution Time Estimation for Workflow Scheduling, Future Gener. Comput. Syst. This special issue (N), p.pp?pp, 2017.
DOI : 10.1109/works.2014.11

J. P. Macker and I. Taylor, Orchestration and analysis of decentralized worklows within heterogeneous networking infrastructures, Future Gener. Comput. Syst. This special issue (N), p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.007

V. Silva, J. Leite, J. J. Camata, D. De-oliveira, A. L. Coutinho et al., Raw data queries during data-intensive parallel workflow execution, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.01.016

URL : https://hal.archives-ouvertes.fr/lirmm-01445219

S. Al-kiswany, L. B. Costa, H. Yang, E. Vairavanathan, and M. Ripeanu, A cross-layer optimized storage system for workflow applications, Future Generation Computer Systems, vol.75, p.pp?pp, 2017.
DOI : 10.1016/j.future.2017.02.038

S. Fehrenbach and J. Cheney, Language-integrated provenance, Proceedings of the 18th International Symposium on Principles and Practice of Declarative Programming, PPDP '16, pp.214-227, 2016.
DOI : 10.1145/2723372.2735359

URL : http://arxiv.org/abs/1607.04104

S. Gesing, M. Atkinson, R. Filgueira, I. Taylor, A. Jones et al., Workflows in a Dashboard: A New Generation of Usability, 2014 9th Workshop on Workflows in Support of Large-Scale Science, pp.82-93, 2014.
DOI : 10.1109/WORKS.2014.6

W. Fan, Y. Wu, J. Xu, W. Yu, J. Jiang et al., Parallelizing Sequential Graph Computations, Proceedings of the 2017 ACM International Conference on Management of Data , SIGMOD '17, 2017.
DOI : 10.1145/2749246.2749258

Y. Chronis, Y. Foufoulas, V. Nikolopoulos, A. Papadopoulos, L. Stamatogiannakis et al., A relational approach to complex dataflows, Proceedings of the Workshops of the EDBT/ICDT 2016 Joint Conference, EDBT/ICDT Workshops 2016 of CEUR Workshop Proceedings, CEUR-WS.org, 2016.

H. Glaves, Data-Intensive Federations in which the British Geological Survey (BGS) is involved, Personal communication at ENVRI week, 2016.

D. Robertson, F. Giunchiglia, S. Pavis, E. Turra, G. Bella et al., Healthcare data safe havens: towards a logical architecture and experiment automation, The Journal of Engineering, vol.1, issue.1
DOI : 10.1049/joe.2016.0170

C. Wu, R. Tobar, K. Vinsen, A. Wicenec, D. Pallot et al., DALiuGE: A graph execution framework for harnessing the astronomical data deluge, Astronomy and Computing, vol.20
DOI : 10.1016/j.ascom.2017.03.007

M. Pierce, S. Marru, S. Pamidighantam, B. Demeler, E. Brookes et al., Apache airavata: Enabling science with science gateways
DOI : 10.1002/cpe.3534

J. J. Durillo, V. Nae, and R. Prodan, Multi-objective Workflow Scheduling: An Analysis of the Energy Efficiency and Makespan Tradeoff, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, pp.203-210, 2013.
DOI : 10.1109/CCGrid.2013.62

D. Kimovski, N. Saurabh, S. Gec, P. Stefanic, G. Kecskemeti et al., Towards an Environment for Efficient and Transparent Virtual Machine Operations: The ENTICE Approach, 2016 5th IEEE International Conference on Cloud Networking (Cloudnet), pp.242-247, 2016.
DOI : 10.1109/CloudNet.2016.30

J. Qin, T. Fahringer, and R. Prodan, A novel graph based approach for automatic composition of high quality grid workflows, Proceedings of the 18th ACM international symposium on High performance distributed computing, HPDC '09, pp.167-176, 2009.
DOI : 10.1145/1551609.1551637

P. Kacsuk, T. Kiss, and G. Sipos, Solving the grid interoperability problem by P-GRADE portal at workflow level, Future Generation Computer Systems, vol.24, issue.7, pp.744-751, 2008.
DOI : 10.1016/j.future.2008.02.008

T. Kiss and T. Kukla, Achieving Interoperation of Grid Data Resources via Workflow Level Integration, Journal of Grid Computing, vol.33, issue.4, pp.355-374, 2009.
DOI : 10.1007/11511854_1

G. Terstyánszky, T. Kukla, T. Kiss, P. Kacsuk, and ´. A. Balaskó, Enabling scientific workflow sharing through coarse-grained interoperability, Future Generation Computer Systems, vol.37, pp.37-46, 2014.
DOI : 10.1016/j.future.2014.02.016

J. Arshad, A. Hoffmann, S. Gesing, R. Grunzke, J. Krüger et al., Multi-level meta-workflows: new concept for regularly occurring tasks in quantum chemistry, Journal of Cheminformatics, vol.362, issue.16, 2016.
DOI : 10.1016/j.ica.2008.06.002

URL : http://doi.org/10.1186/s13321-016-0169-8

T. Kiss, P. Greenwell, H. Heindl, G. Terstyánszky, and N. Weingarten, Parameter Sweep Workflows for Modelling Carbohydrate Recognition, Journal of Grid Computing, vol.30, issue.6, pp.587-601, 2010.
DOI : 10.1021/ci00057a005

S. Shahand, A. Benabdelkader, M. M. Jaghoori, M. Mourabit, J. Huguet et al., A data-centric neuroscience gateway: design, implementation, and experiences, Concurrency and Computation: Practice and Experience, vol.10, issue.4, pp.489-506, 2015.
DOI : 10.1007/s10723-012-9242-3

URL : https://zenodo.org/record/10724/files/paper12.pdf

A. Costa, P. Massimino, M. Bandieramonte, U. Becciani, M. Krokos et al., An Innovative Science Gateway for the Cherenkov Telescope Array, Journal of Grid Computing, vol.9, issue.3, pp.547-559, 2015.
DOI : 10.1007/978-3-319-11268-8_5

B. Ludäscher, I. Altintas, C. Berkley, D. Higgins, E. Jaeger et al., Scientific workflow management and the Kepler system, Scientific workflow management and the kepler system, pp.1039-1065, 2006.
DOI : 10.1016/B978-012387582-2/50033-2

S. Bowers, T. M. Mcphillips, S. Riddle, M. K. Anand, B. Ludäscher et al., Kepler/pPOD: Scientific Workflow and Provenance Support for Assembling the Tree of Life, Provenance and Annotation of Data and Processes, Second International Provenance and Annotation Workshop, pp.70-77, 2008.
DOI : 10.1007/978-3-540-89965-5_9

P. Mouallem, R. Barreto, S. Klasky, N. Podhorszki, and M. A. Vouk, Tracking Files in the Kepler Provenance Framework, Scientific and Statistical Database Management, 21st International Conference, SSDBM 2009 Proceedings, pp.273-282, 2009.
DOI : 10.1007/11890850_15

L. Moreau, B. Ludäscher, I. Altintas, R. S. Barga, S. Bowers et al., Special Issue: The First Provenance Challenge, Concurrency and Computation: Practice and Experience, pp.409-418, 2008.
DOI : 10.1002/cpe.1233

L. Moreau, B. Clifford, J. Freire, J. Futrelle, Y. Gil et al., The Open Provenance Model core specification (v1.1), Future Generation Computer Systems, vol.27, issue.6, pp.743-756, 2011.
DOI : 10.1016/j.future.2010.07.005

J. Wang, D. Crawl, and I. Altintas, Kepler + Hadoop, Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, WORKS '09, 2009.
DOI : 10.1145/1645164.1645176

A. Goderis, C. X. Brooks, I. Altintas, E. A. Lee, and C. A. Goble, Heterogeneous composition of models of computation, Future Generation Computer Systems, vol.25, issue.5, pp.552-560, 2009.
DOI : 10.1016/j.future.2008.06.014

A. L. Hartman, S. Riddle, T. Mcphillips, B. Ludäscher, and J. A. Eisen, Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences, BMC Bioinformatics, vol.11, issue.1, p.317, 2010.
DOI : 10.1186/1471-2105-11-317

T. Stropp, T. M. Mcphillips, B. Ludäscher, and M. Bieda, Workflows for microarray data processing in the Kepler environment, BMC Bioinformatics, vol.13, issue.1, 2012.
DOI : 10.1093/bioinformatics/btr499

J. Cummings, A. Pankin, N. Podhosrzki, G. Park, S. Ku et al., Plasma edge kinetic-mhd modeling in tokamaks using kepler workflow for code coupling, data management and visualization, Communications in Computational Physics, vol.4, issue.3, pp.675-702, 2008.

S. Aiche, T. Sachsenberg, E. Kenar, M. Walzer, B. Wiswedel et al., Workflows for automated downstream data analysis and visualization in large-scale computational mass spectrometry, PROTEOMICS, vol.8, issue.8
DOI : 10.1074/mcp.M113.035907

URL : http://doi.org/10.1002/pmic.201400391

T. Glatard, J. Montagnat, D. Lingrand, and X. Pennec, Flexible and Efficient Workflow Deployment of Data-Intensive Applications On Grids With MOTEUR, International Journal of High Performance Computing Applications, vol.22, issue.3, pp.347-360, 2008.
DOI : 10.1177/1094342008096067

T. Glatard, C. Lartizien, B. Gibaud, R. Ferreira-da-silva, G. Forestier et al., A Virtual Imaging Platform for Multi-Modality Medical Image Simulation, IEEE Transactions on Medical Imaging, vol.32, issue.1, pp.110-118, 2013.
DOI : 10.1109/TMI.2012.2220154

URL : https://hal.archives-ouvertes.fr/inserm-00762497

J. Montagnat, B. Isnard, T. Glatard, K. Maheshwari, and M. , Blay-Fornarino, A data-driven workflow language for grids based on array programming principles, pp.1-10, 2009.
DOI : 10.1145/1645164.1645171

URL : https://hal.archives-ouvertes.fr/hal-00677806/document

J. R. Balderrama, T. Truong-huu, and J. Montagnat, Scalable and Resilient Workflow Executions on Production Distributed Computing Infrastructures, 2012 11th International Symposium on Parallel and Distributed Computing, pp.119-126, 2012.
DOI : 10.1109/ISPDC.2012.24

URL : https://hal.archives-ouvertes.fr/hal-00691579

A. Gaignard, J. Montagnat, B. Gibaud, G. Forestier, and T. Glatard, Domain-specific summarisation of Life-Science e-experiments from provenance traces, Web Semantics: Science, Services and Agents on the World Wide Web, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01027596

E. Deelman, K. Vahi, G. Juve, M. Rynge, S. Callaghan et al., Pegasus, a workflow management system for science automation, Future Generation Computer Systems, vol.46, pp.46-63, 2015.
DOI : 10.1016/j.future.2014.10.008

E. Deelman, G. Singh, M. Su, J. Blythe, Y. Gil et al., Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed Systems, Scientific Programming, vol.13, issue.3, pp.219-237, 2005.
DOI : 10.1155/2005/128026

URL : http://doi.org/10.1155/2005/128026

E. Deelman, K. Vahi, M. Rynge, G. Juve, R. Mayani et al., Pegasus in the Cloud: Science Automation through Workflow Technologies, IEEE Internet Computing, vol.20, issue.1, pp.70-76, 2016.
DOI : 10.1109/MIC.2016.15

B. Abbott, R. Abbott, T. Abbott, M. Abernathy, F. Acernese et al., GW150914: First results from the search for binary black hole coalescence with Advanced LIGO, Physical Review D, vol.93, issue.12, p.122003, 2016.
DOI : 10.1088/0264-9381/21/5/060

URL : https://hal.archives-ouvertes.fr/in2p3-01273253

M. Wilde, M. Hategan, J. M. Wozniak, B. Clifford, D. S. Katz et al., Swift: A language for distributed parallel scripting, Parallel Computing, vol.37, issue.9, pp.633-652, 2011.
DOI : 10.1016/j.parco.2011.05.005

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.300.4929

T. G. Armstrong, J. M. Wozniak, M. Wilde, and I. T. Foster, Compiler Techniques for Massively Scalable Implicit Task Parallelism, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis, pp.299-310, 2014.
DOI : 10.1109/SC.2014.30

J. Ozik, N. Collier, J. Wozniak, C. Macal, C. Cockrell et al., High performance model exploration of mutation patterns in an agent-based model of colorectal cancer

J. M. Wozniak, K. Chard, B. Blaiszik, R. Osborn, M. Wilde et al., Big data remote access interfaces for light source science, Big Data Computing (BDC), pp.51-60, 2015.

K. Williamson, Blue Waters supercomputer used to create 3D elevation models for White House Arctic Initiative URL http://www.ncsa.illinois.edu/news/story/blue waters supercomputer used to create 3 d elevation models for white house Taverna: a tool for building and running workflows of services, Nucleic Acids Research, pp.729-732

T. M. Oinn, M. Addis, J. Ferris, D. Marvin, M. Senger et al., Taverna: a tool for the composition and enactment of bioinformatics workflows, Bioinformatics, vol.20, issue.17, pp.3045-3054, 2004.
DOI : 10.1093/bioinformatics/bth361

K. Wolstencroft, R. Haines, D. Fellows, A. R. Williams, D. Withers et al., The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud, Nucleic Acids Research, vol.41, issue.W1, pp.557-561, 2013.
DOI : 10.1093/nar/gkt328

A. R. Hardisty, F. Bacall, N. Beard, M. Balcázar-vargas, B. Balech et al., BioVeL: a virtual laboratory for data analysis and modelling in biodiversity science and ecology, BMC Ecology, vol.1, issue.(Web Server iss, p.49, 2016.
DOI : 10.1080/14888386.2015.1068709

J. Ruiz, J. Garrido, J. Santander-vela, S. Sánchez-expósito, and L. Verdes-montenegro, Astrotavernabuilding workflows with virtual observatory services, Astronomy and Computing, vol.7, pp.3-11, 2014.
DOI : 10.1016/j.ascom.2014.09.002

J. Zhao, C. Wroe, C. A. Goble, R. Stevens, D. Quan et al., Using Semantic Web Technologies for Representing E-science Provenance, Proceedings Lecture Notes in Computer Science, vol.3298, pp.92-106, 2004.
DOI : 10.1007/978-3-540-30475-3_8

C. Wroe, R. Stevens, C. A. Goble, A. Roberts, and R. M. Greenwood, A SUITE OF DAML+OIL ONTOLOGIES TO DESCRIBE BIOINFORMATICS WEB SERVICES AND DATA, International Journal of Cooperative Information Systems, vol.38, issue.02, pp.197-224, 2003.
DOI : 10.1093/bioinformatics/15.6.510

D. De-roure, C. A. Goble, and R. Stevens, The design and realisation of the Virtual Research Environment for social sharing of workflows, Future Generation Computer Systems, vol.25, issue.5, pp.561-567, 2009.
DOI : 10.1016/j.future.2008.06.010

C. A. Goble, J. Bhagat, S. Aleksejevs, D. Cruickshank, D. T. Michaelides et al., myExperiment: a repository and social network for the sharing of bioinformatics workflows, Nucleic Acids Research, vol.38, issue.suppl_2, pp.677-682, 2010.
DOI : 10.1093/nar/gkq429

D. Rogers, I. Harvey, T. T. Huu, K. Evans, T. Glatard et al., Bundle and Pool Architecture for Multi-Language, Robust, Scalable Workflow Executions, Journal of Grid Computing, vol.5, issue.1, pp.457-480, 2013.
DOI : 10.1088/1742-6596/219/6/062049

URL : https://hal.archives-ouvertes.fr/hal-00832221

K. Vahi, I. Harvey, T. Samak, D. Gunter, K. Evans et al., A Case Study into Using Common Real-Time Workflow Monitoring Infrastructure for Scientific Workflows, Journal of Grid Computing, vol.3, issue.3?4, pp.381-406, 2013.
DOI : 10.1007/s10723-005-9011-7

Y. Gil, P. A. González-calero, J. Kim, J. Moody, and V. Ratnakar, A semantic framework for automatic generation of computational workflows using distributed data and component catalogues, Journal of Experimental & Theoretical Artificial Intelligence, vol.8, issue.4, pp.389-467, 2011.
DOI : 10.1002/cpe.1050

Y. Gil, D. Garijo, V. Ratnakar, R. Mayani, R. Adusumilli et al., Towards continuous scientific data analysis and hypothesis evolution, Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp.4406-4414, 2017.