C. Augonnet and R. Namyst, A unified runtime system for heterogeneous multicore architectures, Proceedings of the International Euro-Par Workshops, HPPC'08, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00326917

C. Augonnet, S. Thibault, and R. Namyst, Automatic Calibration of Performance Models on Heterogeneous Multicore Architectures, Proceedings of the International Euro-Par Workshops, HPPC'09, 2009.
DOI : 10.1007/978-3-642-14122-5_9
URL : https://hal.archives-ouvertes.fr/inria-00421333

C. Augonnet, S. Thibault, R. Namyst, and M. Nijhuis, Exploiting the Cell/BE Architecture with the StarPU Unified Runtime System, SAMOS Workshop, 2009.
DOI : 10.1007/978-3-642-03138-0_36
URL : https://hal.archives-ouvertes.fr/inria-00378705

. Ortí, A proposal to extend the OpenMP tasking model for heterogeneous architectures, IWOMP '09: Proceedings of the 5th International Workshop on OpenMP, pp.154-167, 2009.

E. Ayguadé, R. M. Badia, F. D. Igual, J. Labarta, R. Mayo et al., An Extension of the StarSs Programming Model for Platforms with Multiple GPUs, Proceedings of the 15th Euro-Par Conference, 2009.
DOI : 10.1109/TPDS.2003.1214317

C. Banino, O. Beaumont, L. Carter, J. Ferrante, A. Legrand et al., Scheduling strategies for master-slave tasking on heterogeneous processor platforms, IEEE Transactions on Parallel and Distributed Systems, vol.15, issue.4, pp.319-330, 2004.
DOI : 10.1109/TPDS.2004.1271181
URL : https://hal.archives-ouvertes.fr/hal-00789427

G. F. Diamos and S. Yalamanchili, Harmony: an execution model and runtime for heterogeneous many core systems, HPDC '08: Proceedings of the 17th international symposium on High performance distributed computing, pp.197-200, 2008.

R. Dolbeau, S. Bihan, and F. Bodin, HMPP: A hybrid multi-core parallel programming environment, 2007.

J. Planas, R. M. Badia, E. Ayguadé, and J. Labarta, Hierarchical Task-Based Programming With StarSs, International Journal of High Performance Computing Applications, vol.23, issue.3, p.284, 2009.
DOI : 10.1177/1094342009106195

G. Teodoro, R. Sachetto, O. Sertel, M. Gurcan, W. Meira Jr. et al., Coordinating the use of GPU and CPU for improving performance of compute intensive applications, 2009 IEEE International Conference on Cluster Computing and Workshops, 2009.
DOI : 10.1109/CLUSTR.2009.5289193

S. Tomov, J. Dongarra, and M. Baboulin, Towards dense linear algebra for hybrid GPU accelerated manycore systems, Parallel Computing, vol.36, issue.5-6, 2009.
DOI : 10.1016/j.parco.2009.12.005

H. Topcuoglu, S. Hariri, and M. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.

V. Volkov and J. W. Demmel, Benchmarking GPUs to tune dense linear algebra, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-11, 2008.
DOI : 10.1109/SC.2008.5214359

L. Wesolowski, An application programming interface for general purpose graphics processing units in an asynchronous runtime system, 2008.

R. C. Whaley and J. Dongarra, Automatically Tuned Linear Algebra Software, Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999.