F. Agakov, E. Bonilla, J. Cavazos, B. Franke, G. Fursin et al., Using Machine Learning to Focus Iterative Optimization, International Symposium on Code Generation and Optimization (CGO'06), pp.295-305, 2006.
DOI : 10.1109/CGO.2006.37

D. Barthou, J. Collard, and P. Feautrier, Fuzzy Array Dataflow Analysis, Journal of Parallel and Distributed Computing, vol.40, issue.2, pp.210-226, 1997.
DOI : 10.1006/jpdc.1996.1261

URL : https://hal.archives-ouvertes.fr/hal-00551673

C. Bastoul, Code generation in the polyhedral model is easier than you think, Proceedings of the 13th International Conference on Parallel Architecture and Compilation Techniques (PACT 2004), pp.7-16, 2004.
DOI : 10.1109/PACT.2004.1342537

URL : https://hal.archives-ouvertes.fr/hal-00017260

M. Benabderrahmane, L. Pouchet, A. Cohen, and C. Bastoul, The Polyhedral Model Is More Widely Applicable Than You Think, Intl. Conf. on Compiler Construction (ETAPS CC'10), pp.283-303, 2010.
DOI : 10.1007/978-3-642-11970-5_16

URL : https://hal.archives-ouvertes.fr/inria-00551087

F. Bodin, T. Kisuki, P. M. Knijnenburg, M. F. O'Boyle, and E. Rohou, Iterative compilation in a non-linear optimisation space, W. on Profile and Feedback Directed Compilation, 1998.
URL : https://hal.archives-ouvertes.fr/inria-00475919

U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev et al., Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model, International conference on Compiler Construction (ETAPS CC), 2008.
DOI : 10.1007/978-3-540-78791-4_9

U. Bondhugula, O. Gunluk, S. Dash, and L. Renganarayanan, A model for fusion and code motion in an automatic parallelizing compiler, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pp.343-352, 2010.
DOI : 10.1145/1854273.1854317

U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, A practical automatic polyhedral program optimization system, ACM SIGPLAN Conference on Programming Language Design and Implementation, 2008.

C. Chen, J. Chame, and M. Hall, CHiLL: A framework for composing high-level loop transformations, 2008.

P. Clauss, Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: applications to analyze and transform scientific programs, Proc. of the Intl. Conf. on Supercomputing, pp.278-285, 1996.
URL : https://hal.archives-ouvertes.fr/hal-01100306

A. Cohen, S. Girbal, D. Parello, M. Sigler, O. Temam et al., Facilitating the search for compositions of program transformations, Proceedings of the 19th annual international conference on Supercomputing, ICS '05, pp.151-160, 2005.
DOI : 10.1145/1088149.1088169

URL : https://hal.archives-ouvertes.fr/hal-01257296

A. Darte, On the complexity of loop fusion, Parallel Computing, pp.149-157, 1999.

A. Darte and G. Huard, Loop shifting for loop parallelization, 2000.

A. Darte, G. Silber, and F. Vivien, Combining Retiming and Scheduling Techniques for Loop Parallelization and Loop Tiling, Parallel Proc. Letters, pp.379-392, 1997.
DOI : 10.1142/S0129626497000383

URL : https://hal.archives-ouvertes.fr/hal-00856890

P. Feautrier, Parametric integer programming, RAIRO - Operations Research, vol.22, issue.3, pp.243-268, 1988.
DOI : 10.1051/ro/1988220302431

P. Feautrier, Dataflow analysis of array and scalar references, International Journal of Parallel Programming, vol.20, issue.1, pp.23-53, 1991.
DOI : 10.1007/BF01407931

P. Feautrier, Some efficient solutions to the affine scheduling problem. Part I. One-dimensional time, International Journal of Parallel Programming, vol.21, issue.5, pp.313-348, 1992.
DOI : 10.1007/BF01407835

P. Feautrier, Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time, International Journal of Parallel Programming, vol.21, issue.6, pp.389-420, 1992.
DOI : 10.1007/BF01379404

F. Franchetti, Y. Voronenko, and M. Püschel, Formal loop merging for signal transforms, ACM SIGPLAN Conf. on Programming Language Design and Implementation, pp.315-326, 2005.

S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello et al., Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies, International Journal of Parallel Programming, vol.34, issue.3, pp.261-317, 2006.
DOI : 10.1007/s10766-006-0012-3

URL : https://hal.archives-ouvertes.fr/hal-01257288

A. Guillou, F. Quilleré, P. Quinton, S. Rajopadhye, and T. Risset, Hardware design methodology with the Alpha language, FDL'01, 2001.

F. Irigoin and R. Triolet, Supernode partitioning, Proceedings of the 15th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, POPL '88, pp.319-329, 1988.
DOI : 10.1145/73560.73588

W. Kelly, Optimization within a Unified Transformation Framework, 1996.

K. Kennedy and K. McKinley, Maximizing loop parallelism and improving data locality via loop fusion and distribution, Languages and Compilers for Parallel Computing, pp.301-320, 1993.
DOI : 10.1007/3-540-57659-2_18

I. Kodukula, N. Ahmed, and K. Pingali, Data-centric multi-level blocking, ACM SIGPLAN'97 Conf. on Programming Language Design and Implementation, pp.346-357, 1997.

M. Kudlur and S. Mahlke, Orchestrating the execution of stream programs on multicore platforms, ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI'08), pp.114-124, 2008.

R. Lethin, A. Leung, B. Meister, N. Vasilache, D. Wohlford et al., R-stream compiler, Encyclopedia of Parallel Computing, 2011.

K. S. McKinley, S. Carr, and C. Tseng, Improving data locality with loop transformations, ACM Transactions on Programming Languages and Systems, vol.18, issue.4, pp.424-453, 1996.
DOI : 10.1145/233561.233564

N. Megiddo and V. Sarkar, Optimal weighted loop fusion for parallel programs, Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, SPAA '97, pp.282-291, 1997.
DOI : 10.1145/258492.258520

A. Nisbet, GAPS: A compiler framework for genetic algorithm (GA) optimised parallelisation, Proc. of the Intl. Conf. and Exhibition on High-Performance Computing and Networking, pp.987-989, 1998.
DOI : 10.1007/BFb0037253

L. Pouchet, Iterative Optimization in the Polyhedral Model, 2010.

L. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos, Iterative optimization in the polyhedral model: Part II, multidimensional time, ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI'08), pp.90-100, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01257273

L. Pouchet, U. Bondhugula, C. Bastoul, A. Cohen, J. Ramanujam et al., Combined Iterative and Model-driven Optimization in an Automatic Parallelization Framework, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2010.
DOI : 10.1109/SC.2010.14

URL : https://hal.archives-ouvertes.fr/inria-00551067

A. Qasem and K. Kennedy, Profitable loop fusion and tiling using model-driven empirical search, Proceedings of the 20th annual international conference on Supercomputing, ICS '06, pp.249-258, 2006.
DOI : 10.1145/1183401.1183437

J. Ramanujam and P. Sadayappan, Tiling multidimensional iteration spaces for multicomputers, Journal of Parallel and Distributed Computing, vol.16, issue.2, pp.108-120, 1992.
DOI : 10.1016/0743-7315(92)90027-K

M. Ren, J. Y. Park, M. Houston, A. Aiken, and W. J. Dally, A tuning framework for software-managed memory hierarchies, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, PACT '08, pp.280-291, 2008.
DOI : 10.1145/1454115.1454155

L. Renganarayanan, D. Kim, S. Rajopadhye, and M. M. Strout, Parameterized tiled loops for free, SIGPLAN Notices, Proc. of the 2007 PLDI Conf, pp.405-414, 2007.

A. Schrijver, Theory of linear and integer programming, 1986.

S. Singhai and K. McKinley, A Parametrized Loop Fusion Algorithm for Improving Parallelism and Cache Locality, The Computer Journal, vol.40, issue.6, pp.340-355, 1997.
DOI : 10.1093/comjnl/40.6.340

N. J. Sloane, Sequence A000670, The On-Line Encyclopedia of Integer Sequences.

M. Stephenson, S. Amarasinghe, M. Martin, and U. O'Reilly, Meta optimization, ACM SIGPLAN Notices, vol.38, issue.5, pp.77-90, 2003.
DOI : 10.1145/780822.781141

A. Tiwari, C. Chen, J. Chame, M. Hall, and J. K. Hollingsworth, A scalable auto-tuning framework for compiler optimization, IPDPS'09, 2009.

N. Vasilache, Scalable Program Optimization Techniques in the Polyhedral Model, 2007.

S. Verdoolaege, F. Catthoor, M. Bruynooghe, and G. Janssens, Feasibility of incremental translation, 2002.

Y. Voronenko, F. de Mesmay, and M. Püschel, Computer Generation of General Size Linear Transform Libraries, 2009 International Symposium on Code Generation and Optimization, 2009.
DOI : 10.1109/CGO.2009.33

R. C. Whaley, A. Petitet, and J. J. Dongarra, Automated empirical optimizations of software and the ATLAS project, Parallel Computing, vol.27, issue.1-2, pp.3-35, 2001.
DOI : 10.1016/S0167-8191(00)00087-9

M. Wolf, D. Maydan, and D. Chen, Combining loop transformations considering caches and scheduling, Proceedings of the 29th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 29, pp.274-286, 1996.
DOI : 10.1109/MICRO.1996.566468

M. Wolfe, More iteration space tiling, Proceedings of the 1989 ACM/IEEE conference on Supercomputing, Supercomputing '89, pp.655-664, 1989.
DOI : 10.1145/76263.76337

M. Wolfe, High performance compilers for parallel computing, 1995.