A. Alexandrov, M. F. Ionescu, K. E. Schauser, and C. J. Scheiman, LogGP, Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures , SPAA '95, pp.95-105, 1995.
DOI : 10.1145/215399.215427

B. Allcock, J. Bester, J. Bresnahan, A. L. Chervenak, I. Foster et al., Efficient data transport and replica management for high-performance dataintensive computing, IEEE Mass Storage Conference, 2001.

M. Nedim, A. Alpdemir, A. Mukherjee, N. W. Gounaris, P. Paton et al., Ogsa-dqp : A service for distributed querying on the grid Advances in Database Technology -EDBT, pp.858-861, 2004.

E. R. Carlos, E. Alves, . Cáceres, K. H. Frank, S. W. Dehne et al., A cgm/bsp parallel similarity algorithm, pp.1-8, 2002.

M. Antonioletti, M. Atkinson, R. Baxter, A. Borley, N. P. Hong et al., The design and implementation of Grid database services in OGSA-DAI, Concurrency and Computation: Practice and Experience, vol.43, issue.2-4
DOI : 10.1002/cpe.939

M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. H. Katz et al., Above the clouds : A berkeley view of cloud computing, 2009.

M. Bamha, An Optimal Skew-insensitive Join and Multi-join Algorithm for Distributed Architectures, Proceedings of the International Conference on Database and Expert Systems Applications (DEXA'2005). 22-26 August, pp.616-625, 2005.
DOI : 10.1007/11546924_60

M. Bamha and G. Hains, A Skew-Insensitive Algorithm for Join and Multi-join Operations on Shared Nothing Machines, the 11th International Conference on Database and Expert Systems Applications DEXA'2000, 2000.
DOI : 10.1007/3-540-44469-6_60

M. Bamha and G. Hains, An Efficient Equi-semi-join Algorithm for Distributed Architectures, Proceedings of the 5th International Conference on Computational Science (ICCS'2005). 22-25 May, pp.755-763, 2005.
DOI : 10.1007/11428848_97

URL : https://hal.archives-ouvertes.fr/hal-00081352

M. Bamha and G. Hains, A frequency adaptive join algorithm for Shared Nothing machines, Appears also in Progress in Computer Research, F. Columbus Ed, pp.333-345, 1999.

M. Bamha and M. Exbrayat, Pipelining a skew-insensitive parallel join algorithm. Parallel Processing Letters, pp.317-328, 2003.
URL : https://hal.archives-ouvertes.fr/hal-00081339

G. Bilardi, K. T. Herley, A. Pietracaprina, G. Pucci, and P. Spirakis, BSP vs LogP, Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures , SPAA '96, pp.25-32, 1996.
DOI : 10.1145/237502.237504

H. Boral, W. Alexander, L. Clay, G. Copeland, S. Danforth et al., Prototyping Bubba, a highly parallel database system, IEEE Transactions on Knowledge and Data Engineering, vol.2, issue.1, pp.4-24, 1990.
DOI : 10.1109/69.50903

F. Cappello, E. Caron, M. Dayde, F. Desprez, Y. Jegou et al., Grid'5000: a large scale and highly reconfigurable grid experimental testbed, The 6th IEEE/ACM International Workshop on Grid Computing, 2005., pp.99-106, 2005.
DOI : 10.1109/GRID.2005.1542730

URL : https://hal.archives-ouvertes.fr/hal-00684943

L. Carter and M. N. Wegman, Universal classes of hash functions, Journal of Computer and System Sciences, vol.18, issue.2, pp.143-154, 1979.
DOI : 10.1016/0022-0000(79)90044-8

S. Chaudhuri and K. Shim, Including group-by in query optimization, VLDB '94 : Proceedings of the 20th International Conference on Very Large Data Bases, pp.354-366, 1994.

M. Chen and P. S. Yu, Combining joint and semi-join operations for distributed query processing, IEEE Transactions on Knowledge and Data Engineering, vol.5, issue.3, pp.534-542, 1993.
DOI : 10.1109/69.224205

M. S. Chen, M. L. Lo, P. S. Yu, and H. C. Young, Using segmented right-deep trees for the execution of pipelined hash joins, VLDB '92 : Proceedings of the 18th International Conference on Very Large Data Bases, pp.15-26, 1992.

S. Christodoulakis, Implications of certain assumptions in database performance evauation, ACM Transactions on Database Systems, vol.9, issue.2, pp.163-186, 1984.
DOI : 10.1145/329.318578

E. F. Codd, A relational model of data for large shared data banks, Communications of the ACM, vol.13, issue.6, pp.377-387, 1970.
DOI : 10.1145/362384.362685

D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser et al., LogP: towards a realistic model of parallel computation, ACM SIGPLAN Notices, vol.28, issue.7, pp.1-12, 1993.
DOI : 10.1145/173284.155333

A. Datta, B. Moon, and H. Thomas, A case for parallelism in datawarehousing and OLAP, Ninth International Workshop on Database and Expert Systems Applications , DEXA 98, pp.226-231, 1998.

J. Dean and S. Ghemawat, MapReduce, OSDI ' 04 : Sixth Symposium on Operating System Design and Implementation, 2004.
DOI : 10.1145/1327452.1327492

F. Dehne, A. Fabri, and A. Rau-chaplin, Scalable parallel geometric algorithms for coarse grained multicomputers, Proceedings of the ninth annual symposium on Computational geometry , SCG '93, pp.298-307, 1993.
DOI : 10.1145/160985.161154

URL : https://hal.archives-ouvertes.fr/inria-00074853

F. Dehne, W. Dittrich, and D. Hutchinson, Efficient external memory algorithms by simulating coarse-grained parallel algorithms, SPAA '97 : Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, pp.106-115, 1997.

K. H. Frank, W. Dehne, D. A. Dittrich, A. Hutchinson, and . Maheshwari, Bulk synchronous parallel algorithms for the external memory model, Theory Comput. Syst, vol.35, issue.6, pp.567-597, 2002.

D. J. Dewitt, J. F. Naughton, D. A. Schneider, and S. Seshadri, Practical Skew Handling in Parallel Joins, Proceedings of the 18th VLDB Conference, pp.27-40, 1992.

J. David, J. Dewitt, and . Gray, Parallel database systems : The future of high performance database systems, Commun. ACM, vol.35, issue.6, pp.85-98, 1992.

D. J. Dewitt, S. Ghandeharizadeh, D. A. Schneider, A. Bricker, H. Hsiao et al., The Gamma database machine project, IEEE Transactions on Knowledge and Data Engineering, vol.2, issue.1, pp.44-62, 1990.
DOI : 10.1109/69.50905

J. Eisenbiegler, W. Löwe, and W. Zimmermann, BSP, LogP, and oblivious programs, Euro-Par '98 : Proceedings of the 4th International Euro-Par Conference on Parallel Processing, pp.865-874, 1998.
DOI : 10.1007/BFb0057942

J. Michael and . Flynn, Very high-speed computing systems, Proceeding of the IEEE, pp.1901-1909, 1966.

S. Fortune and J. Wyllie, Parallelism in random access machines, Proceedings of the tenth annual ACM symposium on Theory of computing , STOC '78, pp.114-118, 1978.
DOI : 10.1145/800133.804339

I. Foster and C. Kesselman, The Grid : Blueprint for a Future Computing Infrastructure, 1999.

I. Foster, What is the grid ? -a three point checklist, GRIDtoday, vol.1, issue.6, 2002.

I. T. Foster, Globus Toolkit Version 4: Software for Service-Oriented Systems, Lecture Notes in Computer Science, vol.3779, pp.2-13, 2005.
DOI : 10.1007/11577188_2

I. T. Foster, Y. Zhao, I. Raicu, and S. Lu, Cloud computing and grid computing 360-degree compared. CoRR, abs/0901, p.131, 2009.

F. Gava, External Memory in Bulk Synchronous Parallel ML. Scalable Computing : Practice and Experience, pp.43-70, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00109451

S. Ghemawat, H. Gobioff, and S. Leung, The google file system, SOSP '03 : Proceedings of the nineteenth ACM symposium on Operating systems principles, pp.29-43, 2003.

P. B. Gibbons, Y. Matias, and V. Ramachandran, The qrqw pram : accounting for contention in parallel algorithms, SODA '94 : Proceedings of the fifth annual ACM-SIAM symposium on Discrete algorithms, pp.638-648, 1994.

P. B. Gibbons, Y. Matias, and V. Ramachandran, The Queue-Read Queue-Write PRAM Model: Accounting for Contention in Parallel Algorithms, SIAM Journal on Computing, vol.28, issue.2, pp.733-769, 1999.
DOI : 10.1137/S009753979427491

A. Gounaris, Resource aware query processing on the grid, 2005.

A. Gounaris, J. Smith, N. W. Paton, R. Sakellariou, A. A. Fernandes et al., Adapting to Changing Resource Performance in Grid Query Processing, Data Management in Grids, First VLDB Workshop, DMG 2005, pp.30-44, 2005.
DOI : 10.1007/11611950_4

R. L. Graham, Bounds on Multiprocessing Timing Anomalies, SIAM Journal on Applied Mathematics, vol.17, issue.2, pp.416-429, 1969.
DOI : 10.1137/0117039

A. Gupta, V. Harinarayan, and D. Quass, Aggregate-query processing in data warehousing environments, Proceedings of the 21th International Conference on Very Large Data Bases, pp.358-369, 1995.

J. Han and M. Kamber, Data Mining, 2000.
DOI : 10.1007/978-1-4899-7993-3_104-2

L. Harada and M. Kitsuregawa, Dynamic join product skew handling for hash-joins in shared-nothing database systems, Proceedings of the 4th International Conference on Database Systems for Advanced Applications (DASFAA), pp.246-255, 1995.

M. A. , H. Hassan, and M. Bamha, An efficient pipelined parallel join algorithm on heterogeneous distributed architectures Extended and revised version of "ICSOFT, In Communications in Computer and Information Science, vol.47, pp.119-133, 2008.

M. A. , H. Hassan, and M. Bamha, Pipelined parallelism in multijoin queries on heterogeneous shared nothing architectures, pp.127-134, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00460656

M. A. , H. Hassan, and M. Bamha, An efficient parallel algorithm for evaluating join queries on heterogeneous distributed systems, HiPC 2009 : Proceedings of the 16th international conference on high performance computing, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00460647

M. A. , H. Hassan, and M. Bamha, Semi-join computation on distributed file systems using map-reduce-merge model, Proceedings of the 25th Symposium On Applied Computing, pp.22-26, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00460665

M. A. , H. Hassan, and M. Bamha, Parallel processing of 'group-by join' queries on Shared Nothing machines, Proceedings of the International Conference on Software and Data Technologies. Also appears in 'ICSOFT Best papers Book, pp.301-307, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00460664

M. A. , H. Hassan, and M. Bamha, An optimal evaluation of groupby-join queries in distributed architectures, Proceedings of the third International Conference on Web Information Systems and Technologies, pp.246-252, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00462208

K. A. Hua and C. Lee, Handling data skew in multiprocessor database computers using partition tuning, Proc. of the 17th International Conference on Very Large Data Bases, pp.525-535, 1991.

A. Kien, W. Hua, H. C. Tavanapong, and . Young, A performance evaluation of load balancing techniques for join operations on multicomputer database systems, ICDE '95 : Proceedings of the Eleventh International Conference on Data Engineering, pp.44-51, 1995.

R. Huebsch, J. M. Hellerstein, N. Lanham, . Boon-thau, S. Loo et al., Querying the Internet with PIER, VLDB '2003 : Proceedings of the 29th international conference on Very large data bases, pp.321-332, 2003.
DOI : 10.1016/B978-012722442-8/50036-7

Y. Jiang, K. H. Liu, and C. H. Leung, Parallel Algorithms for Queries with Aggregate Functions in the Presence of Data Skew, HiPC '99 : Proceedings of the 6th International Conference on High Performance Computing, pp.207-211, 1999.
DOI : 10.1007/978-3-540-46642-0_30

H. Karatza and R. Hilzer, Load sharing in heterogeneous distributed systems, Proceedings of the Winter Simulation Conference, pp.489-496, 2002.
DOI : 10.1109/WSC.2002.1172921

M. Kitsuregawa and Y. Ogawa, Bucket spreading parallel hash : A new, robust, parallel hash join method for skew in the super database computer (SDC), Very Large Data Bases : 16th International Conference on Very Large Data Bases, pp.210-221, 1990.

M. Kitsuregawa, H. Tanaka, and T. Moto-oka, Application of hash to data base machine and its architecture, New Generation Computing, vol.22, issue.1, pp.63-74, 1983.
DOI : 10.1007/BF03037022

R. Lämmel, Google???s MapReduce programming model ??? Revisited, Science of Computer Programming, vol.70, issue.1, pp.208-237, 2007.
DOI : 10.1016/j.scico.2007.07.001

C. Li, K. Chen-chuan, I. F. Chang, and . Ilyas, Efficient processing of ad-hoc top-k aggregate queries in olap, 2005.

W. Liang and M. E. Orlowska, Computing multidimensional aggregates in parallel, Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250), 2000.
DOI : 10.1109/ICPADS.1998.741024

B. Liu and E. A. Rundensteiner, Revisiting pipelined parallelism in multi-join query processing, VLDB '05 : Proceedings of the 31st international conference on Very large data bases, pp.829-840, 2005.

W. Löwe, W. Zimmermann, and J. Eisenbiegler, On linear schedules of task graphs for generalized logp-machines, Euro-Par '97 : Proceedings of the Third International Euro-Par Conference on Parallel Processing, pp.895-904, 1997.

H. Lu and K. Tan, Dynamic and load-balanced task-oriented database query processing in parallel systems, Proceedings of Advances in Database Technology (EDBT '92), pp.357-372, 1992.
DOI : 10.1007/BFb0032442

S. Lynden, A. Mukherjee, A. C. Hume, A. A. Fernandes, N. W. Paton et al., The design and implementation of OGSA-DQP: A service-based distributed query processor, Future Generation Computer Systems, vol.25, issue.3
DOI : 10.1016/j.future.2008.08.003

P. S. Yu, M. Chen, and K. Wu, Scheduling and processor allocation for the execution of multi-join queries, International Conference on Data Engineering

W. Mach and E. Schikuta, Parallel Database Join Operations in Heterogeneous Grids, Eighth International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2007), pp.236-243, 2007.
DOI : 10.1109/PDCAT.2007.79

A. N. Mourad, R. J. Morris, A. Swami, and H. C. Young, Limits of parallelism in hash join algorithms. Performance evaluation, pp.301-316, 1994.

N. W. Paton, J. B. Chávez, M. Chen, V. Raman, G. Swart et al., Autonomic query parallelization using non-dedicated computers: an evaluation of adaptivity options, The VLDB Journal, vol.53, issue.3, pp.119-140, 2009.
DOI : 10.1007/s00778-007-0090-x

R. Pike, S. Dorward, R. Griesemer, and S. Quinlan, Interpreting the Data: Parallel Analysis with Sawzall, Scientific Programming, vol.13, issue.4, 2005.
DOI : 10.1155/2005/962135

E. Rahm, Dynamic load balancing in parallel database systems, Proc. of EURO-PAR, pp.37-52, 1996.
DOI : 10.1007/3-540-61626-8_4

W. Vijayshankar-raman, I. Han, and . Narang, Parallel querying with non-dedicated computers, VLDB '05 : Proceedings of the 31st international conference on Very large data bases, pp.61-72, 2005.

K. Sattler, E. Buchmann, and K. Böhm, A physical query algebra for dht-based p2p systems, 6th Workshop on Distributed Data and Structures (WDAS'2004, pp.8-9, 2004.

D. A. Schneider and D. J. Dewitt, Tradeoffs in processing complex join queries via hashing in multiprocessor data-base machines, International Conference On Very Large Data Bases (VLDB '90), pp.469-480, 1990.

A. Donovan, D. J. Schneider, and . Dewitt, A performance evaluation of four parallel join algorithms in a shared-nothing multiprocessor environment, Proceedings of the 1989 ACM SIGMOD international conference on Management of data, pp.110-121, 1989.

M. Seetha and P. S. Yu, Effectiveness of parallel joins, IEEE Transactions on Knowledge and Data Engineering, vol.2, issue.4, pp.410-424, 1990.
DOI : 10.1109/69.63253

A. Shatdal and J. F. Naughton, Adaptive parallel aggregation algorithms, SIG- MOD Record (ACM Special Interest Group on Management of Data), vol.24, issue.2, pp.104-114, 1995.

A. Shatdal and J. F. Naughton, Processing aggregates in parallel database systems, 1994.

J. Smith and P. Watson, Fault-Tolerance in Distributed Query Processing, 9th International Database Engineering & Application Symposium (IDEAS'05), pp.329-338, 2005.
DOI : 10.1109/IDEAS.2005.29

K. Stocker, D. Kossmann, R. Braumandl, and A. Kemper, Integrating semi-join-reducers into state-of-the-art query processors, Proceedings 17th International Conference on Data Engineering, pp.575-584, 2001.
DOI : 10.1109/ICDE.2001.914872

D. Taniar, Y. Jiang, K. H. Liu, and C. H. Leung, Aggregate-join query processing in parallel database systems, Proceedings Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region, pp.824-829, 2000.
DOI : 10.1109/HPC.2000.843554

D. Taniar and J. W. Rahayu, Parallel processing of "GroupBy-Before-Join" queries in cluster architecture, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.178-185, 2001.
DOI : 10.1109/CCGRID.2001.923191

D. Taniar and W. Rahayu, Parallel "GroupBy-Before-Join" Query Processing for High Performance Parallel/Distributed Database Systems, 20th International Conference on Advanced Information Networking and Applications, Volume 1 (AINA'06), pp.693-700, 2006.
DOI : 10.1109/AINA.2006.256

D. Taniar, R. Boon-noi-tan, C. H. Leung, and K. H. Liu, Performance analysis of ???Groupby-After-Join??? query processing in parallel database systems, Information Sciences, vol.168, issue.1-4, pp.25-50, 2004.
DOI : 10.1016/j.ins.2003.09.029

D. Taniar, C. H. Leung, J. W. Rahayu, and S. Goel, High Performance Parallel Database Processing and Grid Databases, 2008.
DOI : 10.1002/9780470391365

D. Taniar, C. H. Leung, W. Rahayu, and S. Goel, High Performance Parallel Database Processing and Grid Databases, 2008.
DOI : 10.1002/9780470391365

L. G. Valiant, A bridging model for parallel computation, Communications of the ACM, vol.33, issue.8, pp.103-111, 1990.
DOI : 10.1145/79173.79181

C. B. Walton, A. G. Dale, and R. M. Jenevein, A taxonomy and performance model of data skew effects in parallel joins, Proceedings of the 17th Conference on Very Large Databases, 1991.

R. Wankar, Grid computing with globus : An overview and research challenges, IJCSA, vol.5, issue.3, pp.56-69, 2008.

N. Annita, P. M. Wilschut, and . Apers, Dataflow query execution in a parallel main-memory environment, PDIS '91 : Proceedings of the first international conference on Parallel and distributed information systems, pp.68-77, 1991.

N. W. Annita, P. M. Flokstra, and . Apers, Parallel evaluation of multi-join queries. SIGMOD Rec, pp.115-126, 1995.

J. L. Wolf, D. M. Dias, P. S. Yu, and J. Turek, New algorithms for parallelizing relational database joins in the presence of data skew, IEEE Transactions on Knowledge and Data Engineering, vol.6, issue.6, pp.990-997, 1994.
DOI : 10.1109/69.334888

P. Weipeng, P. Yan, and . Larson, Performing group-by before join, Proceedings of the Tenth International Conference on Data Engineering, pp.89-100, 1994.

P. Weipeng, P. Yan, and . Larson, Eager aggregation and lazy aggregation, VLDB '95 : Proceedings of the 21th International Conference on Very Large Data Bases, pp.345-357, 1995.

D. Yang, J. Li, and Q. Rasool, Join Algorithm Using Multiple Replicas in Data Grid, WAIM, pp.416-427, 2005.
DOI : 10.1007/11563952_37

H. Yang, A. Dasdan, R. Hsiao, and D. Parker, Map-reducemerge : simplified relational data processing on large clusters, SIGMOD '07 : Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp.1029-1040, 2007.

H. Zeller and J. Gray, An adaptive hash join algorithm for multiuser environments, Proceedings of the sixteenth international conference on Very large databases, pp.186-197, 1990.

X. Zhang, T. Kurc, T. Pan, U. Catalyurek, S. Narayanan et al., Strategies for using additional resources in parallel hash-based join algorithms, Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004., pp.4-13, 2004.
DOI : 10.1109/HPDC.2004.1323471

G. K. Zipf, Human Behavior and the Principle of Least Effort : An Introduction to Human Ecology, 1949.