J. F. Kingman, The effect of queue discipline on waiting time variance, Mathematical Proceedings of the Cambridge Philosophical Society, vol.58, issue.1, pp.163-164, 1962.

J. F. Kingman, Some inequalities for the queue GI/G/1, Biometrika, vol.49, issue.3-4, pp.315-324, 1962.

P. Humblet, Determinism Minimizes Waiting Time in Queues, ser. LIDS-P-1207. Laboratory for Information and Decision Systems, 1982.

Z. Liu and R. Righter, Optimal load balancing on distributed homogeneous unreliable processors, Operations Research, vol.46, issue.4, pp.563-573, 1998.
URL : https://hal.archives-ouvertes.fr/inria-00074030

M. Harchol-Balter, M. E. Crovella, and C. D. Murta, On choosing a task assignment policy for a distributed server system, Journal of Parallel and Distributed Computing, vol.59, issue.2, pp.204-228, 1999.

J. Anselmi and J. Doncel, Asymptotically optimal size-interval task assignments, IEEE Transactions on Parallel and Distributed Systems
URL : https://hal.archives-ouvertes.fr/hal-02318576

D. Gamarnik, J. N. Tsitsiklis, and M. Zubeldia, Delay, memory, and messaging tradeoffs in distributed service systems, ser. SIGMETRICS '16, pp.1-12, 2016.

W. Winston, Optimality of the shortest line discipline, Journal of Applied Probability, vol.14, issue.1, pp.181-189, 1977.

R. R. Weber, On the optimal assignment of customers to parallel servers, Journal of Applied Probability, vol.15, issue.2, pp.406-413, 1978.

M. Mitzenmacher, The power of two choices in randomized load balancing, IEEE Trans. Parallel Distrib. Syst, vol.12, issue.10, pp.1094-1104, 2001.

D. Mukherjee, S. C. Borst, J. S. van Leeuwaarden, and P. A. Whiting, Asymptotic Optimality of Power-of-d Load Balancing in Large-Scale Systems, 2016.

J. Anselmi and F. Dufour, Power-of-d-choices with memory: Fluid limit and optimality, Mathematics of Operations Research

Y. Lu, Q. Xie, G. Kliot, A. Geller, J. R. Larus et al., Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services, Perform. Eval, vol.68, issue.11, pp.1056-1071, 2011.

A. L. Stolyar, Pull-based load distribution among heterogeneous parallel servers: The case of multiple routers, Queueing Syst. Theory Appl, vol.85, issue.1-2, pp.31-65, 2017.

M. El-Taha and B. Maddah, Allocation of service time in a multiserver system, Management Science, vol.52, issue.4, pp.623-637, 2006.

Q. Zhang, A. Riska, W. Sun, E. Smirni, and G. Ciardo, Workload-aware load balancing for clustered web servers, IEEE Transactions on Parallel and Distributed Systems, vol.16, issue.3, pp.219-233, 2005.

K. Oida and K. Shinjo, Characteristics of deterministic optimal routing for a simple traffic control problem, Proceedings of the IEEE International Performance Computing and Communications Conference, pp.386-392, 1999.

G. Ciardo, A. Riska, and E. Smirni, EquiLoad: A load balancing policy for clustered web servers, Perform. Eval, vol.46, issue.2-3, pp.101-124, 2001.

M. Harchol-Balter, A. Scheller-Wolf, and A. R. Young, Surprising results on task assignment in server farms with high-variability workloads, ser. SIGMETRICS '09, pp.287-298, 2009.

B. Schroeder and M. Harchol-Balter, Evaluation of task assignment policies for supercomputing servers: The case for load unbalancing and fairness, Cluster Computing, vol.7, issue.2, pp.151-161, 2004.

L. Cherkasova and M. Karlsson, Scalable web server cluster design with workload-aware request distribution strategy WARD, Proc. 3rd Int, pp.212-221, 2001.

M. Harchol-Balter, Task assignment with unknown duration, J. ACM, vol.49, issue.2, pp.260-288, 2002.

A. Riska, W. Sun, E. Smirni, and G. Ciardo, AdaptLoad: effective balancing in clustered web servers under transient load conditions, Proceedings 22nd International Conference on Distributed Computing Systems, pp.104-111, 2002.

E. Bachmat and A. Natanzon, Analysis of SITA queues with many servers and spacetime geometry, Eval. Rev, vol.40, issue.3, pp.92-94, 2012.

W. Willinger, M. S. Taqqu, R. Sherman, and D. V. Wilson, Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level, IEEE/ACM Transactions on Networking, vol.5, issue.1, pp.71-86, 1997.

M. E. Crovella and A. Bestavros, Self-similarity in World Wide Web traffic: evidence and possible causes, IEEE/ACM Transactions on Networking, vol.5, issue.6, pp.835-846, 1997.

M. E. Crovella, M. S. Taqqu, and A. Bestavros, A practical guide to heavy tails, pp.3-25, 1998.

M. Mitzenmacher, Analyzing distributed join-idle-queue: A fluid limit approach, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp.312-318, 2016.

E. Bachmat and H. Sarfati, Analysis of SITA policies, Perform. Eval, vol.67, issue.2, pp.102-120, 2010.

J. Almeida, V. Almeida, D. Ardagna, C. Cunha, M. Francalanci et al., Joint admission control and resource allocation in virtualized servers, Journal of Parallel and Distributed Computing, vol.70, issue.4, pp.344-362, 2010.

M. Shaked and J. G. Shanthikumar, Stochastic orders and their applications. Academic Press, 1994.

S. Asmussen, Applied Probability and Queues, 1987.

L. Kleinrock, Queueing Systems, vol.2, 1976.