H. Akaike, Statistical predictor identification, Annals of the Institute for Statistical Mathematics, vol.22, pp.203-217, 1970.

S. Arlot, Choosing a penalty for model selection in heteroscedastic regression, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00347811

S. Arlot and P. Massart, Data-driven calibration of penalties for least-squares regression, Journal of Machine Learning Research, vol.10, pp.245-279, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00287631

Y. Baraud, Model selection for regression on a fixed design, Probability Theory and Related Fields, vol.117, pp.467-493, 2000.

Y. Baraud, Model selection for regression on a random design, ESAIM: Probability and Statistics, vol.6, pp.127-146, 2002.

Y. Baraud, F. Comte, and G. Viennet, Adaptive estimation in autoregression or ?-mixing regression via model selection, Annals of Statistics, vol.29, issue.3, pp.839-875, 2001.

L. Birgé and P. Massart, From model selection to adaptive estimation, Festschrift for Lucien Lecam: Research Papers in Probability and Statistics, pp.55-87, 1997.

L. Birgé and P. Massart, Minimum contrast estimators on sieves: exponential bounds and rates of convergence, Bernoulli, vol.4, pp.329-375, 1998.

L. Birgé and P. Massart, An adaptive compression algorithm in Besov spaces, Constructive Approximation, vol.16, pp.1-36, 2000.

L. Birgé and P. Massart, Gaussian model selection, Journal of the European Mathematical Society, vol.3, issue.3, pp.203-268, 2001.

L. Birgé and P. Massart, Minimal penalties for gaussian model selection, Probability Theory and Related Fields, vol.138, pp.33-73, 2007.

L. Breiman and J. H. Friedman, Estimating optimal transformations for multiple regression and correlations (with discussion), Journal of the American Statistical Association, vol.80, issue.391, pp.580-619, 1985.

E. Brunel and F. Comte, Adaptive nonparametric regression estimation in presence of right censoring, Mathematical Methods of Statistics, vol.15, issue.3, pp.233-255, 2006.

E. Brunel and F. Comte, Model selection for additive regression models in the presence of censoring, Mathematical Methods in Survival Analysis, Reliability and Quality of Life, pp.17-31, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00264450

A. Buja, T. J. Hastie, and R. J. Tibshirani, Linear smoothers and additive models (with discussion), Annals of Statistics, vol.17, pp.453-555, 1989.

F. Comte and Y. Rozenholc, Adaptive estimation of mean and volatility functions in (auto-)regressive models, Stochastic Processes and Their Applications, vol.97, pp.111-145, 2002.
URL : https://hal.archives-ouvertes.fr/hal-00170763

X. Gendre, Simultaneous estimation of the mean and the variance in heteroscedastic gaussian regression, Electronic Journal of Statistics, vol.2, pp.1345-1372, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00298436

W. Härdle, M. Müller, S. Sperlich, and A. Werwatz, Nonparametric and Semiparametric Models, 2004.

T. J. Hastie and R. J. Tibshirani, Generalized additive models, 1990.

R. A. Horn and C. R. Johnson, Matrix analysis, 1990.

B. Laurent, J. M. Loubes, and C. Marteau, Testing inverse problems: a direct or an indirect problem, Journal of Statistical Planning and Inference, vol.141, pp.1849-1861, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00528909

B. Laurent and P. Massart, Adaptive estimation of a quadratic functional by model selection, Annals of Statistics, vol.28, issue.5, pp.1302-1338, 2000.

W. Leontief, Introduction to a theory of the internal structure of functional relationships, Econometrica, vol.15, pp.361-373, 1947.

O. Linton and J. P. Nielsen, A kernel method of estimating structured nonparametric regression based on marginal integration, Biometrika, vol.82, pp.93-101, 1995.

C. L. Mallows, Some comments on cp. Technometrics, vol.15, pp.661-675, 1973.

E. Mammen, O. Linton, and J. P. Nielsen, The existence and asymptotic properties of a backfitting projection algorithm under weak conditions, Annals of Statistics, vol.27, pp.1443-1490, 1999.

P. Massart, Concentration inequalities and model selection, Lectures from the 33rd Summer School on Probability Theory, vol.1896, 2003.

A. D. Mcquarrie and C. L. Tsai, Regression and times series model selection, 1998.

L. Meier, S. Van-de-geer, and P. Bühlmann, High-dimensional additive modeling, Annals of Statistics, vol.37, pp.3779-3821, 2009.

J. Opsomer and D. Ruppert, Fitting a bivariate additive model by local polynomial regression, Annals of Statistics, vol.25, pp.186-211, 1997.

V. V. Petrov, Limit theorems of probability theory: sequences of independent random variables, Oxford Studies in Probability, vol.4, 1995.

P. D. Ravikumar, H. Liu, J. D. Lafferty, and L. A. Wasserman, Sparse additive models, Journal of the Royal Statistical Society, vol.71, pp.1009-1030, 2009.

S. Robin, F. Rodolphe, and S. Schbath, DNA, Words and Models, 2005.

D. Ruppert and M. P. Wand, Multivariate locally weighted least squares regression, Annals of Statistics, vol.22, issue.3, pp.1346-1370, 1994.

H. Scheffé, The analysis of variance, 1959.

E. Severance-lossin and S. Sperlich, Estimation of derivatives for additive separable models, Statistics, vol.33, pp.241-265, 1999.

C. J. Stone, Additive regression and other nonparametric models, Annals of Statistics, vol.14, issue.2, pp.590-606, 1985.

D. Tjøstheim and B. Auestad, Nonparametric identification of nonlinear time series: Selecting significant lags, Journal of the American Statistical Association, vol.89, pp.1410-1430, 1994.

B. Bahr and C. G. Esseen, Inequalities for the rth absolute moment of a sum of random variables 1 r 2, Annals of Mathematical Statistics, vol.36, pp.299-303, 1965.