N. A. Cressie and C. K. Wikle, Statistics for spatio-temporal data, ser. Wiley series in probability and statistics, 2011.

M. T. Bahadori, Q. R. Yu, and Y. Liu, Fast multivariate spatio-temporal analysis via low rank tensor learning, Advances in Neural Information Processing Systems, pp.3491-3499, 2014.

H. Koppula and A. Saxena, Learning spatio-temporal structure from rgbd videos for human activity detection and anticipation, Proceedings of ICML, 2013.

Y. Bengio, Neural net language models, Scholarpedia, vol.3, issue.1, p.3881, 2008.

J. Chung, K. Kastner, L. Dinh, K. Goel, A. C. Courville et al., A recurrent latent variable model for sequential data, Advances in Neural Information Processing Systems, vol.28, pp.2980-2988, 2015.

Y. Li, D. Tarlow, M. Brockschmidt, and R. Zemel, Gated graph sequence neural networks, 2015.

X. Shi, Z. Chen, H. Wang, D. Yeung, W. Wong et al., Convolutional lstm network: A machine learning approach for precipitation nowcasting, Advances in Neural Information Processing Systems, vol.28, pp.802-810, 2015.

N. Srivastava, E. Mansimov, and R. Salakhutdinov, Unsupervised learning of video representations using lstms, Proceedings of the 32nd ICML-15, 2015.

N. Kalchbrenner, A. Oord, K. Simonyan, I. Danihelka, O. Vinyals et al., Video pixel networks, Proceedings of the 34th ICML-17, 2017.

M. L. Stein, Interpolation of spatial data: some theory for kriging, 2012.

J. G. De Gooijer and R. J. Hyndman, 25 years of time series forecasting, International journal of forecasting, vol.22, issue.3, pp.443-473, 2006.

K. R. Müller, A. J. Smola, G. Rätsch, B. Schölkopf, J. Kohlmorgen et al., Using support vector machines for time series prediction, Advances in kernel methods: support vector learning, 1999.

J. T. Connor, R. D. Martin, and L. E. Atlas, Recurrent neural networks and robust time series prediction, IEEE Transactions on Neural Networks, vol.5, issue.2, pp.240-254, 1994.

A. Graves, A. Mohamed, and G. Hinton, Speech recognition with deep recurrent neural networks, IEEE ICASSP, 2013.

I. Sutskever, J. Martens, and G. E. Hinton, Generating text with recurrent neural networks, Proceedings of the 28th International Conference on Machine Learning, 2011.

K. Cho, B. van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares et al., Learning phrase representations using rnn encoder-decoder for statistical machine translation, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01433235

P. Mirowski and Y. Lecun, Dynamic factor graphs for time series modeling, Machine Learning and Knowledge Discovery in Databases, pp.128-143, 2009.

D. P. Kingma and M. Welling, Auto-encoding variational bayes, Proceedings of the 2nd International Conference on Learning Representations (ICLR), 2013.

J. Bayer and C. Osendorfer, Learning stochastic recurrent networks, 2014.

J. Chung, K. Kastner, L. Dinh, K. Goel, A. C. Courville et al., A recurrent latent variable model for sequential data, Advances in Neural Information Processing Systems, vol.28, pp.2980-2988, 2015.

R. G. Krishnan, U. Shalit, and D. Sontag, Deep kalman filters, 2015.

C. K. Wikle and M. B. Hooten, A general science-based framework for dynamical spatio-temporal models, Test, vol.19, issue.3, pp.417-451, 2010.

C. K. Wikle, Modern perspectives on statistics for spatio-temporal data, Wiley Interdisciplinary Reviews: Computational Statistics, vol.7, issue.1, pp.86-98, 2015.

H. Rue and L. Held, Gaussian Markov random fields: theory and applications, 2005.

M. Ceci, R. Corizzo, F. Fumarola, D. Malerba, and A. Rashkovska, Predictive modeling of pv energy production: How to set up the learning task for a better prediction, IEEE Transactions on Industrial Informatics, vol.13, issue.3, pp.956-966, 2017.

G. Dornhege, B. Blankertz, M. Krauledat, F. Losch, G. Curio et al., Optimizing spatio-temporal filters for improving brain-computer interfacing, 2005.

Y. Ren and Y. Wu, Convolutional deep belief networks for feature extraction of eeg signal, Neural Networks (IJCNN), 2014 International Joint Conference on, pp.2850-2853, 2014.

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural computation, vol.9, issue.8, pp.1735-1780, 1997.

I. Sutskever, J. Martens, G. Dahl, and G. Hinton, On the importance of initialization and momentum in deep learning, Proceedings of the 30th ICML, 2013.

S. Ben Taieb and R. Hyndman, Boosting multi-step autoregressive forecasts, Proceedings of The 31st International Conference on Machine Learning, pp.109-117, 2014.

G. Ganeshapillai, J. Guttag, and A. Lo, Learning connections in financial time series, Proceedings of the 30th International Conference on Machine Learning (ICML-13), pp.109-117, 2013.

J. Yuan, Y. Zheng, X. Xie, and G. Sun, Driving with knowledge from the physical world, Proceedings of the 17th ACM SIGKDD, pp.316-324, 2011.

J. Yuan, Y. Zheng, C. Zhang, W. Xie, X. Xie et al., T-drive: driving directions based on taxi trajectories, Proceedings of the 18th SIGSPATIAL, pp.99-108, 2010.