D. Bahdanau, P. Brakel, K. Xu, A. Goyal, R. Lowe et al., An actor-critic algorithm for sequence prediction, 2016.

D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

Z. Cao, W. Li, S. Li, and F. Wei, Retrieve, rerank and rewrite: Soft template based neural summarization, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.152-161, 2018.

Z. Cao, F. Wei, W. Li, and S. Li, Faithful to the original: Fact aware neural abstractive summarization, Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI-18), the 30th innovative Applications of Artificial Intelligence (IAAI-18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI-18), pp.4784-4791, 2018.

A. Celikyilmaz, A. Bosselut, X. He, and Y. Choi, Deep communicating agents for abstractive summarization, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.1662-1675, 2018.

Y. Chen and M. Bansal, Fast abstractive summarization with reinforce-selected sentence rewriting, Proceedings of ACL, 2018.

J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, NIPS 2014 Workshop on Deep Learning, 2014.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.

E. Kiperwasser and M. Ballesteros, Scheduled multitask learning: From syntax to translation, Transactions of the Association for Computational Linguistics, vol.6, pp.225-240, 2018.

W. Kryscinski, R. Paulus, C. Xiong, and R. Socher, Improving abstraction in text summarization, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp.1808-1817, 2018.

A. N. Le, A. Martinez, A. Yoshimoto, and Y. Matsumoto, Improving sequence to sequence neural machine translation by utilizing syntactic dependency information, IJCNLP, 2017.

J. Li, D. Xiong, Z. Tu, M. Zhu, M. Zhang et al., Modeling source syntax for neural machine translation, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.688-697, 2017.

C. Lin, Rouge: A package for automatic evaluation of summaries, Text Summarization Branches Out: Proceedings of the ACL-04, 2004.

, Association for Computational Linguistics, pp.74-81

R. Nallapati, B. Zhou, C. Santos, C. Gulcehre, and B. Xiang, Abstractive text summarization using sequence-to-sequence RNNs and beyond, Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning, pp.280-290, 2016.

S. Narayan, S. B. Cohen, and M. Lapata, Ranking sentences for extractive summarization with reinforcement learning, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.1747-1759, 2018.

R. Pasunuru and M. Bansal, Multi-reward reinforced summarization with saliency and entailment, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.646-653, 2018.

R. Paulus, C. Xiong, and R. Socher, A deep reinforced model for abstractive summarization, Proceedings of the 6th International Conference on Learning Representations, 2018.

M. Ranzato, S. Chopra, M. Auli, and W. Zaremba, Sequence level training with recurrent neural networks, 4th International Conference on Learning Representations, 2016.

S. J. Rennie, E. Marcheret, Y. Mroueh, J. Ross, and V. Goel, Self-critical sequence training for image captioning, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1179-1195, 2017.

A. M. Rush, S. Chopra, and J. Weston, A neural attention model for abstractive sentence summarization, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp.379-389, 2015.

B. Sankaran, H. Mi, Y. Al-onaizan, and A. Ittycheriah, Temporal attention model for neural machine translation, 2016.

A. See, P. J. Liu, and C. D. Manning, Get to the point: Summarization with pointer-generator networks, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1073-1083, 2017.

R. Sennrich and B. Haddow, Linguistic input features improve neural machine translation, Proceedings of the First Conference on Machine Translation, pp.83-91, 2016.

I. Sutskever, O. Vinyals, and Q. V. Le, Sequence to sequence learning with neural networks, Advances in Neural Information Processing Systems 27, pp.3104-3112, 2014.

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 1998.

Z. Tu, Z. Lu, Y. Liu, X. Liu, and H. Li, Modeling coverage for neural machine translation, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.76-85, 2016.

R. J. Williams, Simple statistical gradient-following algorithms for connectionist reinforcement learning, Machine Learning, pp.229-256, 1992.

W. Xu, C. Napoles, E. Pavlick, Q. Chen, and C. Callison-burch, Optimizing statistical machine translation for text simplification, Transactions of the Association for Computational Linguistics, vol.4, pp.401-415, 2016.

W. Zaremba and I. Sutskever, Reinforcement learning neural turing machines, 2015.

X. Zhang and M. Lapata, Sentence simplification with deep reinforcement learning, Proceedings of EMNLP, 2017.