References

Bahdanau, D., Cho, K., and Bengio, Y. (2014) Neural Machine Translation by Jointly Learning to Align and Translate link
Cho, K., van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, and Bengio, Y. (2014) Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation link
Gulcehre, C., Firat, O., Xu, K., Cho, K., Barrault, L., Lin, H., Bougares, F., Schwenk, H., and Bengio, Y. (2015) On Using Monolingual Corpora in Neural Machine Translation link
Li, J., Galley, M., Brockett, C., Spithourakis, G., Gao, J., and Dolan, B. (2016) A Persona-Based Neural Conversation Model (User specific information) link
Luong, M., Pham, H., and Manning, C. (2015) Effective Approaches to Attention-based Neural Machine Translation link
Sennrich, R., Haddow, B., and Birch, A. (2016) Improving Neural Machine Translation Models with Monolingual Data (Incorporating monolingual data) link
Sutskever, I., Vinyals, O., and Le, Q. (2014) Sequence to Sequence Learning with Neural Networks link
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A., Kaiser, L., and Polosukhin, I. (2017) Attention is All You Need. link

Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. (2014) link
Lamb, A., Goyal, A., Zhang, Y., Zhang, S., Courville, A., and Bengio, Y. (2016) Professor Forcing: A New Algorithm for Training Recurrent Networks link
Yu, L., Zhang, W., Wang, J., and Yu, Y. (2017) SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient link