Neural machine translation by jointly learning to align and translate
D Bahdanau, K Cho, Y Bengio
ICLR 2015, 2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
K Cho, B van Merrienboer, C Gulcehre, F Bougares, H Schwenk, ...
Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), 2014
Empirical evaluation of gated recurrent neural networks on sequence modeling
J Chung, C Gulcehre, K Cho, Y Bengio
arXiv preprint arXiv:1412.3555, 2014
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
K Xu, J Ba, R Kiros, K Cho, A Courville, R Salakhutdinov, R Zemel, ...
International Conference on Machine Learning, 2015
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
K Cho, B van Merrienboer, D Bahdanau, Y Bengio
Eighth Workshop on Syntax, Semantics and Structure in Statistical …, 2014
Attention-based models for speech recognition
J Chorowski, D Bahdanau, D Serdyuk, K Cho, Y Bengio
The Twenty-ninth Annual Conference on Neural Information Processing Systems …, 2015
On the Number of Linear Regions of Deep Neural Networks
G Montúfar, R Pascanu, K Cho, Y Bengio
Advances in Neural Information Processing Systems (NIPS) 27, 2014
Recurrent neural networks for multivariate time series with missing values
Z Che, S Purushotham, K Cho, D Sontag, Y Liu
Scientific reports 8 (1), 6085, 2018
Identifying and attacking the saddle point problem in high-dimensional non-convex optimization
Y Dauphin, R Pascanu, C Gulcehre, K Cho, S Ganguli, Y Bengio
Advances in Neural Information Processing Systems (NIPS) 27, 2014
How to Construct Deep Recurrent Neural Networks
R Pascanu, C Gulcehre, K Cho, Y Bengio
International Conference on Learning Representations (ICLR), 2014
Describing videos by exploiting temporal structure
L Yao, A Torabi, K Cho, N Ballas, C Pal, H Larochelle, A Courville
International Conference on Computer Vision, 2015
On Using Very Large Target Vocabulary for Neural Machine Translation
S Jean, K Cho, R Memisevic, Y Bengio
ACL 2015, 2014
Gated Feedback Recurrent Neural Networks
J Chung, C Gulcehre, K Cho, Y Bengio
International Conference on Machine Learning, 2015
Theano: A Python framework for fast computation of mathematical expressions
R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, N Ballas, ...
arXiv e-prints, arXiv:1605.02688, 2016
Passage Re-ranking with BERT
R Nogueira, K Cho
arXiv preprint arXiv:1901.04085, 2019
Unsupervised neural machine translation
M Artetxe, G Labaka, E Agirre, K Cho
ICLR, 2018
Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism
O Firat, K Cho, Y Bengio
NAACL, 2016
Joint event extraction via recurrent neural networks
TH Nguyen, K Cho, R Grishman
Proceedings of the 2016 conference of the North American chapter of the …, 2016
On integrating a language model into neural machine translation
C Gulcehre, O Firat, K Xu, K Cho, Y Bengio
Computer Speech & Language 45, 137-148, 2017
