Ryan Lowe
OpenAI
Verified email at openai.com
Title
Cited by
Year
Training language models to follow instructions with human feedback
L Ouyang, J Wu, X Jiang, D Almeida, C Wainwright, P Mishkin, C Zhang, ...
Advances in neural information processing systems 35, 27730-27744, 2022
Cited by: 9132 | Year: 2022
Multi-agent actor-critic for mixed cooperative-competitive environments
R Lowe, YI Wu, A Tamar, J Harb, P Abbeel, I Mordatch
Advances in neural information processing systems 30, 2017
Cited by: 5266 | Year: 2017
Gpt-4 technical report
J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...
arXiv preprint arXiv:2303.08774, 2023
Cited by: 4145 | Year: 2023
How not to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation
CW Liu, R Lowe, IV Serban, M Noseworthy, L Charlin, J Pineau
arXiv preprint arXiv:1603.08023, 2016
Cited by: 1545 | Year: 2016
Learning to summarize with human feedback
N Stiennon, L Ouyang, J Wu, D Ziegler, R Lowe, C Voss, A Radford, ...
Advances in Neural Information Processing Systems 33, 3008-3021, 2020
Cited by: 1497 | Year: 2020
A hierarchical latent variable encoder-decoder model for generating dialogues
I Serban, A Sordoni, R Lowe, L Charlin, J Pineau, A Courville, Y Bengio
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
Cited by: 1308 | Year: 2017
The ubuntu dialogue corpus: A large dataset for research in unstructured multi-turn dialogue systems
R Lowe, N Pow, I Serban, J Pineau
arXiv preprint arXiv:1506.08909, 2015
Cited by: 1149 | Year: 2015
An actor-critic algorithm for sequence prediction
D Bahdanau, P Brakel, K Xu, A Goyal, R Lowe, J Pineau, A Courville, ...
arXiv preprint arXiv:1607.07086, 2016
Cited by: 727 | Year: 2016
A survey of available corpora for building data-driven dialogue systems
IV Serban, R Lowe, P Henderson, L Charlin, J Pineau
arXiv preprint arXiv:1512.05742, 2015
Cited by: 445 | Year: 2015
Towards an automatic turing test: Learning to evaluate dialogue responses
R Lowe, M Noseworthy, IV Serban, N Angelard-Gontier, Y Bengio, ...
arXiv preprint arXiv:1708.07149, 2017
Cited by: 424 | Year: 2017
The second conversational intelligence challenge (convai2)
E Dinan, V Logacheva, V Malykh, A Miller, K Shuster, J Urbanek, D Kiela, ...
The NeurIPS'18 Competition: From Machine Learning to Intelligent …, 2020
Cited by: 380 | Year: 2020
Recursively summarizing books with human feedback
J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano
arXiv preprint arXiv:2109.10862, 2021
Cited by: 227 | Year: 2021
Training language models to follow instructions with human feedback, 2022
L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ...
URL https://arxiv.org/abs/2203.02155 13, 1, 2022
Cited by: 218 | Year: 2022
Ethical challenges in data-driven dialogue systems
P Henderson, K Sinha, N Angelard-Gontier, NR Ke, G Fried, R Lowe, ...
Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 123-129, 2018
Cited by: 195 | Year: 2018
Training end-to-end dialogue systems with the ubuntu dialogue corpus
R Lowe, N Pow, IV Serban, L Charlin, CW Liu, J Pineau
Dialogue & Discourse 8 (1), 31-65, 2017
Cited by: 185 | Year: 2017
On the pitfalls of measuring emergent communication
R Lowe, J Foerster, YL Boureau, J Pineau, Y Dauphin
arXiv preprint arXiv:1903.05168, 2019
Cited by: 144 | Year: 2019
Generative deep neural networks for dialogue: A short review
IV Serban, R Lowe, L Charlin, J Pineau
arXiv preprint arXiv:1611.06216, 2016
Cited by: 106 | Year: 2016
Training language models to follow instructions with human feedback. arXiv
L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ...
arXiv preprint arXiv:2203.02155, 2022
Cited by: 104 | Year: 2022
Learning an unreferenced metric for online dialogue evaluation
K Sinha, P Parthasarathi, J Wang, R Lowe, WL Hamilton, J Pineau
arXiv preprint arXiv:2005.00583, 2020
Cited by: 94 | Year: 2020
On the evaluation of dialogue systems with next utterance classification
R Lowe, IV Serban, M Noseworthy, L Charlin, J Pineau
arXiv preprint arXiv:1605.05414, 2016
Cited by: 79 | Year: 2016