CodeIt: Self-improving language models with prioritized hindsight replay | N Butt, B Manczak, A Wiggers, C Rainone, DW Zhang, M Defferrard, ... | arXiv preprint arXiv:2402.04858, 2024 | 19 | 2024 |
BENCHAGENTS: Automated Benchmark Creation with Agent Interaction | N Butt, V Chandrasekaran, N Joshi, B Nushi, V Balachandran | arXiv preprint arXiv:2410.22584, 2024 | 1 | 2024 |
CodeIt: Abstract Reasoning with Iterative Policy-Guided Program Synthesis | N Butt, B Manczak, A Wiggers, C Rainone, DW Zhang, M Defferrard, ... | | 1 | 2023 |
Program synthesis for integer sequence generation | N Butt, A Wiggers, T Cohen, M Welling | | 1 | 2022 |
Towards Self-Improving Language Models for Code Generation | M Defferrard, C Rainone, DW Zhang, B Manczak, N Butt, T Cohen | ICLR 2024 Workshop on Large Language Model (LLM) Agents | | |