John Yang
Title
Cited by
Year
Webshop: Towards scalable real-world web interaction with grounded language agents
S Yao, H Chen, J Yang, K Narasimhan
NeurIPS 2022, 2022
Cited by: 305 · Year: 2022
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan
ICLR 2024, 2023
Cited by: 236 · Year: 2023
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
J Yang, CE Jimenez, A Wettig, K Lieret, S Yao, K Narasimhan, O Press
NeurIPS 2024, 2024
Cited by: 94* · Year: 2024
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback
J Yang, A Prabhakar, K Narasimhan, S Yao
NeurIPS 2023 (Datasets & Benchmarks), 2023
Cited by: 79 · Year: 2023
Language Agents as Hackers: Evaluating Cybersecurity Skills with Capture the Flag
J Yang, A Prabhakar, S Yao, K Pei, KR Narasimhan
Multi-Agent Security Workshop @ NeurIPS 2023, 2023
Cited by: 13 · Year: 2023
Devbench: A comprehensive benchmark for software development
B Li, W Wu, Z Tang, L Shi, J Yang, J Li, S Yao, C Qian, B Hui, Q Zhang, ...
arXiv preprint arXiv:2403.08604, 2024
Cited by: 10 · Year: 2024
Referral Augmentation for Zero-Shot Information Retrieval
M Tang, S Yao, J Yang, K Narasimhan
ACL 2024 (Findings), 2023
Cited by: 3 · Year: 2023
Quartz: A framework for engineering secure smart contracts
J Kolb, J Yang, RH Katz, DE Culler
EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS …, 2020
Cited by: 3 · Year: 2020
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
J Yang, CE Jimenez, AL Zhang, K Lieret, J Yang, X Wu, O Press, ...
arXiv preprint arXiv:2410.03859, 2024
2024
EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
T Abramovich, M Udeshi, M Shao, K Lieret, H Xi, K Milner, S Jancheska, ...
arXiv preprint arXiv:2409.16165, 2024
2024
Learning Language through Interactions with the Digital World
JB Yang
Princeton University, 2023
2023
Towards an Enhanced, Faithful, and Adaptable Web Interaction Environment
J Yang, H Chen, KR Narasimhan
Second Workshop on Language and Reinforcement Learning @ NeurIPS 2022, 2022
2022
Articles 1–12