Alert button
Picture for Stephen McAleer

Stephen McAleer

Alert button

Grasper: A Generalist Pursuer for Pursuit-Evasion Problems

Add code
Bookmark button
Alert button
Apr 19, 2024
Pengdeng Li, Shuxin Li, Xinrun Wang, Jakub Cerny, Youzhi Zhang, Stephen McAleer, Hau Chan, Bo An

Viaarxiv icon

AgentKit: Flow Engineering with Graphs, not Coding

Add code
Bookmark button
Alert button
Apr 17, 2024
Yue Wu, Yewen Fan, So Yeon Min, Shrimai Prabhumoye, Stephen McAleer, Yonatan Bisk, Ruslan Salakhutdinov, Yuanzhi Li, Tom Mitchell

Viaarxiv icon

Policy Space Response Oracles: A Survey

Add code
Bookmark button
Alert button
Mar 04, 2024
Ariyan Bighashdel, Yongzhao Wang, Stephen McAleer, Rahul Savani, Frans A. Oliehoek

Figure 1 for Policy Space Response Oracles: A Survey
Figure 2 for Policy Space Response Oracles: A Survey
Viaarxiv icon

Scalable Mechanism Design for Multi-Agent Path Finding

Add code
Bookmark button
Alert button
Jan 30, 2024
Paul Friedrich, Yulun Zhang, Michael Curry, Ludwig Dierks, Stephen McAleer, Jiaoyang Li, Tuomas Sandholm, Sven Seuken

Viaarxiv icon

AI Alignment: A Comprehensive Survey

Add code
Bookmark button
Alert button
Nov 01, 2023
Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen McAleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao

Viaarxiv icon

Llemma: An Open Language Model For Mathematics

Add code
Bookmark button
Alert button
Oct 16, 2023
Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, Marco Dos Santos, Stephen McAleer, Albert Q. Jiang, Jia Deng, Stella Biderman, Sean Welleck

Figure 1 for Llemma: An Open Language Model For Mathematics
Figure 2 for Llemma: An Open Language Model For Mathematics
Figure 3 for Llemma: An Open Language Model For Mathematics
Figure 4 for Llemma: An Open Language Model For Mathematics
Viaarxiv icon

Confronting Reward Model Overoptimization with Constrained RLHF

Add code
Bookmark button
Alert button
Oct 10, 2023
Ted Moskovitz, Aaditya K. Singh, DJ Strouse, Tuomas Sandholm, Ruslan Salakhutdinov, Anca D. Dragan, Stephen McAleer

Viaarxiv icon

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Add code
Bookmark button
Alert button
Jul 22, 2023
Yongyuan Liang, Yanchao Sun, Ruijie Zheng, Xiangyu Liu, Tuomas Sandholm, Furong Huang, Stephen McAleer

Figure 1 for Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Figure 2 for Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Figure 3 for Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Figure 4 for Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Viaarxiv icon

Policy Space Diversity for Non-Transitive Games

Add code
Bookmark button
Alert button
Jun 29, 2023
Jian Yao, Weiming Liu, Haobo Fu, Yaodong Yang, Stephen McAleer, Qiang Fu, Wei Yang

Figure 1 for Policy Space Diversity for Non-Transitive Games
Figure 2 for Policy Space Diversity for Non-Transitive Games
Figure 3 for Policy Space Diversity for Non-Transitive Games
Figure 4 for Policy Space Diversity for Non-Transitive Games
Viaarxiv icon