Nevan Wichers

Gradient-Based Language Model Red Teaming

Jan 30, 2024
Nevan Wichers, Carson Denison, Ahmad Beirami

DRLC: Reinforcement Learning with Dense Rewards from LLM Critic

Jan 14, 2024
Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng

Fusion-Eval: Integrating Evaluators with LLMs

Nov 15, 2023
Lei Shu, Nevan Wichers, Liangchen Luo, Yun Zhu, Yinxiao Liu, Jindong Chen, Lei Meng

SiRA: Sparse Mixture of Low Rank Adaptation

Nov 15, 2023
Yun Zhu, Nevan Wichers, Chu-Cheng Lin, Xinyi Wang, Tianlong Chen, Lei Shu, Han Lu, Canoee Liu, Liangchen Luo, Jindong Chen, Lei Meng

SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition

Feb 10, 2022
Dylan Slack, Yinlam Chow, Bo Dai, Nevan Wichers

ActionBert: Leveraging User Actions for Semantic Understanding of User Interfaces

Jan 25, 2021
Zecheng He, Srinivas Sunkara, Xiaoxue Zang, Ying Xu, Lijuan Liu, Nevan Wichers, Gabriel Schubiner, Ruby Lee, Jindong Chen, Blaise Agüera y Arcas

RL agents Implicitly Learning Human Preferences

Feb 14, 2020
Nevan Wichers

Resolving Spurious Correlations in Causal Models of Environments via Interventions

Feb 12, 2020
Sergei Volodin, Nevan Wichers, Jeremy Nixon

Resolving Referring Expressions in Images With Labeled Elements

Oct 25, 2018
Nevan Wichers, Dilek Hakkani-Tur, Jindong Chen
