Kianté Brantley

Dataset Reset Policy Optimization for RLHF

Apr 16, 2024
Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

Adversarial Imitation Learning via Boosting

Apr 12, 2024
Jonathan D. Chang, Dhruv Sreenivas, Yingbing Huang, Kianté Brantley, Wen Sun

RL for Consistency Models: Faster Reward Guided Text-to-Image Generation

Mar 25, 2024
Owen Oertell, Jonathan D. Chang, Yiyi Zhang, Kianté Brantley, Wen Sun

A Surprising Failure? Multimodal LLMs and the NLVR Challenge

Feb 26, 2024
Anne Wu, Kianté Brantley, Yoav Artzi

Reviewer2: Optimizing Review Generation Through Prompt Generation

Feb 16, 2024
Zhaolin Gao, Kianté Brantley, Thorsten Joachims

Policy-Gradient Training of Language Models for Ranking

Oct 06, 2023
Ge Gao, Jonathan D. Chang, Claire Cardie, Kianté Brantley, Thorsten Joachims

Ranking with Long-Term Constraints

Jul 10, 2023
Kianté Brantley, Zhichong Fang, Sarah Dean, Thorsten Joachims

Interactive Text Generation

Mar 17, 2023
Felix Faltings, Michel Galley, Baolin Peng, Kianté Brantley, Weixin Cai, Yizhe Zhang, Jianfeng Gao, Bill Dolan
