Alert button
Picture for Nan Jiang

Nan Jiang

Alert button

ROUTERBENCH: A Benchmark for Multi-LLM Routing System

Mar 18, 2024
Qitian Jason Hu, Jacob Bieker, Xiuyu Li, Nan Jiang, Benjamin Keigwin, Gaurav Ranganath, Kurt Keutzer, Shriyash Kaustubh Upadhyay

Viaarxiv icon

Scaling Up Dynamic Human-Scene Interaction Modeling

Mar 13, 2024
Nan Jiang, Zhiyuan Zhang, Hongjie Li, Xiaoxuan Ma, Zan Wang, Yixin Chen, Tengyu Liu, Yixin Zhu, Siyuan Huang

Viaarxiv icon

On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation

Feb 22, 2024
Yuheng Zhang, Nan Jiang

Viaarxiv icon

A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference

Feb 11, 2024
Chenlu Ye, Wei Xiong, Yuheng Zhang, Nan Jiang, Tong Zhang

Viaarxiv icon

Vertical Symbolic Regression via Deep Policy Gradient

Feb 01, 2024
Nan Jiang, Md Nasim, Yexiang Xue

Figure 1 for Vertical Symbolic Regression via Deep Policy Gradient
Figure 2 for Vertical Symbolic Regression via Deep Policy Gradient
Figure 3 for Vertical Symbolic Regression via Deep Policy Gradient
Figure 4 for Vertical Symbolic Regression via Deep Policy Gradient
Viaarxiv icon

Harnessing Density Ratios for Online Reinforcement Learning

Jan 18, 2024
Philip Amortila, Dylan J. Foster, Nan Jiang, Ayush Sekhari, Tengyang Xie

Viaarxiv icon

Vertical Symbolic Regression

Dec 19, 2023
Nan Jiang, Md Nasim, Yexiang Xue

Viaarxiv icon

Gibbs Sampling from Human Feedback: A Provable KL- constrained Framework for RLHF

Dec 18, 2023
Wei Xiong, Hanze Dong, Chenlu Ye, Han Zhong, Nan Jiang, Tong Zhang

Viaarxiv icon

Nova$^+$: Generative Language Models for Binaries

Nov 27, 2023
Nan Jiang, Chengxiao Wang, Kevin Liu, Xiangzhe Xu, Lin Tan, Xiangyu Zhang

Viaarxiv icon

Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture

Nov 01, 2023
Yixin Chen, Junfeng Ni, Nan Jiang, Yaowei Zhang, Yixin Zhu, Siyuan Huang

Viaarxiv icon