Alert button
Picture for Tuo Zhao

Tuo Zhao

Alert button

Data Diversity Matters for Robust Instruction Tuning

Nov 21, 2023
Alexander Bukharin, Tuo Zhao

Viaarxiv icon

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Nov 03, 2023
Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao

Viaarxiv icon

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms

Oct 30, 2023
Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao

Viaarxiv icon

Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult

Oct 26, 2023
Yuqing Wang, Zhenghao Xu, Tuo Zhao, Molei Tao

Viaarxiv icon

SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process

Oct 25, 2023
Zichong Li, Yanbo Xu, Simiao Zuo, Haoming Jiang, Chao Zhang, Tuo Zhao, Hongyuan Zha

Viaarxiv icon

Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification

Oct 25, 2023
Zichong Li, Qunzhi Xu, Zhenghao Xu, Yajun Mei, Tuo Zhao, Hongyuan Zha

Viaarxiv icon

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Oct 23, 2023
Yixiao Li, Yifan Yu, Chen Liang, Pengcheng He, Nikos Karampatziakis, Weizhu Chen, Tuo Zhao

Figure 1 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 2 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 3 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 4 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Viaarxiv icon

Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer

Oct 19, 2023
Qingru Zhang, Dhananjay Ram, Cole Hawkins, Sheng Zha, Tuo Zhao

Viaarxiv icon

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms

Oct 16, 2023
Alexander Bukharin, Yan Li, Yue Yu, Qingru Zhang, Zhehui Chen, Simiao Zuo, Chao Zhang, Songan Zhang, Tuo Zhao

Viaarxiv icon