Alert button
Picture for Tuo Zhao

Tuo Zhao

Alert button

To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

Add code
Bookmark button
Alert button
Apr 06, 2024
Zi-Hao Qiu, Siqi Guo, Mao Xu, Tuo Zhao, Lijun Zhang, Tianbao Yang

Viaarxiv icon

Stochastic Constrained Decentralized Optimization for Machine Learning with Fewer Data Oracles: a Gradient Sliding Approach

Add code
Bookmark button
Alert button
Apr 03, 2024
Hoang Huy Nguyen, Yan Li, Tuo Zhao

Viaarxiv icon

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

Add code
Bookmark button
Alert button
Mar 11, 2024
Hao Kang, Qingru Zhang, Souvik Kundu, Geonhwa Jeong, Zaoxing Liu, Tushar Krishna, Tuo Zhao

Figure 1 for GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Figure 2 for GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Figure 3 for GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Figure 4 for GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Viaarxiv icon

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM

Add code
Bookmark button
Alert button
Mar 08, 2024
Hao Kang, Qingru Zhang, Souvik Kundu, Geonhwa Jeong, Zaoxing Liu, Tushar Krishna, Tuo Zhao

Figure 1 for GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Figure 2 for GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Figure 3 for GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Figure 4 for GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Viaarxiv icon

BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

Add code
Bookmark button
Alert button
Feb 16, 2024
Haoyu Wang, Tuo Zhao, Jing Gao

Viaarxiv icon

Data Diversity Matters for Robust Instruction Tuning

Add code
Bookmark button
Alert button
Nov 21, 2023
Alexander Bukharin, Tuo Zhao

Viaarxiv icon

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Add code
Bookmark button
Alert button
Nov 03, 2023
Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao

Viaarxiv icon

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms

Add code
Bookmark button
Alert button
Oct 30, 2023
Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao

Viaarxiv icon

Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult

Add code
Bookmark button
Alert button
Oct 26, 2023
Yuqing Wang, Zhenghao Xu, Tuo Zhao, Molei Tao

Viaarxiv icon

SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process

Add code
Bookmark button
Alert button
Oct 25, 2023
Zichong Li, Yanbo Xu, Simiao Zuo, Haoming Jiang, Chao Zhang, Tuo Zhao, Hongyuan Zha

Viaarxiv icon