Alert button
Picture for Liyuan Liu

Liyuan Liu

Alert button

Learning a Decision Tree Algorithm with Transformers

Add code
Bookmark button
Alert button
Feb 06, 2024
Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao

Viaarxiv icon

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Add code
Bookmark button
Alert button
Nov 03, 2023
Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao

Viaarxiv icon

Fast-ELECTRA for Efficient Pre-training

Add code
Bookmark button
Alert button
Oct 11, 2023
Chengyu Dong, Liyuan Liu, Hao Cheng, Jingbo Shang, Jianfeng Gao, Xiaodong Liu

Figure 1 for Fast-ELECTRA for Efficient Pre-training
Figure 2 for Fast-ELECTRA for Efficient Pre-training
Figure 3 for Fast-ELECTRA for Efficient Pre-training
Figure 4 for Fast-ELECTRA for Efficient Pre-training
Viaarxiv icon

Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

Add code
Bookmark button
Alert button
Oct 07, 2023
Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao

Figure 1 for Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Figure 2 for Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Figure 3 for Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Figure 4 for Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Viaarxiv icon

Sparse Backpropagation for MoE Training

Add code
Bookmark button
Alert button
Oct 01, 2023
Liyuan Liu, Jianfeng Gao, Weizhu Chen

Viaarxiv icon

Bridging Discrete and Backpropagation: Straight-Through and Beyond

Add code
Bookmark button
Alert button
Apr 17, 2023
Liyuan Liu, Chengyu Dong, Xiaodong Liu, Bin Yu, Jianfeng Gao

Figure 1 for Bridging Discrete and Backpropagation: Straight-Through and Beyond
Figure 2 for Bridging Discrete and Backpropagation: Straight-Through and Beyond
Figure 3 for Bridging Discrete and Backpropagation: Straight-Through and Beyond
Figure 4 for Bridging Discrete and Backpropagation: Straight-Through and Beyond
Viaarxiv icon

SoTeacher: A Student-oriented Teacher Network Training Framework for Knowledge Distillation

Add code
Bookmark button
Alert button
Jun 14, 2022
Chengyu Dong, Liyuan Liu, Jingbo Shang

Figure 1 for SoTeacher: A Student-oriented Teacher Network Training Framework for Knowledge Distillation
Figure 2 for SoTeacher: A Student-oriented Teacher Network Training Framework for Knowledge Distillation
Figure 3 for SoTeacher: A Student-oriented Teacher Network Training Framework for Knowledge Distillation
Figure 4 for SoTeacher: A Student-oriented Teacher Network Training Framework for Knowledge Distillation
Viaarxiv icon

PILED: An Identify-and-Localize Framework for Few-Shot Event Detection

Add code
Bookmark button
Alert button
Feb 15, 2022
Sha Li, Liyuan Liu, Yiqing Xie, Heng Ji, Jiawei Han

Figure 1 for PILED: An Identify-and-Localize Framework for Few-Shot Event Detection
Figure 2 for PILED: An Identify-and-Localize Framework for Few-Shot Event Detection
Figure 3 for PILED: An Identify-and-Localize Framework for Few-Shot Event Detection
Figure 4 for PILED: An Identify-and-Localize Framework for Few-Shot Event Detection
Viaarxiv icon

Double Descent in Adversarial Training: An Implicit Label Noise Perspective

Add code
Bookmark button
Alert button
Oct 07, 2021
Chengyu Dong, Liyuan Liu, Jingbo Shang

Figure 1 for Double Descent in Adversarial Training: An Implicit Label Noise Perspective
Figure 2 for Double Descent in Adversarial Training: An Implicit Label Noise Perspective
Figure 3 for Double Descent in Adversarial Training: An Implicit Label Noise Perspective
Figure 4 for Double Descent in Adversarial Training: An Implicit Label Noise Perspective
Viaarxiv icon