Shwai He

RESSA: Repair Sparse Vision-Language Models via Sparse Cross-Modality Adaptation

Apr 03, 2024
Shwai He, Tianlong Chen

Reformatted Alignment

Feb 19, 2024
Run-Ze Fan, Xuefeng Li, Haoyang Zou, Junlong Li, Shwai He, Ethan Chern, Jiewen Hu, Pengfei Liu

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Feb 15, 2024
Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou

Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Feb 01, 2024
Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou

Merging Experts into One: Improving Computational Efficiency of Mixture of Experts

Oct 22, 2023
Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao

Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning

Oct 18, 2023
Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Heng Huang, Jiuxiang Gu, Tianyi Zhou

MerA: Merging Pretrained Adapters For Few-Shot Learning

Aug 30, 2023
Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao

Cherry Hypothesis: Identifying the Cherry on the Cake for Dynamic Networks

Nov 10, 2022
Shwai He, Liang Ding, Daize Dong, Boan Liu, Fuqiang Yu, Dacheng Tao

SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters

Oct 11, 2022
Shwai He, Liang Ding, Daize Dong, Miao Zhang, Dacheng Tao
