Picture for Runxi Cheng

Runxi Cheng

HS-STAR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation

Add code
May 26, 2025
Viaarxiv icon

Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging

Add code
May 26, 2025
Viaarxiv icon

Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors

Add code
Mar 11, 2025
Viaarxiv icon

Multi-Task Model Merging via Adaptive Weight Disentanglement

Add code
Nov 27, 2024
Figure 1 for Multi-Task Model Merging via Adaptive Weight Disentanglement
Figure 2 for Multi-Task Model Merging via Adaptive Weight Disentanglement
Figure 3 for Multi-Task Model Merging via Adaptive Weight Disentanglement
Figure 4 for Multi-Task Model Merging via Adaptive Weight Disentanglement
Viaarxiv icon

Kendall's $τ$ Coefficient for Logits Distillation

Add code
Sep 26, 2024
Figure 1 for Kendall's $τ$ Coefficient for Logits Distillation
Figure 2 for Kendall's $τ$ Coefficient for Logits Distillation
Figure 3 for Kendall's $τ$ Coefficient for Logits Distillation
Figure 4 for Kendall's $τ$ Coefficient for Logits Distillation
Viaarxiv icon

Learn To Learn More Precisely

Add code
Aug 08, 2024
Figure 1 for Learn To Learn More Precisely
Figure 2 for Learn To Learn More Precisely
Figure 3 for Learn To Learn More Precisely
Figure 4 for Learn To Learn More Precisely
Viaarxiv icon