Yixiao Zhou

Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL

Feb 13, 2026
Via arXiv

Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning

Jan 25, 2025
Via arXiv