Picture for Yifan Song

Yifan Song

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Add code
Feb 03, 2026
Viaarxiv icon

MiMo-V2-Flash Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

MiMo-Audio: Audio Language Models are Few-Shot Learners

Add code
Dec 29, 2025
Viaarxiv icon

Personalized Real-time Jargon Support for Online Meetings

Add code
Aug 13, 2025
Figure 1 for Personalized Real-time Jargon Support for Online Meetings
Figure 2 for Personalized Real-time Jargon Support for Online Meetings
Figure 3 for Personalized Real-time Jargon Support for Online Meetings
Figure 4 for Personalized Real-time Jargon Support for Online Meetings
Viaarxiv icon

P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis

Add code
Aug 06, 2025
Figure 1 for P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis
Figure 2 for P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis
Figure 3 for P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis
Figure 4 for P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis
Viaarxiv icon

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon

Adding Additional Control to One-Step Diffusion with Joint Distribution Matching

Add code
Mar 09, 2025
Figure 1 for Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
Figure 2 for Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
Figure 3 for Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
Figure 4 for Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
Viaarxiv icon

MPO: Boosting LLM Agents with Meta Plan Optimization

Add code
Mar 04, 2025
Figure 1 for MPO: Boosting LLM Agents with Meta Plan Optimization
Figure 2 for MPO: Boosting LLM Agents with Meta Plan Optimization
Figure 3 for MPO: Boosting LLM Agents with Meta Plan Optimization
Figure 4 for MPO: Boosting LLM Agents with Meta Plan Optimization
Viaarxiv icon

Decoupled Graph Energy-based Model for Node Out-of-Distribution Detection on Heterophilic Graphs

Add code
Feb 25, 2025
Viaarxiv icon

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression

Add code
Dec 17, 2024
Viaarxiv icon