Jian Luan

Xiaomi MiMo-VL-Miloco Technical Report

Dec 22, 2025

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Nov 17, 2025

STEP: Success-Rate-Aware Trajectory-Efficient Policy Optimization

Nov 17, 2025

Revisiting Entropy in Reinforcement Learning for Large Reasoning Models

Nov 08, 2025

HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration

Oct 31, 2025

DiffRhythm 2: Efficient and High Fidelity Song Generation via Block Flow Matching

Oct 27, 2025

Thinking in cocktail party: Chain-of-Thought and reinforcement learning for target speaker automatic speech recognition

Sep 19, 2025

BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent

Sep 19, 2025

Lightweight speech enhancement guided target speech extraction in noisy multi-speaker scenarios

Aug 27, 2025

Attention Basin: Why Contextual Position Matters in Large Language Models

Aug 07, 2025