Chi-Min Chan

Not Just the Destination, But the Journey: Reasoning Traces Causally Shape Generalization Behaviors

Mar 12, 2026
Via arXiv

DC-W2S: Dual-Consensus Weak-to-Strong Training for Reliable Process Reward Modeling in Biological Reasoning

Mar 09, 2026
Via arXiv

What, Whether and How? Unveiling Process Reward Models for Thinking with Images Reasoning

Feb 09, 2026
Via arXiv

Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning

Jan 20, 2026
Via arXiv

AM$^3$Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs

Jan 08, 2026
Via arXiv

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Dec 11, 2025
Via arXiv

The Mirage of Multimodality: Where Truth is Tested and Honesty Unravels

May 26, 2025
Via arXiv

J1: Exploring Simple Test-Time Scaling for LLM-as-a-Judge

May 17, 2025
Via arXiv

ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs

Mar 17, 2025
Via arXiv

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Feb 06, 2025
Via arXiv