Picture for Shuaiwen Leon Song

Shuaiwen Leon Song

Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Add code
Dec 31, 2025
Viaarxiv icon

Data Diversification Methods In Alignment Enhance Math Performance In LLMs

Add code
Jul 02, 2025
Figure 1 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 2 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 3 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 4 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Viaarxiv icon

Disentangling Reasoning and Knowledge in Medical Large Language Models

Add code
May 16, 2025
Figure 1 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 2 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 3 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 4 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Viaarxiv icon

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS

Add code
May 05, 2025
Figure 1 for Improving Model Alignment Through Collective Intelligence of Open-Source LLMS
Figure 2 for Improving Model Alignment Through Collective Intelligence of Open-Source LLMS
Figure 3 for Improving Model Alignment Through Collective Intelligence of Open-Source LLMS
Figure 4 for Improving Model Alignment Through Collective Intelligence of Open-Source LLMS
Viaarxiv icon

How Well Can General Vision-Language Models Learn Medicine By Watching Public Educational Videos?

Add code
Apr 19, 2025
Viaarxiv icon

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods

Add code
Apr 18, 2025
Viaarxiv icon

Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping

Add code
Jan 11, 2025
Figure 1 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 2 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 3 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 4 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Viaarxiv icon

CorDA: Context-Oriented Decomposition Adaptation of Large Language Models

Add code
Jun 07, 2024
Figure 1 for CorDA: Context-Oriented Decomposition Adaptation of Large Language Models
Figure 2 for CorDA: Context-Oriented Decomposition Adaptation of Large Language Models
Figure 3 for CorDA: Context-Oriented Decomposition Adaptation of Large Language Models
Figure 4 for CorDA: Context-Oriented Decomposition Adaptation of Large Language Models
Viaarxiv icon

Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model

Add code
Jun 03, 2024
Figure 1 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 2 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 3 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 4 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Viaarxiv icon

FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design

Add code
Jan 25, 2024
Figure 1 for FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design
Figure 2 for FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design
Figure 3 for FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design
Figure 4 for FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design
Viaarxiv icon