Picture for Lu Hou

Lu Hou

Huawei Noah's Ark Lab

InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search

Add code
Dec 21, 2025
Viaarxiv icon

DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning

Add code
Dec 14, 2025
Figure 1 for DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Figure 2 for DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Figure 3 for DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Figure 4 for DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Viaarxiv icon

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

Add code
Oct 14, 2025
Figure 1 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 2 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 3 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 4 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Viaarxiv icon

Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance

Add code
Aug 10, 2025
Figure 1 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 2 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 3 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 4 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Viaarxiv icon

Efficient Reasoning for Large Reasoning Language Models via Certainty-Guided Reflection Suppression

Add code
Aug 07, 2025
Viaarxiv icon

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs

Add code
Jul 10, 2025
Figure 1 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 2 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 3 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 4 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Viaarxiv icon

A Simple Linear Patch Revives Layer-Pruned Large Language Models

Add code
May 30, 2025
Figure 1 for A Simple Linear Patch Revives Layer-Pruned Large Language Models
Figure 2 for A Simple Linear Patch Revives Layer-Pruned Large Language Models
Figure 3 for A Simple Linear Patch Revives Layer-Pruned Large Language Models
Figure 4 for A Simple Linear Patch Revives Layer-Pruned Large Language Models
Viaarxiv icon

Faster and Better LLMs via Latency-Aware Test-Time Scaling

Add code
May 26, 2025
Viaarxiv icon

Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging

Add code
May 26, 2025
Figure 1 for Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging
Figure 2 for Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging
Figure 3 for Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging
Figure 4 for Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging
Viaarxiv icon

Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models

Add code
Apr 07, 2025
Viaarxiv icon