Picture for Lu Hou

Lu Hou

Huawei Noah's Ark Lab

See, Remember, Explore: A Benchmark and Baselines for Streaming Spatial Reasoning

Add code
Mar 25, 2026
Viaarxiv icon

DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding

Add code
Mar 19, 2026
Viaarxiv icon

Deeper Thought, Weaker Aim: Understanding and Mitigating Perceptual Impairment during Reasoning in Multimodal Large Language Models

Add code
Mar 15, 2026
Viaarxiv icon

DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving

Add code
Mar 11, 2026
Viaarxiv icon

What Makes Low-Bit Quantization-Aware Training Work for Reasoning LLMs? A Systematic Study

Add code
Jan 21, 2026
Viaarxiv icon

InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search

Add code
Dec 21, 2025
Viaarxiv icon

DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning

Add code
Dec 14, 2025
Figure 1 for DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Figure 2 for DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Figure 3 for DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Figure 4 for DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Viaarxiv icon

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

Add code
Oct 14, 2025
Figure 1 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 2 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 3 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Figure 4 for DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Viaarxiv icon

Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance

Add code
Aug 10, 2025
Figure 1 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 2 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 3 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 4 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Viaarxiv icon

Efficient Reasoning for Large Reasoning Language Models via Certainty-Guided Reflection Suppression

Add code
Aug 07, 2025
Viaarxiv icon