Picture for Lei He

Lei He

ArchSIBench: Benchmarking the Architectural Spatial Intelligence of Vision-Language Models

Add code
May 20, 2026
Viaarxiv icon

Sketch2MinSurf: Vision-Language Guided Generation of Editable Minimal Surfaces from Hand-Drawn Sketches

Add code
May 20, 2026
Viaarxiv icon

Geo-EVS: Geometry-Conditioned Extrapolative View Synthesis for Autonomous Driving

Add code
Apr 08, 2026
Viaarxiv icon

Not All Agents Matter: From Global Attention Dilution to Risk-Prioritized Game Planning

Add code
Apr 07, 2026
Viaarxiv icon

PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation

Add code
Oct 01, 2025
Viaarxiv icon

Fine-Tuning Large Multimodal Models for Automatic Pronunciation Assessment

Add code
Sep 19, 2025
Viaarxiv icon

TransforMARS: Fault-Tolerant Self-Reconfiguration for Arbitrarily Shaped Modular Aerial Robot Systems

Add code
Sep 17, 2025
Viaarxiv icon

SIGN: Safety-Aware Image-Goal Navigation for Autonomous Drones via Reinforcement Learning

Add code
Aug 17, 2025
Viaarxiv icon

Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring

Add code
Aug 12, 2025
Viaarxiv icon

VLM-3D:End-to-End Vision-Language Models for Open-World 3D Perception

Add code
Aug 12, 2025
Viaarxiv icon