Picture for Wei Zhai

Wei Zhai

University of Science and Technology of China, China, JD Explore Academy, JD.com, China

Dual-Pathway Geometry-Aware MLLM for Spatial Intelligence

Add code
May 25, 2026
Viaarxiv icon

Self-Consistent Latent Reasoning: Long Latent Sequence Reasoning for Vision-Language Model

Add code
May 13, 2026
Viaarxiv icon

AnchorSeg: Language Grounded Query Banks for Reasoning Segmentation

Add code
Apr 22, 2026
Viaarxiv icon

Gloria: Consistent Character Video Generation via Content Anchors

Add code
Mar 31, 2026
Viaarxiv icon

End-to-End Spatial-Temporal Transformer for Real-time 4D HOI Reconstruction

Add code
Mar 15, 2026
Viaarxiv icon

EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning

Add code
Mar 12, 2026
Viaarxiv icon

Event-based Visual Deformation Measurement

Add code
Feb 16, 2026
Viaarxiv icon

Unbiased Gradient Estimation for Event Binning via Functional Backpropagation

Add code
Feb 13, 2026
Viaarxiv icon

TrackTeller: Temporal Multimodal 3D Grounding for Behavior-Dependent Object References

Add code
Dec 25, 2025
Viaarxiv icon

MatE: Material Extraction from Single-Image via Geometric Prior

Add code
Dec 20, 2025
Viaarxiv icon