Picture for Wei Zhai

Wei Zhai

University of Science and Technology of China, China, JD Explore Academy, JD.com, China

End-to-End Spatial-Temporal Transformer for Real-time 4D HOI Reconstruction

Add code
Mar 15, 2026
Viaarxiv icon

EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning

Add code
Mar 12, 2026
Viaarxiv icon

Event-based Visual Deformation Measurement

Add code
Feb 16, 2026
Viaarxiv icon

Unbiased Gradient Estimation for Event Binning via Functional Backpropagation

Add code
Feb 13, 2026
Viaarxiv icon

TrackTeller: Temporal Multimodal 3D Grounding for Behavior-Dependent Object References

Add code
Dec 25, 2025
Viaarxiv icon

MatE: Material Extraction from Single-Image via Geometric Prior

Add code
Dec 20, 2025
Viaarxiv icon

Anchoring Values in Temporal and Group Dimensions for Flow Matching Model Alignment

Add code
Dec 13, 2025
Viaarxiv icon

Beyond Randomness: Understand the Order of the Noise in Diffusion

Add code
Nov 11, 2025
Figure 1 for Beyond Randomness: Understand the Order of the Noise in Diffusion
Figure 2 for Beyond Randomness: Understand the Order of the Noise in Diffusion
Figure 3 for Beyond Randomness: Understand the Order of the Noise in Diffusion
Figure 4 for Beyond Randomness: Understand the Order of the Noise in Diffusion
Viaarxiv icon

AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

Add code
Jun 05, 2025
Viaarxiv icon

GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images

Add code
May 10, 2025
Figure 1 for GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images
Figure 2 for GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images
Figure 3 for GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images
Figure 4 for GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images
Viaarxiv icon