Picture for Zhensong Zhang

Zhensong Zhang

GRVS: a Generalizable and Recurrent Approach to Monocular Dynamic View Synthesis

Add code
Mar 31, 2026
Viaarxiv icon

Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing

Add code
Mar 23, 2026
Viaarxiv icon

Diffusion-Based Makeup Transfer with Facial Region-Aware Makeup Features

Add code
Mar 20, 2026
Viaarxiv icon

LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion

Add code
Mar 15, 2026
Viaarxiv icon

Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI

Add code
Mar 01, 2026
Viaarxiv icon

ICo3D: An Interactive Conversational 3D Virtual Human

Add code
Jan 19, 2026
Viaarxiv icon

Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps

Add code
Jan 16, 2026
Viaarxiv icon

Optimizing Multimodal LLMs for Egocentric Video Understanding: A Solution for the HD-EPIC VQA Challenge

Add code
Jan 15, 2026
Viaarxiv icon

Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting

Add code
Dec 17, 2025
Figure 1 for Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting
Figure 2 for Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting
Figure 3 for Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting
Figure 4 for Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting
Viaarxiv icon

Charge: A Comprehensive Novel View Synthesis Benchmark and Dataset to Bind Them All

Add code
Dec 15, 2025
Viaarxiv icon