James M. Rehg

Immune2V: Image Immunization Against Dual-Stream Image-to-Video Generation

Apr 12, 2026
Via arXiv

Self-Improving 4D Perception via Self-Distillation

Apr 09, 2026
Via arXiv

STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding

Mar 29, 2026
Via arXiv

How Well Do Multimodal Models Reason on ECG Signals?

Feb 27, 2026
Via arXiv

Vinedresser3D: Agentic Text-guided 3D Editing

Feb 23, 2026
Via arXiv

Toward Cognitive Supersensing in Multimodal Large Language Model

Feb 02, 2026
Via arXiv

How Much 3D Do Video Foundation Models Encode?

Dec 23, 2025
Via arXiv

Improving Personalized Search with Regularized Low-Rank Parameter Updates

Jun 11, 2025
Via arXiv

LSM-2: Learning from Incomplete Wearable Sensor Data

Jun 05, 2025
Via arXiv

Incorporating Flexible Image Conditioning into Text-to-Video Diffusion Models without Training

May 27, 2025
Via arXiv