Picture for James M. Rehg

James M. Rehg

LSM-2: Learning from Incomplete Wearable Sensor Data

Add code
Jun 05, 2025
Viaarxiv icon

Incorporating Flexible Image Conditioning into Text-to-Video Diffusion Models without Training

Add code
May 27, 2025
Viaarxiv icon

MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models

Add code
May 26, 2025
Viaarxiv icon

ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models

Add code
May 12, 2025
Viaarxiv icon

SocialGesture: Delving into Multi-person Gesture Understanding

Add code
Apr 03, 2025
Viaarxiv icon

Learning Predictive Visuomotor Coordination

Add code
Mar 30, 2025
Viaarxiv icon

Towards Online Multi-Modal Social Interaction Understanding

Add code
Mar 25, 2025
Viaarxiv icon

Pulse-PPG: An Open-Source Field-Trained PPG Foundation Model for Wearable Applications Across Lab and Field Settings

Add code
Feb 03, 2025
Viaarxiv icon

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images

Add code
Jan 08, 2025
Figure 1 for SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
Figure 2 for SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
Figure 3 for SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
Figure 4 for SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
Viaarxiv icon

Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders

Add code
Dec 12, 2024
Viaarxiv icon