Picture for James M. Rehg

James M. Rehg

3x2: 3D Object Part Segmentation by 2D Semantic Correspondences

Add code
Jul 12, 2024
Viaarxiv icon

Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation

Add code
Jun 27, 2024
Figure 1 for Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation
Figure 2 for Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation
Figure 3 for Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation
Figure 4 for Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation
Viaarxiv icon

MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs

Add code
Jun 24, 2024
Viaarxiv icon

What is the Visual Cognition Gap between Humans and Multimodal LLMs?

Add code
Jun 14, 2024
Viaarxiv icon

PointInfinity: Resolution-Invariant Point Diffusion Models

Add code
Apr 04, 2024
Figure 1 for PointInfinity: Resolution-Invariant Point Diffusion Models
Figure 2 for PointInfinity: Resolution-Invariant Point Diffusion Models
Figure 3 for PointInfinity: Resolution-Invariant Point Diffusion Models
Figure 4 for PointInfinity: Resolution-Invariant Point Diffusion Models
Viaarxiv icon

Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations

Add code
Mar 04, 2024
Figure 1 for Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations
Figure 2 for Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations
Figure 3 for Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations
Figure 4 for Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations
Viaarxiv icon

ZeroShape: Regression-based Zero-shot Shape Reconstruction

Add code
Jan 16, 2024
Figure 1 for ZeroShape: Regression-based Zero-shot Shape Reconstruction
Figure 2 for ZeroShape: Regression-based Zero-shot Shape Reconstruction
Figure 3 for ZeroShape: Regression-based Zero-shot Shape Reconstruction
Figure 4 for ZeroShape: Regression-based Zero-shot Shape Reconstruction
Viaarxiv icon

The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective

Add code
Dec 20, 2023
Figure 1 for The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
Figure 2 for The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
Figure 3 for The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
Figure 4 for The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
Viaarxiv icon

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models

Add code
Dec 07, 2023
Viaarxiv icon

LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs

Add code
Dec 07, 2023
Figure 1 for LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs
Figure 2 for LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs
Figure 3 for LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs
Figure 4 for LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs
Viaarxiv icon