Cordelia Schmid

Thoth

InteractVLM: 3D Interaction Reasoning from 2D Foundational Models

Apr 07, 2025
Via arXiv

Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs

Mar 31, 2025
Via arXiv

HORT: Monocular Hand-held Objects Reconstruction with Transformers

Mar 27, 2025
Via arXiv

Online 3D Scene Reconstruction Using Neural Object Priors

Mar 24, 2025
Via arXiv

Large-scale Pre-training for Grounded Video Caption Generation

Mar 13, 2025
Via arXiv

What Are You Doing? A Closer Look at Controllable Human Video Generation

Mar 06, 2025
Via arXiv

FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

Mar 06, 2025
Via arXiv

Causal Lifting of Neural Representations: Zero-Shot Generalization for Causal Inferences

Feb 10, 2025
Via arXiv

Neptune: The Long Orbit to Benchmarking Long Video Understanding

Dec 12, 2024
Via arXiv

Visual Lexicon: Rich Image Features in Language Space

Dec 09, 2024
Via arXiv