Cordelia Schmid

Thoth

ComposeAnything: Composite Object Priors for Text-to-Image Generation

May 30, 2025
Via arXiv

Feasibility with Language Models for Open-World Compositional Zero-Shot Learning

May 16, 2025
Via arXiv

LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance

May 16, 2025
Via arXiv

MINERVA: Evaluating Complex Video Reasoning

May 01, 2025
Via arXiv

Memory-Modular Classification: Learning to Generalize with Memory Replacement

Apr 08, 2025
Via arXiv

InteractVLM: 3D Interaction Reasoning from 2D Foundational Models

Apr 07, 2025
Via arXiv

Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs

Mar 31, 2025
Via arXiv

HORT: Monocular Hand-held Objects Reconstruction with Transformers

Mar 27, 2025
Via arXiv

Online 3D Scene Reconstruction Using Neural Object Priors

Mar 24, 2025
Via arXiv

Large-scale Pre-training for Grounded Video Caption Generation

Mar 13, 2025
Via arXiv