Alexander Schwing

ETH Zurich

REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders

May 23, 2025

The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation

Mar 13, 2025

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Dec 19, 2024

MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds

Dec 09, 2024

RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations

Dec 02, 2024

On Inductive Biases That Enable Generalization of Diffusion Transformers

Oct 28, 2024

Pixel-Aligned Multi-View Generation with Depth Guided Decoder

Aug 26, 2024

NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows

Jun 15, 2024

Virtual Pets: Animatable Animal Generation in 3D Scenes

Dec 21, 2023

Putting the Object Back into Video Object Segmentation

Oct 19, 2023