Roger Zimmermann

Audio-Visual Intelligence in Large Foundation Models

May 05, 2026
Via arXiv

Geometry over Density: Few-Shot Cross-Domain OOD Detection

May 05, 2026
Via arXiv

SOAR: Self-Correction for Optimal Alignment and Refinement in Diffusion Models

Apr 14, 2026
Via arXiv

OSCBench: Benchmarking Object State Change in Text-to-Video Generation

Mar 12, 2026
Via arXiv

CaPulse: Detecting Anomalies by Tuning in to the Causal Rhythms of Time Series

Aug 06, 2025
Via arXiv

Uni3D-MoE: Scalable Multimodal 3D Scene Understanding via Mixture of Experts

May 27, 2025
Via arXiv

JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation

May 15, 2025
Via arXiv

OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models

Apr 30, 2025
Via arXiv

Reimagining Urban Science: Scaling Causal Inference with Large Language Models

Apr 15, 2025
Via arXiv

TAIL: Text-Audio Incremental Learning

Mar 06, 2025
Via arXiv