Picture for David Doermann

David Doermann

CWCD: Category-Wise Contrastive Decoding for Structured Medical Report Generation

Add code
Apr 12, 2026
Viaarxiv icon

FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants

Add code
Mar 27, 2026
Viaarxiv icon

Kirchhoff-Inspired Neural Networks for Evolving High-Order Perception

Add code
Mar 25, 2026
Viaarxiv icon

Accelerating Social Science Research via Agentic Hypothesization and Experimentation

Add code
Feb 08, 2026
Viaarxiv icon

AutoEdit: Automatic Hyperparameter Tuning for Image Editing

Add code
Sep 18, 2025
Viaarxiv icon

On Measuring Intrinsic Causal Attributions in Deep Neural Networks

Add code
May 14, 2025
Viaarxiv icon

YOLOv12: Attention-Centric Real-Time Object Detectors

Add code
Feb 18, 2025
Viaarxiv icon

Personalized Large Vision-Language Models

Add code
Dec 23, 2024
Figure 1 for Personalized Large Vision-Language Models
Figure 2 for Personalized Large Vision-Language Models
Figure 3 for Personalized Large Vision-Language Models
Figure 4 for Personalized Large Vision-Language Models
Viaarxiv icon

ETLNet: An Efficient TCN-BiLSTM Network for Road Anomaly Detection Using Smartphone Sensors

Add code
Dec 06, 2024
Viaarxiv icon

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Add code
Jul 15, 2024
Figure 1 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 2 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 3 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 4 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Viaarxiv icon