DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Oct 14, 2025
Via arXiv

UniFusion: Vision-Language Model as Unified Encoder in Image Generation

Oct 14, 2025
Via arXiv

Wavefront Coding for Accommodation-Invariant Near-Eye Displays

Oct 14, 2025
Via arXiv

KoALA: KL-L0 Adversarial Detector via Label Agreement

Oct 14, 2025
Via arXiv

DiffEM: Learning from Corrupted Data with Diffusion Models via Expectation Maximization

Oct 14, 2025
Via arXiv

CoRA: Covariate-Aware Adaptation of Time Series Foundation Models

Oct 14, 2025
Via arXiv

Zero-Shot CFC: Fast Real-World Image Denoising based on Cross-Frequency Consistency

Oct 14, 2025
Via arXiv

SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression

Oct 14, 2025
Via arXiv

Time-Correlated Video Bridge Matching

Oct 14, 2025
Via arXiv

A Review of Longitudinal Radiology Report Generation: Dataset Composition, Methods, and Performance Evaluation

Oct 14, 2025
Via arXiv