Image


VGG-T$^3$: Offline Feed-Forward 3D Reconstruction at Scale

Add code
Feb 26, 2026
Viaarxiv icon

A Dataset is Worth 1 MB

Add code
Feb 26, 2026
Viaarxiv icon

PRIMA: Pre-training with Risk-integrated Image-Metadata Alignment for Medical Diagnosis via LLM

Add code
Feb 26, 2026
Viaarxiv icon

ManifoldGD: Training-Free Hierarchical Manifold Guidance for Diffusion-Based Dataset Distillation

Add code
Feb 26, 2026
Viaarxiv icon

MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction

Add code
Feb 26, 2026
Viaarxiv icon

ColoDiff: Integrating Dynamic Consistency With Content Awareness for Colonoscopy Video Generation

Add code
Feb 26, 2026
Viaarxiv icon

Uni-Animator: Towards Unified Visual Colorization

Add code
Feb 26, 2026
Viaarxiv icon

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Add code
Feb 26, 2026
Viaarxiv icon

PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering

Add code
Feb 26, 2026
Viaarxiv icon

Cytoarchitecture in Words: Weakly Supervised Vision-Language Modeling for Human Brain Microscopy

Add code
Feb 26, 2026
Viaarxiv icon