Image


PrevizWhiz: Combining Rough 3D Scenes and 2D Video to Guide Generative Video Previsualization

Add code
Feb 03, 2026
Viaarxiv icon

Continuous Control of Editing Models via Adaptive-Origin Guidance

Add code
Feb 03, 2026
Viaarxiv icon

Progressive Checkerboards for Autoregressive Multiscale Image Generation

Add code
Feb 03, 2026
Viaarxiv icon

Fast Sampling for Flows and Diffusions with Lazy and Point Mass Stochastic Interpolants

Add code
Feb 03, 2026
Viaarxiv icon

Zero-shot large vision-language model prompting for automated bone identification in paleoradiology x-ray archives

Add code
Feb 03, 2026
Viaarxiv icon

Edge-Optimized Vision-Language Models for Underground Infrastructure Assessment

Add code
Feb 03, 2026
Viaarxiv icon

OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering

Add code
Feb 03, 2026
Viaarxiv icon

MM-SCALE: Grounded Multimodal Moral Reasoning via Scalar Judgment and Listwise Alignment

Add code
Feb 03, 2026
Viaarxiv icon

Quasi-multimodal-based pathophysiological feature learning for retinal disease diagnosis

Add code
Feb 03, 2026
Viaarxiv icon

A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures

Add code
Feb 03, 2026
Viaarxiv icon