Picture for Serge Belongie

Serge Belongie

Cornell Tech

HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement

Add code
Apr 12, 2026
Viaarxiv icon

VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward

Add code
Mar 27, 2026
Viaarxiv icon

Video Understanding: From Geometry and Semantics to Unified Models

Add code
Mar 18, 2026
Viaarxiv icon

Revisiting the Perception-Distortion Trade-off with Spatial-Semantic Guided Super-Resolution

Add code
Mar 14, 2026
Viaarxiv icon

The Latent Color Subspace: Emergent Order in High-Dimensional Chaos

Add code
Mar 12, 2026
Viaarxiv icon

RAIGen: Rare Attribute Identification in Text-to-Image Generative Models

Add code
Feb 06, 2026
Viaarxiv icon

MMEarth-Bench: Global Model Adaptation via Multimodal Test-Time Training

Add code
Feb 06, 2026
Viaarxiv icon

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Add code
Jan 28, 2026
Viaarxiv icon

SuperF: Neural Implicit Fields for Multi-Image Super-Resolution

Add code
Dec 09, 2025
Viaarxiv icon

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Add code
Dec 08, 2025
Figure 1 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 2 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 3 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 4 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Viaarxiv icon