Xiaoyan Sun

Holo-World: Unified Camera, Object and Weather Control for Video World Model

Jun 18, 2026
Via arXiv

LaME: Learning to Think in Latent Space for Multimodal Embedding via Information Bottleneck

Jun 11, 2026
Via arXiv

A Comprehensive Ecosystem for Open-Domain Customized Video Generation

Jun 10, 2026
Via arXiv

Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning

May 20, 2026
Via arXiv

Beyond Chain-of-Thought: Rewrite as a Universal Interface for Generative Multimodal Embeddings

Apr 24, 2026
Via arXiv

ReconMIL: Synergizing Latent Space Reconstruction with Bi-Stream Mamba for Whole Slide Image Analysis

Mar 20, 2026
Via arXiv

RiO-DETR: DETR for Real-time Oriented Object Detection

Mar 10, 2026
Via arXiv

Generalizable and Interpretable RF Fingerprinting with Shapelet-Enhanced Large Language Models

Feb 03, 2026
Via arXiv

TALON: Confidence-Aware Speculative Decoding with Adaptive Token Trees

Jan 12, 2026
Via arXiv

MAFS: Multi-head Attention Feature Selection for High-Dimensional Data via Deep Fusion of Filter Methods

Jan 06, 2026
Via arXiv