Picture for Xiao Zhang

Xiao Zhang

HYDRA: Unifying Multi-modal Generation and Understanding via Representation-Harmonized Tokenization

Add code
Mar 17, 2026
Viaarxiv icon

Bringing Model Editing to Generative Recommendation in Cold-Start Scenarios

Add code
Mar 15, 2026
Viaarxiv icon

HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios

Add code
Mar 12, 2026
Viaarxiv icon

Inverse Contextual Bandits without Rewards: Learning from a Non-Stationary Learner via Suffix Imitation

Add code
Mar 04, 2026
Viaarxiv icon

Entropy-Guided GRVQ for Ultra-Low Bitrate Neural Speech Codec

Add code
Mar 02, 2026
Viaarxiv icon

Following the Diagnostic Trace: Visual Cognition-guided Cooperative Network for Chest X-Ray Diagnosis

Add code
Feb 25, 2026
Viaarxiv icon

Enhancing Bandit Algorithms with LLMs for Time-varying User Preferences in Streaming Recommendations

Add code
Feb 08, 2026
Viaarxiv icon

Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning

Add code
Feb 08, 2026
Viaarxiv icon

DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents

Add code
Feb 03, 2026
Viaarxiv icon

Stronger Semantic Encoders Can Harm Relighting Performance: Probing Visual Priors via Augmented Latent Intrinsics

Add code
Feb 01, 2026
Viaarxiv icon