Picture for Kun Xu

Kun Xu

Generative Retrieval via Diffusion Transformer with Metric-Ordered Sequence Training and Hybrid-Policy Preference Optimization

Add code
Jun 25, 2026
Viaarxiv icon

Bridging the Morphology Gap: Adapting VLA Models to Dexterous Manipulation via Intent-Conditioned Fine-Tuning

Add code
Jun 10, 2026
Viaarxiv icon

MLT-Dedup: Efficient Large-Scale Online Video Deduplication via Multi-Level Representations and Spatial-Temporal Matching

Add code
Jun 10, 2026
Viaarxiv icon

MatchLM2Lite: A Scalable MLLM-to-Lite Framework for Reproduced Content Identification

Add code
Jun 10, 2026
Viaarxiv icon

ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

Add code
Jun 09, 2026
Viaarxiv icon

BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion

Add code
May 12, 2026
Viaarxiv icon

ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning

Add code
Mar 31, 2026
Viaarxiv icon

CAMEL: Confidence-Gated Reflection for Reward Modeling

Add code
Feb 24, 2026
Viaarxiv icon

UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model

Add code
Feb 15, 2026
Viaarxiv icon

WideSeek: Advancing Wide Research via Multi-Agent Scaling

Add code
Feb 02, 2026
Viaarxiv icon