Xin Lu

Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings

Feb 14, 2026

Bidirectional Reward-Guided Diffusion for Real-World Image Super-Resolution

Feb 05, 2026

A Universal Load Balancing Principle and Its Application to Large Language Model Serving

Jan 25, 2026

STAR-S: Improving Safety Alignment through Self-Taught Reasoning on Safety Rules

Jan 07, 2026

ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing

Jan 06, 2026

InstructMoLE: Instruction-Guided Mixture of Low-rank Experts for Multi-Conditional Image Generation

Dec 25, 2025

StoryMem: Multi-shot Long Video Storytelling with Memory

Dec 22, 2025

CompEvent: Complex-valued Event-RGB Fusion for Low-light Video Enhancement and Deblurring

Nov 18, 2025

MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation

Nov 14, 2025

DiSE: A diffusion probabilistic model for automatic structure elucidation of organic compounds

Oct 30, 2025