Picture for Bo Li

Bo Li

Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing, China

Making MLLMs Blind: Adversarial Smuggling Attacks in MLLM Content Moderation

Add code
Apr 09, 2026
Viaarxiv icon

Data Selection for Multi-turn Dialogue Instruction Tuning

Add code
Apr 09, 2026
Viaarxiv icon

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Add code
Apr 06, 2026
Viaarxiv icon

ShieldNet: Network-Level Guardrails against Emerging Supply-Chain Injections in Agentic Systems

Add code
Apr 06, 2026
Viaarxiv icon

MMTalker: Multiresolution 3D Talking Head Synthesis with Multimodal Feature Fusion

Add code
Apr 03, 2026
Viaarxiv icon

Xuanwu: Evolving General Multimodal Models into an Industrial-Grade Foundation for Content Ecosystems

Add code
Mar 31, 2026
Viaarxiv icon

Decompose and Transfer: CoT-Prompting Enhanced Alignment for Open-Vocabulary Temporal Action Detection

Add code
Mar 25, 2026
Viaarxiv icon

ReDiffuse: Rotation Equivariant Diffusion Model for Multi-focus Image Fusion

Add code
Mar 22, 2026
Viaarxiv icon

Demystifing Video Reasoning

Add code
Mar 17, 2026
Viaarxiv icon

Good Arguments Against the People Pleasers: How Reasoning Mitigates (Yet Masks) LLM Sycophancy

Add code
Mar 17, 2026
Viaarxiv icon