Picture for Bo Li

Bo Li

Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing, China

BrowseComp-$V^3$: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents

Add code
Feb 13, 2026
Viaarxiv icon

Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents

Add code
Feb 13, 2026
Viaarxiv icon

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Add code
Feb 09, 2026
Viaarxiv icon

FlowConsist: Make Your Flow Consistent with Real Trajectory

Add code
Feb 06, 2026
Viaarxiv icon

ROMAN: Reward-Orchestrated Multi-Head Attention Network for Autonomous Driving System Testing

Add code
Feb 05, 2026
Viaarxiv icon

Copyright Detective: A Forensic System to Evidence LLMs Flickering Copyright Leakage Risks

Add code
Feb 05, 2026
Viaarxiv icon

Journey to the Centre of Cluster: Harnessing Interior Nodes for A/B Testing under Network Interference

Add code
Feb 04, 2026
Viaarxiv icon

Dissecting Outlier Dynamics in LLM NVFP4 Pretraining

Add code
Feb 02, 2026
Viaarxiv icon

InfoTok: Regulating Information Flow for Capacity-Constrained Shared Visual Tokenization in Unified MLLMs

Add code
Feb 02, 2026
Viaarxiv icon

Trust but Verify: Adaptive Conditioning for Reference-Based Diffusion Super-Resolution via Implicit Reference Correlation Modeling

Add code
Feb 02, 2026
Viaarxiv icon