Bo Li

Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing, China

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Mar 03, 2026
Via arXiv

Cross-Family Speculative Prefill: Training-Free Long-Context Compression with Small Draft Models

Mar 03, 2026
Via arXiv

Micro-expression Recognition Based on Dual-branch Feature Extraction and Fusion

Feb 27, 2026
Via arXiv

Tokenization, Fusion and Decoupling: Bridging the Granularity Mismatch Between Large Language Models and Knowledge Graphs

Feb 26, 2026
Via arXiv

AnimeAgent: Is the Multi-Agent via Image-to-Video models a Good Disney Storytelling Artist?

Feb 24, 2026
Via arXiv

A Very Big Video Reasoning Suite

Feb 24, 2026
Via arXiv

FAMOSE: A ReAct Approach to Automated Feature Discovery

Feb 19, 2026
Via arXiv

The Limits of Long-Context Reasoning in Automated Bug Fixing

Feb 17, 2026
Via arXiv

Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents

Feb 13, 2026
Via arXiv

BrowseComp-$V^3$: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents

Feb 13, 2026
Via arXiv