Picture for Wenbin Li

Wenbin Li

Department of Computing, Imperial College London, London UK, SW7 2AZ

UHR-BAT: Budget-Aware Token Compression Vision-Language model for Ultra-High-Resolution Remote Sensing

Add code
Apr 15, 2026
Viaarxiv icon

CLASP: Class-Adaptive Layer Fusion and Dual-Stage Pruning for Multimodal Large Language Models

Add code
Apr 14, 2026
Viaarxiv icon

VideoTIR: Accurate Understanding for Long Videos with Efficient Tool-Integrated Reasoning

Add code
Mar 26, 2026
Viaarxiv icon

TAU-R1: Visual Language Model for Traffic Anomaly Understanding

Add code
Mar 19, 2026
Viaarxiv icon

Prompt-Free Universal Region Proposal Network

Add code
Mar 18, 2026
Viaarxiv icon

Annotation-Free Visual Reasoning for High-Resolution Large Multimodal Models via Reinforcement Learning

Add code
Feb 27, 2026
Viaarxiv icon

Hepato-LLaVA: An Expert MLLM with Sparse Topo-Pack Attention for Hepatocellular Pathology Analysis on Whole Slide Images

Add code
Feb 26, 2026
Viaarxiv icon

AviationLMM: A Large Multimodal Foundation Model for Civil Aviation

Add code
Jan 16, 2026
Viaarxiv icon

FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensing

Add code
Dec 30, 2025
Viaarxiv icon

LibContinual: A Comprehensive Library towards Realistic Continual Learning

Add code
Dec 26, 2025
Viaarxiv icon