Picture for Bowei He

Bowei He

MINER: Mining Multimodal Internal Representation for Efficient Retrieval

Add code
May 07, 2026
Viaarxiv icon

Dual-Pool Token-Budget Routing for Cost-Efficient and Reliable LLM Serving

Add code
Apr 09, 2026
Viaarxiv icon

Learning Quantised Structure-Preserving Motion Representations for Dance Fingerprinting

Add code
Apr 01, 2026
Viaarxiv icon

From Inference Routing to Agent Orchestration: Declarative Policy Compilation with Cross-Layer Verification

Add code
Mar 28, 2026
Viaarxiv icon

Knowledge Access Beats Model Size: Memory Augmented Routing for Persistent AI Agents

Add code
Mar 24, 2026
Viaarxiv icon

The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project

Add code
Mar 22, 2026
Viaarxiv icon

From Token to Item: Enhancing Large Language Models for Recommendation via Item-aware Attention Mechanism

Add code
Mar 20, 2026
Viaarxiv icon

Conflict-Free Policy Languages for Probabilistic ML Predicates: A Framework and Case Study with the Semantic Router DSL

Add code
Mar 18, 2026
Viaarxiv icon

Visual Confused Deputy: Exploiting and Defending Perception Failures in Computer-Using Agents

Add code
Mar 16, 2026
Viaarxiv icon

Adaptive Vision-Language Model Routing for Computer Use Agents

Add code
Mar 13, 2026
Viaarxiv icon