Huamin Chen

Knowledge Access Beats Model Size: Memory Augmented Routing for Persistent AI Agents

Mar 24, 2026
Via arXiv

The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project

Mar 22, 2026
Via arXiv

Conflict-Free Policy Languages for Probabilistic ML Predicates: A Framework and Case Study with the Semantic Router DSL

Mar 18, 2026
Via arXiv

Visual Confused Deputy: Exploiting and Defending Perception Failures in Computer-Using Agents

Mar 16, 2026
Via arXiv

98× Faster LLM Routing Without a Dedicated GPU: Flash Attention, Prompt Compression, and Near-Streaming for the vLLM Semantic Router

Mar 13, 2026
Via arXiv

Adaptive Vision-Language Model Routing for Computer Use Agents

Mar 13, 2026
Via arXiv

Outcome-Aware Tool Selection for Semantic Routers: Latency-Constrained Learning Without LLM Inference

Mar 13, 2026
Via arXiv

Building Trust: Foundations of Security, Safety and Transparency in AI

Nov 19, 2024
Via arXiv

Leveraging Large Language Models for Patient Engagement: The Power of Conversational AI in Digital Health

Jun 19, 2024
Via arXiv