Yuheng Wu

Last But Not Least: Boundary Attention CalibratiON for Multimodal KV Cache Compression

Jun 16, 2026

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

Jun 09, 2026

LACO: Adaptive Latent Communication for Collaborative Driving

May 21, 2026

Delta Forcing: Trust Region Steering for Interactive Autoregressive Video Generation

May 14, 2026

AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents

Apr 27, 2026

PISCO: Precise Video Instance Insertion with Sparse Control

Feb 09, 2026

ARGaze: Autoregressive Transformers for Online Egocentric Gaze Estimation

Feb 04, 2026

LLM-FSM: Scaling Large Language Models for Finite-State Reasoning in RTL Code Generation

Feb 03, 2026

AI-Driven Prediction of Cancer Pain Episodes: A Hybrid Decision Support Approach

Dec 18, 2025

P3-LLM: An Integrated NPU-PIM Accelerator for LLM Inference Using Hybrid Numerical Formats

Nov 16, 2025