Yihua Cheng

VL4Gaze: Unleashing Vision-Language Models for Gaze Following

Dec 23, 2025
Via arXiv

EVICPRESS: Joint KV-Cache Compression and Eviction for Efficient LLM Serving

Dec 16, 2025
Via arXiv

RTGaze: Real-Time 3D-Aware Gaze Redirection from a Single Image

Nov 14, 2025
Via arXiv

Excavate the potential of Single-Scale Features: A Decomposition Network for Water-Related Optical Image Enhancement

Aug 06, 2025
Via arXiv

Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction

Mar 31, 2025
Via arXiv

Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM

Mar 13, 2025
Via arXiv

PersonaBooth: Personalized Text-to-Motion Generation

Mar 10, 2025
Via arXiv

3D Prior is All You Need: Cross-Task Few-shot 2D Gaze Estimation

Feb 06, 2025
Via arXiv

Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models

Sep 30, 2024
Via arXiv

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model

Jul 17, 2024
Via arXiv