Picture for Yuzhuang Xu

Yuzhuang Xu

Think Before You Accept: Semantic Reflective Verification for Faster Speculative Decoding

Add code
May 24, 2025
Viaarxiv icon

Lookahead Q-Cache: Achieving More Consistent KV Cache Eviction via Pseudo Query

Add code
May 24, 2025
Viaarxiv icon

Perspective Transition of Large Language Models for Solving Subjective Tasks

Add code
Jan 16, 2025
Viaarxiv icon

CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs

Add code
Dec 12, 2024
Viaarxiv icon

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models

Add code
Oct 07, 2024
Viaarxiv icon

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models

Add code
Jun 13, 2024
Figure 1 for Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Figure 2 for Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Figure 3 for Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Figure 4 for Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Viaarxiv icon

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

Add code
Feb 18, 2024
Viaarxiv icon

OneBit: Towards Extremely Low-bit Large Language Models

Add code
Feb 17, 2024
Viaarxiv icon

Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf

Add code
Sep 09, 2023
Viaarxiv icon

Pluggable Neural Machine Translation Models via Memory-augmented Adapters

Add code
Jul 12, 2023
Viaarxiv icon