Haoli Bai

Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance

Aug 10, 2025

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs

Jul 10, 2025

A Simple Linear Patch Revives Layer-Pruned Large Language Models

May 30, 2025

Faster and Better LLMs via Latency-Aware Test-Time Scaling

May 26, 2025

FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension

May 01, 2025

Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models

Apr 07, 2025

WeightedKV: Attention Scores Weighted Key-Value Cache Merging for Large Language Models

Mar 03, 2025

TreeKV: Smooth Key-Value Cache Compression with Tree Structures

Jan 09, 2025

FlatQuant: Flatness Matters for LLM Quantization

Oct 12, 2024

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Sep 26, 2024