Picture for Hao Gu

Hao Gu

QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training--Inference Mismatch

Add code
Apr 09, 2026
Viaarxiv icon

Bit-by-Bit: Progressive QAT Strategy with Outlier Channel Splitting for Stable Low-Bit LLMs

Add code
Apr 09, 2026
Viaarxiv icon

Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon

Spectral Imbalance Causes Forgetting in Low-Rank Continual Adaptation

Add code
Jan 31, 2026
Viaarxiv icon

OV-InstructTTS: Towards Open-Vocabulary Instruct Text-to-Speech

Add code
Jan 04, 2026
Viaarxiv icon

R4ec: A Reasoning, Reflection, and Refinement Framework for Recommendation Systems

Add code
Jul 23, 2025
Viaarxiv icon

Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model

Add code
May 26, 2025
Viaarxiv icon

Hearing from Silence: Reasoning Audio Descriptions from Silent Videos via Vision-Language Model

Add code
May 19, 2025
Viaarxiv icon

$\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection

Add code
May 16, 2025
Figure 1 for $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Figure 2 for $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Figure 3 for $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Figure 4 for $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Viaarxiv icon

Relative Contrastive Learning for Sequential Recommendation with Similarity-based Positive Pair Selection

Add code
Apr 27, 2025
Viaarxiv icon