Picture for Qi Zhang

Qi Zhang

School of Information, North China University of Technology

Improving RL Exploration for LLM Reasoning through Retrospective Replay

Add code
Apr 19, 2025
Viaarxiv icon

MAIN: Mutual Alignment Is Necessary for instruction tuning

Add code
Apr 17, 2025
Viaarxiv icon

Seedream 3.0 Technical Report

Add code
Apr 16, 2025
Viaarxiv icon

LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models

Add code
Apr 14, 2025
Viaarxiv icon

DynClean: Training Dynamics-based Label Cleaning for Distantly-Supervised Named Entity Recognition

Add code
Apr 06, 2025
Viaarxiv icon

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations

Add code
Mar 19, 2025
Viaarxiv icon

Deep Reinforcement Learning-based Video-Haptic Radio Resource Slicing in Tactile Internet

Add code
Mar 18, 2025
Viaarxiv icon

Highly Efficient Direct Analytics on Semantic-aware Time Series Data Compression

Add code
Mar 17, 2025
Viaarxiv icon

U2AD: Uncertainty-based Unsupervised Anomaly Detection Framework for Detecting T2 Hyperintensity in MRI Spinal Cord

Add code
Mar 17, 2025
Viaarxiv icon

Mitigating Ambiguities in 3D Classification with Gaussian Splatting

Add code
Mar 11, 2025
Viaarxiv icon