Dan Zhang

A Novel Reconfigurable Dexterous Hand Based on Triple-Symmetric Bricard Parallel Mechanism

Mar 01, 2026

R-Diverse: Mitigating Diversity Illusion in Self-Play LLM Training

Feb 16, 2026

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Feb 15, 2026

ERNIE 5.0 Technical Report

Feb 04, 2026

Self-Reconfiguration Planning for Deformable Quadrilateral Modular Robots

Jan 27, 2026

Rhombot: Rhombus-shaped Modular Robots for Stable, Medium-Independent Reconfiguration Motion

Jan 27, 2026

CORD: Bridging the Audio-Text Reasoning Gap via Weighted On-policy Cross-modal Distillation

Jan 23, 2026

MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free

Jan 08, 2026

TDRM: Smooth Reward Models with Temporal Difference for LLM RL and Inference

Sep 18, 2025

ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding

Aug 27, 2025