Daxin Jiang

WithAnyone: Towards Controllable and ID Consistent Image Generation

Oct 16, 2025
Via arXiv

Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models

Oct 10, 2025
Via arXiv

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging

Aug 26, 2025
Via arXiv

Step-Audio 2 Technical Report

Jul 24, 2025
Via arXiv

Can Mixture-of-Experts Surpass Dense LLMs Under Strictly Equal Resources?

Jun 13, 2025
Via arXiv

Farseer: A Refined Scaling Law in Large Language Models

Jun 12, 2025
Via arXiv

DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning

Jun 11, 2025
Via arXiv

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Jun 10, 2025
Via arXiv

Beyond the First Error: Process Reward Models for Reflective Mathematical Reasoning

May 20, 2025
Via arXiv

Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning

May 20, 2025
Via arXiv