Picture for Yun Chen

Yun Chen

Heterogeneous Multi-Expert Reinforcement Learning for Long-Horizon Multi-Goal Tasks in Autonomous Forklifts

Add code
Jan 12, 2026
Viaarxiv icon

Infer As You Train: A Symmetric Paradigm of Masked Generative for Click-Through Rate Prediction

Add code
Nov 18, 2025
Viaarxiv icon

DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion

Add code
Aug 20, 2025
Figure 1 for DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion
Figure 2 for DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion
Figure 3 for DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion
Figure 4 for DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion
Viaarxiv icon

Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks

Add code
Aug 06, 2025
Figure 1 for Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks
Figure 2 for Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks
Figure 3 for Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks
Figure 4 for Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks
Viaarxiv icon

Towards Bridging the Reward-Generation Gap in Direct Alignment Algorithms

Add code
Jun 11, 2025
Viaarxiv icon

Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers

Add code
Jun 05, 2025
Viaarxiv icon

AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion

Add code
May 28, 2025
Figure 1 for AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion
Figure 2 for AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion
Figure 3 for AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion
Figure 4 for AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion
Viaarxiv icon

TAG-INSTRUCT: Controlled Instruction Complexity Enhancement through Structure-based Augmentation

Add code
May 24, 2025
Viaarxiv icon

ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs

Add code
Apr 17, 2025
Viaarxiv icon

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

Add code
Mar 21, 2025
Figure 1 for Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning
Figure 2 for Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning
Figure 3 for Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning
Figure 4 for Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning
Viaarxiv icon