Picture for Yun Chen

Yun Chen

Towards Bridging the Reward-Generation Gap in Direct Alignment Algorithms

Add code
Jun 11, 2025
Viaarxiv icon

Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers

Add code
Jun 05, 2025
Viaarxiv icon

AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion

Add code
May 28, 2025
Viaarxiv icon

TAG-INSTRUCT: Controlled Instruction Complexity Enhancement through Structure-based Augmentation

Add code
May 24, 2025
Viaarxiv icon

ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs

Add code
Apr 17, 2025
Viaarxiv icon

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

Add code
Mar 21, 2025
Viaarxiv icon

A Hybrid Model/Data-Driven Solution to Channel, Position and Orientation Tracking in mmWave Vehicular Systems

Add code
Mar 07, 2025
Viaarxiv icon

LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy

Add code
Feb 17, 2025
Viaarxiv icon

Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective

Add code
Dec 02, 2024
Viaarxiv icon

Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions

Add code
Nov 15, 2024
Figure 1 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 2 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 3 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 4 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Viaarxiv icon