Picture for Chen Huang

Chen Huang

E2Edev: Benchmarking Large Language Models in End-to-End Software Development Task

Add code
Oct 16, 2025
Viaarxiv icon

PEAR: Phase Entropy Aware Reward for Efficient Reasoning

Add code
Oct 09, 2025
Viaarxiv icon

CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking

Add code
Sep 04, 2025
Viaarxiv icon

Mirroring Users: Towards Building Preference-aligned User Simulator with User Feedback in Recommendation

Add code
Aug 25, 2025
Figure 1 for Mirroring Users: Towards Building Preference-aligned User Simulator with User Feedback in Recommendation
Figure 2 for Mirroring Users: Towards Building Preference-aligned User Simulator with User Feedback in Recommendation
Figure 3 for Mirroring Users: Towards Building Preference-aligned User Simulator with User Feedback in Recommendation
Figure 4 for Mirroring Users: Towards Building Preference-aligned User Simulator with User Feedback in Recommendation
Viaarxiv icon

Through the Valley: Path to Effective Long CoT Training for Small Language Models

Add code
Jun 09, 2025
Viaarxiv icon

Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting

Add code
May 30, 2025
Figure 1 for Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting
Figure 2 for Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting
Figure 3 for Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting
Figure 4 for Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting
Viaarxiv icon

ELABORATION: A Comprehensive Benchmark on Human-LLM Competitive Programming

Add code
May 22, 2025
Figure 1 for ELABORATION: A Comprehensive Benchmark on Human-LLM Competitive Programming
Figure 2 for ELABORATION: A Comprehensive Benchmark on Human-LLM Competitive Programming
Figure 3 for ELABORATION: A Comprehensive Benchmark on Human-LLM Competitive Programming
Figure 4 for ELABORATION: A Comprehensive Benchmark on Human-LLM Competitive Programming
Viaarxiv icon

Can Large Language Models Understand Internet Buzzwords Through User-Generated Content

Add code
May 21, 2025
Viaarxiv icon

Enhancing Code LLM Training with Programmer Attention

Add code
Mar 19, 2025
Figure 1 for Enhancing Code LLM Training with Programmer Attention
Figure 2 for Enhancing Code LLM Training with Programmer Attention
Figure 3 for Enhancing Code LLM Training with Programmer Attention
Figure 4 for Enhancing Code LLM Training with Programmer Attention
Viaarxiv icon

Breaking the Stigma! Unobtrusively Probe Symptoms in Depression Disorder Diagnosis Dialogue

Add code
Jan 25, 2025
Figure 1 for Breaking the Stigma! Unobtrusively Probe Symptoms in Depression Disorder Diagnosis Dialogue
Figure 2 for Breaking the Stigma! Unobtrusively Probe Symptoms in Depression Disorder Diagnosis Dialogue
Figure 3 for Breaking the Stigma! Unobtrusively Probe Symptoms in Depression Disorder Diagnosis Dialogue
Figure 4 for Breaking the Stigma! Unobtrusively Probe Symptoms in Depression Disorder Diagnosis Dialogue
Viaarxiv icon