Picture for Yun He

Yun He

LangDriveCTRL: Natural Language Controllable Driving Scene Editing with Multi-modal Agents

Add code
Dec 19, 2025
Viaarxiv icon

Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following

Add code
Nov 13, 2025
Viaarxiv icon

Boosting LLM Reasoning via Spontaneous Self-Correction

Add code
Jun 07, 2025
Figure 1 for Boosting LLM Reasoning via Spontaneous Self-Correction
Figure 2 for Boosting LLM Reasoning via Spontaneous Self-Correction
Figure 3 for Boosting LLM Reasoning via Spontaneous Self-Correction
Figure 4 for Boosting LLM Reasoning via Spontaneous Self-Correction
Viaarxiv icon

Towards An Efficient LLM Training Paradigm for CTR Prediction

Add code
Mar 02, 2025
Viaarxiv icon

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

Add code
Jan 31, 2025
Viaarxiv icon

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Add code
Jan 18, 2025
Viaarxiv icon

Unifying Generative and Dense Retrieval for Sequential Recommendation

Add code
Nov 27, 2024
Figure 1 for Unifying Generative and Dense Retrieval for Sequential Recommendation
Figure 2 for Unifying Generative and Dense Retrieval for Sequential Recommendation
Figure 3 for Unifying Generative and Dense Retrieval for Sequential Recommendation
Figure 4 for Unifying Generative and Dense Retrieval for Sequential Recommendation
Viaarxiv icon

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Add code
Oct 21, 2024
Viaarxiv icon

The Perfect Blend: Redefining RLHF with Mixture of Judges

Add code
Sep 30, 2024
Figure 1 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 2 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 3 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 4 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Viaarxiv icon

PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts

Add code
Jun 07, 2023
Figure 1 for PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Figure 2 for PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Figure 3 for PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Figure 4 for PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Viaarxiv icon