Picture for Yuchuan Wu

Yuchuan Wu

TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence

Add code
May 30, 2025
Viaarxiv icon

ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents

Add code
May 29, 2025
Viaarxiv icon

Reverse Preference Optimization for Complex Instruction Following

Add code
May 28, 2025
Viaarxiv icon

OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction

Add code
May 26, 2025
Viaarxiv icon

EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning

Add code
Feb 18, 2025
Viaarxiv icon

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

Add code
Jan 08, 2025
Viaarxiv icon

SDPO: Segment-Level Direct Preference Optimization for Social Agents

Add code
Jan 03, 2025
Figure 1 for SDPO: Segment-Level Direct Preference Optimization for Social Agents
Figure 2 for SDPO: Segment-Level Direct Preference Optimization for Social Agents
Figure 3 for SDPO: Segment-Level Direct Preference Optimization for Social Agents
Figure 4 for SDPO: Segment-Level Direct Preference Optimization for Social Agents
Viaarxiv icon

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Add code
Sep 09, 2024
Figure 1 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 2 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 3 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 4 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Viaarxiv icon

FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents

Add code
Jun 21, 2024
Figure 1 for FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Figure 2 for FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Figure 3 for FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Figure 4 for FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Viaarxiv icon

A Survey on Self-Evolution of Large Language Models

Add code
Apr 22, 2024
Figure 1 for A Survey on Self-Evolution of Large Language Models
Figure 2 for A Survey on Self-Evolution of Large Language Models
Figure 3 for A Survey on Self-Evolution of Large Language Models
Figure 4 for A Survey on Self-Evolution of Large Language Models
Viaarxiv icon