Picture for Xinghua Zhang

Xinghua Zhang

LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment

Add code
Jun 13, 2025
Viaarxiv icon

EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models

Add code
Jun 10, 2025
Viaarxiv icon

Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns

Add code
May 29, 2025
Viaarxiv icon

Graph Wave Networks

Add code
May 26, 2025
Viaarxiv icon

Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents

Add code
May 04, 2025
Viaarxiv icon

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Add code
Apr 14, 2025
Viaarxiv icon

Exploring Preference-Guided Diffusion Model for Cross-Domain Recommendation

Add code
Jan 20, 2025
Viaarxiv icon

DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling

Add code
Dec 06, 2024
Figure 1 for DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
Figure 2 for DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
Figure 3 for DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
Figure 4 for DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
Viaarxiv icon

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization

Add code
Nov 09, 2024
Figure 1 for IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
Figure 2 for IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
Figure 3 for IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
Figure 4 for IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
Viaarxiv icon

On the Role of Attention Heads in Large Language Model Safety

Add code
Oct 17, 2024
Figure 1 for On the Role of Attention Heads in Large Language Model Safety
Figure 2 for On the Role of Attention Heads in Large Language Model Safety
Figure 3 for On the Role of Attention Heads in Large Language Model Safety
Figure 4 for On the Role of Attention Heads in Large Language Model Safety
Viaarxiv icon