Xinghua Zhang

S2CDR: Smoothing-Sharpening Process Model for Cross-Domain Recommendation

Mar 03, 2026
Via arXiv

MemPO: Self-Memory Policy Optimization for Long-Horizon Agents

Feb 28, 2026
Via arXiv

ExpSeek: Self-Triggered Experience Seeking for Web Agents

Jan 13, 2026
Via arXiv

Towards Comprehensible Recommendation with Large Language Model Fine-tuning

Aug 11, 2025
Via arXiv

LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment

Jun 13, 2025
Via arXiv

EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models

Jun 10, 2025
Via arXiv

Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns

May 29, 2025
Via arXiv

Graph Wave Networks

May 26, 2025
Via arXiv

Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents

May 04, 2025
Via arXiv

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Apr 14, 2025
Via arXiv