Picture for Sirui Han

Sirui Han

LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning

Add code
Jun 09, 2025
Viaarxiv icon

SafeLawBench: Towards Safe Alignment of Large Language Models

Add code
Jun 07, 2025
Viaarxiv icon

Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning

Add code
Jun 05, 2025
Viaarxiv icon

FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation

Add code
May 30, 2025
Viaarxiv icon

InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback

Add code
May 29, 2025
Viaarxiv icon

The Mirage of Multimodality: Where Truth is Tested and Honesty Unravels

Add code
May 26, 2025
Viaarxiv icon

Generative RLHF-V: Learning Principles from Multi-modal Human Preference

Add code
May 24, 2025
Viaarxiv icon

Mitigating Deceptive Alignment via Self-Monitoring

Add code
May 24, 2025
Viaarxiv icon

Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning

Add code
May 20, 2025
Viaarxiv icon

J1: Exploring Simple Test-Time Scaling for LLM-as-a-Judge

Add code
May 17, 2025
Viaarxiv icon