Picture for Zhuo Liu

Zhuo Liu

RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models

Add code
Dec 08, 2025
Viaarxiv icon

AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Figure 2 for AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Figure 3 for AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Figure 4 for AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Viaarxiv icon

Assistant-Guided Mitigation of Teacher Preference Bias in LLM-as-a-Judge

Add code
May 25, 2025
Viaarxiv icon

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Add code
Apr 09, 2025
Viaarxiv icon

Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment

Add code
Feb 20, 2025
Figure 1 for Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
Figure 2 for Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
Figure 3 for Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
Figure 4 for Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
Viaarxiv icon

Mitigating Hallucinations in Multimodal Spatial Relations through Constraint-Aware Prompting

Add code
Feb 12, 2025
Viaarxiv icon

Same Company, Same Signal: The Role of Identity in Earnings Call Transcripts

Add code
Dec 23, 2024
Figure 1 for Same Company, Same Signal: The Role of Identity in Earnings Call Transcripts
Figure 2 for Same Company, Same Signal: The Role of Identity in Earnings Call Transcripts
Figure 3 for Same Company, Same Signal: The Role of Identity in Earnings Call Transcripts
Figure 4 for Same Company, Same Signal: The Role of Identity in Earnings Call Transcripts
Viaarxiv icon

What Are Step-Level Reward Models Rewarding? Counterintuitive Findings from MCTS-Boosted Mathematical Reasoning

Add code
Dec 20, 2024
Viaarxiv icon

On the Role of Model Prior in Real-World Inductive Reasoning

Add code
Dec 18, 2024
Figure 1 for On the Role of Model Prior in Real-World Inductive Reasoning
Figure 2 for On the Role of Model Prior in Real-World Inductive Reasoning
Figure 3 for On the Role of Model Prior in Real-World Inductive Reasoning
Figure 4 for On the Role of Model Prior in Real-World Inductive Reasoning
Viaarxiv icon

CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models

Add code
Sep 04, 2024
Figure 1 for CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
Figure 2 for CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
Figure 3 for CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
Figure 4 for CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
Viaarxiv icon