Picture for Youliang Yuan

Youliang Yuan

Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards

Add code
Oct 09, 2025
Viaarxiv icon

Towards Evaluating Proactive Risk Awareness of Multimodal Language Models

Add code
May 23, 2025
Viaarxiv icon

VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models

Add code
Mar 10, 2025
Figure 1 for VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models
Figure 2 for VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models
Figure 3 for VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models
Figure 4 for VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models
Viaarxiv icon

VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models

Add code
Feb 23, 2025
Figure 1 for VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models
Figure 2 for VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models
Figure 3 for VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models
Figure 4 for VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models
Viaarxiv icon

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Add code
Dec 24, 2024
Figure 1 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 2 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 3 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 4 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Viaarxiv icon

Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs

Add code
Oct 15, 2024
Figure 1 for Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Figure 2 for Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Figure 3 for Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Figure 4 for Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Viaarxiv icon

Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs

Add code
Oct 10, 2024
Figure 1 for Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
Figure 2 for Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
Figure 3 for Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
Figure 4 for Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
Viaarxiv icon

Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step

Add code
Oct 04, 2024
Viaarxiv icon

Learning to Ask: When LLMs Meet Unclear Instruction

Add code
Aug 31, 2024
Viaarxiv icon

On the Resilience of Multi-Agent Systems with Malicious Agents

Add code
Aug 02, 2024
Viaarxiv icon