Xiangyang Zhu

A^3: Towards Advertising Aesthetic Assessment

Mar 25, 2026
Via arXiv

UniDial-EvalKit: A Unified Toolkit for Evaluating Multi-Faceted Conversational Abilities

Mar 24, 2026
Via arXiv

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Mar 02, 2026
Via arXiv

Surveillance Facial Image Quality Assessment: A Multi-dimensional Dataset and Lightweight Model

Feb 07, 2026
Via arXiv

Automated Safety Benchmarking: A Multi-agent Pipeline for LVLMs

Jan 27, 2026
Via arXiv

QualiRAG: Retrieval-Augmented Generation for Visual Quality Understanding

Jan 26, 2026
Via arXiv

Embodied Image Compression

Dec 12, 2025
Via arXiv

One Battle After Another: Probing LLMs' Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework

Nov 05, 2025
Via arXiv

VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results

Sep 11, 2025
Via arXiv

SDEval: Safety Dynamic Evaluation for Multimodal Large Language Models

Aug 08, 2025
Via arXiv