Picture for Ziyang Luo

Ziyang Luo

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Add code
Jan 07, 2026
Viaarxiv icon

Towards Comprehensive Stage-wise Benchmarking of Large Language Models in Fact-Checking

Add code
Jan 06, 2026
Viaarxiv icon

MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique

Add code
Nov 12, 2025
Figure 1 for MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique
Figure 2 for MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique
Figure 3 for MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique
Figure 4 for MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique
Viaarxiv icon

MemeArena: Automating Context-Aware Unbiased Evaluation of Harmfulness Understanding for Multimodal Large Language Models

Add code
Oct 31, 2025
Viaarxiv icon

EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty

Add code
Oct 01, 2025
Figure 1 for EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty
Figure 2 for EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty
Figure 3 for EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty
Figure 4 for EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty
Viaarxiv icon

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Add code
Aug 20, 2025
Figure 1 for MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
Figure 2 for MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
Figure 3 for MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
Figure 4 for MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
Viaarxiv icon

RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns

Add code
Aug 18, 2025
Figure 1 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 2 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 3 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 4 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Viaarxiv icon

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Add code
Jul 02, 2025
Viaarxiv icon

TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models

Add code
Jun 13, 2025
Viaarxiv icon

Long-Distance Field Demonstration of Imaging-Free Drone Identification in Intracity Environments

Add code
Apr 26, 2025
Figure 1 for Long-Distance Field Demonstration of Imaging-Free Drone Identification in Intracity Environments
Figure 2 for Long-Distance Field Demonstration of Imaging-Free Drone Identification in Intracity Environments
Figure 3 for Long-Distance Field Demonstration of Imaging-Free Drone Identification in Intracity Environments
Figure 4 for Long-Distance Field Demonstration of Imaging-Free Drone Identification in Intracity Environments
Viaarxiv icon