Picture for Xiyang Wu

Xiyang Wu

Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks

Add code
Apr 22, 2026
Viaarxiv icon

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

Add code
Apr 07, 2026
Viaarxiv icon

SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action Models

Add code
Mar 26, 2026
Viaarxiv icon

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Add code
Mar 10, 2026
Viaarxiv icon

First Frame Is the Place to Go for Video Content Customization

Add code
Nov 19, 2025
Viaarxiv icon

Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation

Add code
Jun 18, 2025
Viaarxiv icon

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos

Add code
May 02, 2025
Viaarxiv icon

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey

Add code
Jan 04, 2025
Figure 1 for Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey
Figure 2 for Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey
Figure 3 for Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey
Figure 4 for Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey
Viaarxiv icon

SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining

Add code
Sep 26, 2024
Figure 1 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 2 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 3 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 4 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Viaarxiv icon

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Add code
Jun 16, 2024
Figure 1 for AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Figure 2 for AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Figure 3 for AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Figure 4 for AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Viaarxiv icon