Picture for Xuanyu Zheng

Xuanyu Zheng

VCap: Hypergeometric Rewards for Weak-to-Strong Visual Captioning

Add code
May 27, 2026
Viaarxiv icon

From Pixels to Words -- Towards Native One-Vision Models at Scale

Add code
May 27, 2026
Viaarxiv icon

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

Add code
Mar 24, 2026
Viaarxiv icon

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Add code
Aug 29, 2025
Figure 1 for ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding
Figure 2 for ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding
Figure 3 for ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding
Figure 4 for ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding
Viaarxiv icon

Recent Advances in Data-driven Intelligent Control for Wireless Communication: A Comprehensive Survey

Add code
Aug 06, 2024
Viaarxiv icon