Xuanyu Zhang

VQ-Jarvis: Retrieval-Augmented Video Restoration Agent with Sharp Vision and Fast Thought

Mar 24, 2026
Via arXiv

OARS: Process-Aware Online Alignment for Generative Real-World Image Super-Resolution

Mar 13, 2026
Via arXiv

Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution

Feb 28, 2026
Via arXiv

TourPlanner: A Competitive Consensus Framework with Constraint-Gated Reinforcement Learning for Travel Planning

Jan 08, 2026
Via arXiv

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Nov 16, 2025
Via arXiv

Voxtral

Jul 17, 2025
Via arXiv

Magistral

Jun 12, 2025
Via arXiv

AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection

May 21, 2025
Via arXiv

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

May 08, 2025
Via arXiv

Q-Insight: Understanding Image Quality via Visual Reinforcement Learning

Mar 28, 2025
Via arXiv