Xiaoyu Yue

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Sep 18, 2025
Via arXiv

Transition Models: Rethinking the Generative Learning Objective

Sep 04, 2025

RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts

Aug 17, 2025

EarthLink: A Self-Evolving AI Agent for Climate Science

Jul 24, 2025

MSEarth: A Benchmark for Multimodal Scientific Comprehension of Earth Science

May 27, 2025

EarthSE: A Benchmark Evaluating Earth Scientific Exploration Capability for Large Language Models

May 22, 2025

Diffusion Models Need Visual Priors for Image Generation

Oct 11, 2024

OV-PARTS: Towards Open-Vocabulary Part Segmentation

Oct 08, 2023

Understanding Masked Autoencoders From a Local Contrastive Perspective

Oct 03, 2023

In Defense of Clip-based Video Relation Detection

Jul 18, 2023