Xiaoyu Yue

Transition Models: Rethinking the Generative Learning Objective

Sep 04, 2025

RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts

Aug 17, 2025

EarthLink: A Self-Evolving AI Agent for Climate Science

Jul 24, 2025

MSEarth: A Benchmark for Multimodal Scientific Comprehension of Earth Science

May 27, 2025

EarthSE: A Benchmark Evaluating Earth Scientific Exploration Capability for Large Language Models

May 22, 2025

Diffusion Models Need Visual Priors for Image Generation

Oct 11, 2024

OV-PARTS: Towards Open-Vocabulary Part Segmentation

Oct 08, 2023

Understanding Masked Autoencoders From a Local Contrastive Perspective

Oct 03, 2023

In Defense of Clip-based Video Relation Detection

Jul 18, 2023

Rethinking the Two-Stage Framework for Grounded Situation Recognition

Dec 10, 2021