Guangyao Zhou

GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning

Jan 08, 2026
Via arXiv

SLGNet: Synergizing Structural Priors and Language-Guided Modulation for Multimodal Object Detection

Jan 05, 2026
Via arXiv

Direct Motion Models for Assessing Generated Videos

Apr 30, 2025
Via arXiv

Distributional Diffusion Models with Scoring Rules

Feb 04, 2025
Via arXiv

Diffusion Model Predictive Control

Oct 07, 2024
Via arXiv

DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors

Sep 26, 2024
Via arXiv

Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed Environments

Jan 11, 2024
Via arXiv

Multi Loss-based Feature Fusion and Top Two Voting Ensemble Decision Strategy for Facial Expression Recognition in the Wild

Nov 06, 2023
Via arXiv

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation

Aug 31, 2023
Via arXiv

Graph schemas as abstractions for transfer learning, inference, and planning

Feb 14, 2023
Via arXiv