Xiangyu Zhao

Victor

AdaSwitch: Adaptive Switching Generation for Knowledge Distillation

Oct 09, 2025

Causality-aware Graph Aggregation Weight Estimator for Popularity Debiasing in Top-K Recommendation

Oct 06, 2025

Empowering Denoising Sequential Recommendation with Large Language Model Embeddings

Oct 05, 2025

GenExam: A Multidisciplinary Text-to-Image Exam

Sep 17, 2025

GLEAM: Learning to Match and Explain in Cross-View Geo-Localization

Sep 09, 2025

Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems

Sep 09, 2025

GeoArena: An Open Platform for Benchmarking Large Vision-language Models on WorldWide Image Geolocalization

Sep 04, 2025

Generative Auto-Bidding in Large-Scale Competitive Auctions via Diffusion Completer-Aligner

Sep 03, 2025

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Aug 28, 2025

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Aug 25, 2025