Picture for Yubo Wang

Yubo Wang

Hong Kong University of Science and Technology

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Add code
Aug 18, 2025
Viaarxiv icon

BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models

Add code
Aug 09, 2025
Viaarxiv icon

CROP: Integrating Topological and Spatial Structures via Cross-View Prefixes for Molecular LLMs

Add code
Aug 09, 2025
Viaarxiv icon

C-TLSAN: Content-Enhanced Time-Aware Long- and Short-Term Attention Network for Personalized Recommendation

Add code
Jun 16, 2025
Viaarxiv icon

Optimizing Recall or Relevance? A Multi-Task Multi-Head Approach for Item-to-Item Retrieval in Recommendation

Add code
Jun 06, 2025
Viaarxiv icon

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Add code
May 25, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Add code
Apr 19, 2025
Viaarxiv icon

A Survey of Large Language Models in Mental Health Disorder Detection on Social Media

Add code
Apr 03, 2025
Figure 1 for A Survey of Large Language Models in Mental Health Disorder Detection on Social Media
Figure 2 for A Survey of Large Language Models in Mental Health Disorder Detection on Social Media
Figure 3 for A Survey of Large Language Models in Mental Health Disorder Detection on Social Media
Figure 4 for A Survey of Large Language Models in Mental Health Disorder Detection on Social Media
Viaarxiv icon

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Add code
Apr 03, 2025
Viaarxiv icon

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Add code
Feb 23, 2025
Figure 1 for Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Figure 2 for Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Figure 3 for Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Figure 4 for Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Viaarxiv icon