Picture for Hanchao Yu

Hanchao Yu

Optimizing Recall or Relevance? A Multi-Task Multi-Head Approach for Item-to-Item Retrieval in Recommendation

Add code
Jun 06, 2025
Viaarxiv icon

VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

Add code
May 25, 2025
Viaarxiv icon

Inference Compute-Optimal Video Vision Language Models

Add code
May 24, 2025
Viaarxiv icon

Learning Critically: Selective Self Distillation in Federated Learning on Non-IID Data

Add code
Apr 20, 2025
Viaarxiv icon

CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning

Add code
Mar 25, 2025
Viaarxiv icon

Towards An Efficient LLM Training Paradigm for CTR Prediction

Add code
Mar 02, 2025
Viaarxiv icon

BRIDLE: Generalized Self-supervised Learning with Quantization

Add code
Feb 04, 2025
Figure 1 for BRIDLE: Generalized Self-supervised Learning with Quantization
Figure 2 for BRIDLE: Generalized Self-supervised Learning with Quantization
Figure 3 for BRIDLE: Generalized Self-supervised Learning with Quantization
Figure 4 for BRIDLE: Generalized Self-supervised Learning with Quantization
Viaarxiv icon

CompCap: Improving Multimodal Large Language Models with Composite Captions

Add code
Dec 06, 2024
Figure 1 for CompCap: Improving Multimodal Large Language Models with Composite Captions
Figure 2 for CompCap: Improving Multimodal Large Language Models with Composite Captions
Figure 3 for CompCap: Improving Multimodal Large Language Models with Composite Captions
Figure 4 for CompCap: Improving Multimodal Large Language Models with Composite Captions
Viaarxiv icon

RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training

Add code
Dec 07, 2023
Viaarxiv icon

MMViT: Multiscale Multiview Vision Transformers

Add code
Apr 28, 2023
Viaarxiv icon