Wei Zhao

HF-VTON: High-Fidelity Virtual Try-On via Consistent Geometric and Semantic Alignment

May 26, 2025, via arXiv

Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning

May 22, 2025, via arXiv

Conf-GNNRec: Quantifying and Calibrating the Prediction Confidence for GNN-based Recommendation Methods

May 22, 2025, via arXiv

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

May 18, 2025, via arXiv

Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions

May 16, 2025, via arXiv

Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

May 12, 2025, via arXiv

LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering

May 09, 2025, via arXiv

TransProQA: an LLM-based literary Translation evaluation metric with Professional Question Answering

May 08, 2025, via arXiv

Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning

Apr 22, 2025, via arXiv

Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Apr 20, 2025, via arXiv