Wei Zhao

Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions

May 16, 2025
Via arXiv

Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

May 12, 2025
Via arXiv

LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering

May 09, 2025
Via arXiv

TransProQA: an LLM-based literary Translation evaluation metric with Professional Question Answering

May 08, 2025
Via arXiv

Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning

Apr 22, 2025
Via arXiv

Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Apr 20, 2025
Via arXiv

DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration

Apr 07, 2025
Via arXiv

Seesaw: High-throughput LLM Inference via Model Re-sharding

Mar 09, 2025
Via arXiv

Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding

Mar 04, 2025
Via arXiv

Metering Error Estimation of Fast-Charging Stations Using Charging Data Analytics

Mar 03, 2025
Via arXiv