
Wei Li

Tsinghua University, Beijing, China

JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on

Aug 25, 2025
Via arXiv

Boosting Generic Semi-Supervised Medical Image Segmentation via Diverse Teaching and Label Propagation

Aug 12, 2025
Via arXiv

EnergyPatchTST: Multi-scale Time Series Transformers with Uncertainty Estimation for Energy Forecasting

Aug 07, 2025
Via arXiv

CliCARE: Grounding Large Language Models in Clinical Guidelines for Decision Support over Longitudinal Cancer Electronic Health Records

Jul 30, 2025
Via arXiv

Cross-domain Hyperspectral Image Classification based on Bi-directional Domain Adaptation

Jul 03, 2025
Via arXiv

F^2TTA: Free-Form Test-Time Adaptation on Cross-Domain Medical Image Classification via Image-Level Disentangled Prompt Tuning

Jul 03, 2025
Via arXiv

UAVD-Mamba: Deformable Token Fusion Vision Mamba for Multimodal UAV Detection

Jul 01, 2025
Via arXiv

A Survey: Learning Embodied Intelligence from Physical Simulators and World Models

Jul 01, 2025
Via arXiv

MMSearch-R1: Incentivizing LMMs to Search

Jun 25, 2025
Via arXiv

video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models

Jun 18, 2025
Via arXiv