Picture for Wenkang Qin

Wenkang Qin

EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer

Add code
Sep 26, 2025
Viaarxiv icon

MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training

Add code
Sep 26, 2025
Viaarxiv icon

ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction

Add code
Aug 11, 2025
Viaarxiv icon

Humanoid Occupancy: Enabling A Generalized Multimodal Occupancy Perception System on Humanoid Robots

Add code
Jul 27, 2025
Viaarxiv icon

RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

Add code
May 29, 2025
Viaarxiv icon

WonderTurbo: Generating Interactive 3D World in 0.72 Seconds

Add code
Apr 03, 2025
Figure 1 for WonderTurbo: Generating Interactive 3D World in 0.72 Seconds
Figure 2 for WonderTurbo: Generating Interactive 3D World in 0.72 Seconds
Figure 3 for WonderTurbo: Generating Interactive 3D World in 0.72 Seconds
Figure 4 for WonderTurbo: Generating Interactive 3D World in 0.72 Seconds
Viaarxiv icon

ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration

Add code
Nov 29, 2024
Figure 1 for ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
Figure 2 for ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
Figure 3 for ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
Figure 4 for ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
Viaarxiv icon

PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology

Add code
Aug 13, 2024
Figure 1 for PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Figure 2 for PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Figure 3 for PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Figure 4 for PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology
Viaarxiv icon

SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training

Add code
Nov 10, 2023
Figure 1 for SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training
Figure 2 for SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training
Figure 3 for SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training
Figure 4 for SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training
Viaarxiv icon

What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning

Add code
Oct 31, 2023
Figure 1 for What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning
Figure 2 for What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning
Figure 3 for What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning
Figure 4 for What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning
Viaarxiv icon