Picture for Wentao Wang

Wentao Wang

Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning

Add code
Mar 23, 2026
Viaarxiv icon

Cluster-Aware Attention-Based Deep Reinforcement Learning for Pickup and Delivery Problems

Add code
Mar 09, 2026
Viaarxiv icon

RELIC: Evaluating Compositional Instruction Following via Language Recognition

Add code
Jun 05, 2025
Viaarxiv icon

High-Quality 3D Head Reconstruction from Any Single Portrait Image

Add code
Mar 11, 2025
Figure 1 for High-Quality 3D Head Reconstruction from Any Single Portrait Image
Figure 2 for High-Quality 3D Head Reconstruction from Any Single Portrait Image
Figure 3 for High-Quality 3D Head Reconstruction from Any Single Portrait Image
Figure 4 for High-Quality 3D Head Reconstruction from Any Single Portrait Image
Viaarxiv icon

Rapid Word Learning Through Meta In-Context Learning

Add code
Feb 20, 2025
Figure 1 for Rapid Word Learning Through Meta In-Context Learning
Figure 2 for Rapid Word Learning Through Meta In-Context Learning
Figure 3 for Rapid Word Learning Through Meta In-Context Learning
Figure 4 for Rapid Word Learning Through Meta In-Context Learning
Viaarxiv icon

TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection

Add code
Jan 24, 2025
Figure 1 for TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection
Figure 2 for TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection
Figure 3 for TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection
Figure 4 for TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection
Viaarxiv icon

GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Add code
Nov 27, 2024
Figure 1 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
Figure 2 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
Figure 3 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
Figure 4 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data
Viaarxiv icon

Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives

Add code
Oct 02, 2024
Figure 1 for Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives
Figure 2 for Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives
Figure 3 for Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives
Figure 4 for Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives
Viaarxiv icon

LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details

Add code
Oct 01, 2024
Figure 1 for LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details
Figure 2 for LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details
Figure 3 for LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details
Figure 4 for LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details
Viaarxiv icon

FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation

Add code
Sep 29, 2024
Figure 1 for FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation
Figure 2 for FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation
Figure 3 for FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation
Figure 4 for FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation
Viaarxiv icon