Picture for Tong Wu

Tong Wu

Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity

Add code
Aug 07, 2025
Figure 1 for Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
Figure 2 for Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
Figure 3 for Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
Figure 4 for Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
Viaarxiv icon

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Add code
Aug 06, 2025
Viaarxiv icon

Video World Models with Long-term Spatial Memory

Add code
Jun 05, 2025
Figure 1 for Video World Models with Long-term Spatial Memory
Figure 2 for Video World Models with Long-term Spatial Memory
Figure 3 for Video World Models with Long-term Spatial Memory
Figure 4 for Video World Models with Long-term Spatial Memory
Viaarxiv icon

Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation

Add code
Jun 05, 2025
Figure 1 for Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation
Figure 2 for Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation
Figure 3 for Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation
Figure 4 for Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation
Viaarxiv icon

ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications

Add code
May 26, 2025
Viaarxiv icon

SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams

Add code
May 26, 2025
Figure 1 for SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams
Figure 2 for SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams
Figure 3 for SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams
Figure 4 for SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams
Viaarxiv icon

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Add code
May 19, 2025
Viaarxiv icon

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Add code
May 07, 2025
Figure 1 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 2 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 3 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Figure 4 for Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Viaarxiv icon

VTire: A Bimodal Visuotactile Tire with High-Resolution Sensing Capability

Add code
Apr 27, 2025
Viaarxiv icon

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

Add code
Apr 10, 2025
Viaarxiv icon