Gaoang Wang

Zhejiang University-University of Illinois at Urbana-Champaign Institute, Zhejiang University

STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft

Jun 17, 2024
Via arXiv

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection

Jun 13, 2024
Via arXiv

CityCraft: A Real Crafter for 3D City Generation

Jun 07, 2024
Via arXiv

S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion

Jun 03, 2024
Via arXiv

FlexiFilm: Long Video Generation with Flexible Conditions

Apr 29, 2024
Via arXiv

MovieChat+: Question-aware Sparse Memory for Long Video Question Answering

Apr 26, 2024
Via arXiv

Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model

Apr 06, 2024
Via arXiv

VersaT2I: Improving Text-to-Image Models with Versatile Reward

Mar 27, 2024
Via arXiv

Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation

Mar 18, 2024
Via arXiv

MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant

Mar 07, 2024
Via arXiv