Picture for Ruoyu Wang

Ruoyu Wang

CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition

Add code
Jun 18, 2024
Viaarxiv icon

MALT: Multi-scale Action Learning Transformer for Online Action Detection

Add code
May 31, 2024
Viaarxiv icon

Quality-aware Masked Diffusion Transformer for Enhanced Music Generation

Add code
May 24, 2024
Viaarxiv icon

LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots

Add code
May 24, 2024
Figure 1 for LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots
Figure 2 for LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots
Figure 3 for LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots
Figure 4 for LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots
Viaarxiv icon

Multi-agent Traffic Prediction via Denoised Endpoint Distribution

Add code
May 11, 2024
Figure 1 for Multi-agent Traffic Prediction via Denoised Endpoint Distribution
Figure 2 for Multi-agent Traffic Prediction via Denoised Endpoint Distribution
Figure 3 for Multi-agent Traffic Prediction via Denoised Endpoint Distribution
Figure 4 for Multi-agent Traffic Prediction via Denoised Endpoint Distribution
Viaarxiv icon

Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion

Add code
Apr 03, 2024
Figure 1 for Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion
Figure 2 for Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion
Figure 3 for Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion
Figure 4 for Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion
Viaarxiv icon

TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes

Add code
Apr 03, 2024
Figure 1 for TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes
Figure 2 for TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes
Figure 3 for TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes
Figure 4 for TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes
Viaarxiv icon

UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation

Add code
Mar 23, 2024
Viaarxiv icon

Multitask frame-level learning for few-shot sound event detection

Add code
Mar 17, 2024
Figure 1 for Multitask frame-level learning for few-shot sound event detection
Figure 2 for Multitask frame-level learning for few-shot sound event detection
Figure 3 for Multitask frame-level learning for few-shot sound event detection
Figure 4 for Multitask frame-level learning for few-shot sound event detection
Viaarxiv icon

StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images

Add code
Mar 14, 2024
Figure 1 for StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images
Figure 2 for StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images
Figure 3 for StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images
Figure 4 for StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images
Viaarxiv icon