Picture for Yongtao Wang

Yongtao Wang

RFTF: Reinforcement Fine-tuning for Embodied Agents with Temporal Feedback

Add code
May 26, 2025
Viaarxiv icon

VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion

Add code
May 25, 2025
Viaarxiv icon

T2VUnlearning: A Concept Erasing Method for Text-to-Video Diffusion Models

Add code
May 23, 2025
Viaarxiv icon

RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection

Add code
Feb 18, 2025
Viaarxiv icon

OccGS: Zero-shot 3D Occupancy Reconstruction with Semantic and Geometric-Aware Gaussian Splatting

Add code
Feb 07, 2025
Viaarxiv icon

OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

Add code
Nov 26, 2024
Figure 1 for OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Figure 2 for OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Figure 3 for OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Figure 4 for OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Viaarxiv icon

TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement

Add code
Oct 15, 2024
Figure 1 for TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement
Figure 2 for TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement
Figure 3 for TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement
Figure 4 for TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement
Viaarxiv icon

Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts

Add code
Oct 08, 2024
Figure 1 for Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Figure 2 for Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Figure 3 for Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Figure 4 for Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Viaarxiv icon

RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network

Add code
Sep 08, 2024
Figure 1 for RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
Figure 2 for RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
Figure 3 for RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
Figure 4 for RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
Viaarxiv icon

NAS-BNN: Neural Architecture Search for Binary Neural Networks

Add code
Aug 28, 2024
Figure 1 for NAS-BNN: Neural Architecture Search for Binary Neural Networks
Figure 2 for NAS-BNN: Neural Architecture Search for Binary Neural Networks
Figure 3 for NAS-BNN: Neural Architecture Search for Binary Neural Networks
Figure 4 for NAS-BNN: Neural Architecture Search for Binary Neural Networks
Viaarxiv icon