Picture for Yongtao Wang

Yongtao Wang

QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models

Add code
Apr 03, 2026
Viaarxiv icon

ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents

Add code
Mar 25, 2026
Viaarxiv icon

R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection

Add code
Mar 12, 2026
Viaarxiv icon

YOLO-NAS-Bench: A Surrogate Benchmark with Self-Evolving Predictors for YOLO Architecture Search

Add code
Mar 10, 2026
Viaarxiv icon

KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System

Add code
Dec 23, 2025
Viaarxiv icon

HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving

Add code
Nov 10, 2025
Viaarxiv icon

InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection

Add code
Sep 10, 2025
Figure 1 for InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
Figure 2 for InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
Figure 3 for InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
Figure 4 for InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
Viaarxiv icon

RegCL: Continual Adaptation of Segment Anything Model via Model Merging

Add code
Jul 16, 2025
Figure 1 for RegCL: Continual Adaptation of Segment Anything Model via Model Merging
Figure 2 for RegCL: Continual Adaptation of Segment Anything Model via Model Merging
Figure 3 for RegCL: Continual Adaptation of Segment Anything Model via Model Merging
Figure 4 for RegCL: Continual Adaptation of Segment Anything Model via Model Merging
Viaarxiv icon

RFTF: Reinforcement Fine-tuning for Embodied Agents with Temporal Feedback

Add code
May 26, 2025
Viaarxiv icon

VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion

Add code
May 25, 2025
Figure 1 for VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion
Figure 2 for VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion
Figure 3 for VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion
Figure 4 for VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion
Viaarxiv icon