Picture for Wenbing Tao

Wenbing Tao

PET-DINO: Unifying Visual Cues into Grounding DINO with Prompt-Enriched Training

Add code
Apr 01, 2026
Viaarxiv icon

ORMOT: A Dataset and Framework for Omnidirectional Referring Multi-Object Tracking

Add code
Mar 05, 2026
Viaarxiv icon

RT-RMOT: A Dataset and Framework for RGB-Thermal Referring Multi-Object Tracking

Add code
Feb 25, 2026
Viaarxiv icon

VGGT-Motion: Motion-Aware Calibration-Free Monocular SLAM for Long-Range Consistency

Add code
Feb 05, 2026
Viaarxiv icon

DRMOT: A Dataset and Framework for RGBD Referring Multi-Object Tracking

Add code
Feb 04, 2026
Viaarxiv icon

ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking

Add code
May 26, 2025
Figure 1 for ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking
Figure 2 for ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking
Figure 3 for ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking
Figure 4 for ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking
Viaarxiv icon

Perception-R1: Pioneering Perception Policy with Reinforcement Learning

Add code
Apr 10, 2025
Figure 1 for Perception-R1: Pioneering Perception Policy with Reinforcement Learning
Figure 2 for Perception-R1: Pioneering Perception Policy with Reinforcement Learning
Figure 3 for Perception-R1: Pioneering Perception Policy with Reinforcement Learning
Figure 4 for Perception-R1: Pioneering Perception Policy with Reinforcement Learning
Viaarxiv icon

OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer

Add code
Mar 13, 2025
Figure 1 for OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
Figure 2 for OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
Figure 3 for OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
Figure 4 for OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
Viaarxiv icon

Unhackable Temporal Rewarding for Scalable Video MLLMs

Add code
Feb 17, 2025
Figure 1 for Unhackable Temporal Rewarding for Scalable Video MLLMs
Figure 2 for Unhackable Temporal Rewarding for Scalable Video MLLMs
Figure 3 for Unhackable Temporal Rewarding for Scalable Video MLLMs
Figure 4 for Unhackable Temporal Rewarding for Scalable Video MLLMs
Viaarxiv icon

Cross-View Referring Multi-Object Tracking

Add code
Dec 23, 2024
Figure 1 for Cross-View Referring Multi-Object Tracking
Figure 2 for Cross-View Referring Multi-Object Tracking
Figure 3 for Cross-View Referring Multi-Object Tracking
Figure 4 for Cross-View Referring Multi-Object Tracking
Viaarxiv icon