Picture for Quan Kong

Quan Kong

Woven by Toyota, Inc., Tokyo, Japan

Synthetic Visual Genome

Add code
Jun 09, 2025
Viaarxiv icon

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory

Add code
May 29, 2025
Viaarxiv icon

Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras

Add code
May 23, 2025
Viaarxiv icon

Evaluation of Mobile Environment for Vehicular Visible Light Communication Using Multiple LEDs and Event Cameras

Add code
May 21, 2025
Viaarxiv icon

Just Dance with $π$! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection

Add code
May 19, 2025
Viaarxiv icon

GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding

Add code
May 15, 2025
Viaarxiv icon

E-VLC: A Real-World Dataset for Event-based Visible Light Communication And Localization

Add code
Apr 25, 2025
Viaarxiv icon

SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance

Add code
Oct 24, 2024
Figure 1 for SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance
Figure 2 for SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance
Figure 3 for SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance
Figure 4 for SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance
Viaarxiv icon

Reprojection Errors as Prompts for Efficient Scene Coordinate Regression

Add code
Sep 06, 2024
Viaarxiv icon

WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding

Add code
Jul 22, 2024
Figure 1 for WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Figure 2 for WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Figure 3 for WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Figure 4 for WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Viaarxiv icon