Picture for Quan Kong

Quan Kong

Woven by Toyota, Inc., Tokyo, Japan

ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding

Add code
Mar 23, 2026
Viaarxiv icon

Vision-TTT: Efficient and Expressive Visual Representation Learning with Test-Time Training

Add code
Feb 28, 2026
Viaarxiv icon

TrajTok: Learning Trajectory Tokens enables better Video Understanding

Add code
Feb 26, 2026
Viaarxiv icon

Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection

Add code
Aug 08, 2025
Viaarxiv icon

Synthetic Visual Genome

Add code
Jun 09, 2025
Viaarxiv icon

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory

Add code
May 29, 2025
Viaarxiv icon

Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras

Add code
May 23, 2025
Viaarxiv icon

Evaluation of Mobile Environment for Vehicular Visible Light Communication Using Multiple LEDs and Event Cameras

Add code
May 21, 2025
Viaarxiv icon

Just Dance with $π$! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection

Add code
May 19, 2025
Viaarxiv icon

GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding

Add code
May 15, 2025
Viaarxiv icon