Picture for Kento Sasaki

Kento Sasaki

Understanding Sensitivity of Differential Attention through the Lens of Adversarial Robustness

Add code
Oct 01, 2025
Viaarxiv icon

STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes

Add code
Aug 14, 2025
Viaarxiv icon

One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression

Add code
Jan 17, 2025
Figure 1 for One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
Figure 2 for One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
Figure 3 for One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
Figure 4 for One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
Viaarxiv icon

CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving

Add code
Aug 19, 2024
Figure 1 for CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
Figure 2 for CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
Figure 3 for CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
Figure 4 for CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
Viaarxiv icon

Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese

Add code
Apr 11, 2024
Viaarxiv icon

Machine-learning-enhanced quantum sensors for accurate magnetic field imaging

Add code
Feb 01, 2022
Viaarxiv icon