Scene Recognition


UniSurg: A Video-Native Foundation Model for Universal Understanding of Surgical Videos

Add code
Feb 05, 2026
Viaarxiv icon

TreeLoc: 6-DoF LiDAR Global Localization in Forests via Inter-Tree Geometric Matching

Add code
Feb 03, 2026
Viaarxiv icon

TiCLS : Tightly Coupled Language Text Spotter

Add code
Feb 03, 2026
Viaarxiv icon

GDPR-Compliant Person Recognition in Industrial Environments Using MEMS-LiDAR and Hybrid Data

Add code
Feb 02, 2026
Viaarxiv icon

LLM-Driven Scenario-Aware Planning for Autonomous Driving

Add code
Jan 29, 2026
Viaarxiv icon

Text is All You Need for Vision-Language Model Jailbreaking

Add code
Jan 31, 2026
Viaarxiv icon

Stealthy Coverage Control for Human-enabled Real-Time 3D Reconstruction

Add code
Jan 31, 2026
Viaarxiv icon

Invariance on Manifolds: Understanding Robust Visual Representations for Place Recognition

Add code
Jan 31, 2026
Viaarxiv icon

TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment

Add code
Jan 27, 2026
Viaarxiv icon

InspecSafe-V1: A Multimodal Benchmark for Safety Assessment in Industrial Inspection Scenarios

Add code
Jan 29, 2026
Viaarxiv icon