photo


NRGS-SLAM: Monocular Non-Rigid SLAM for Endoscopy via Deformation-Aware 3D Gaussian Splatting

Add code
Feb 19, 2026
Viaarxiv icon

Markerless 6D Pose Estimation and Position-Based Visual Servoing for Endoscopic Continuum Manipulators

Add code
Feb 18, 2026
Viaarxiv icon

Visual Persuasion: What Influences Decisions of Vision-Language Models?

Add code
Feb 17, 2026
Viaarxiv icon

Semantic-aware Adversarial Fine-tuning for CLIP

Add code
Feb 12, 2026
Viaarxiv icon

Advancing Digital Twin Generation Through a Novel Simulation Framework and Quantitative Benchmarking

Add code
Feb 11, 2026
Viaarxiv icon

SemiNFT: Learning to Transfer Presets from Imitation to Appreciation via Hybrid-Sample Reinforcement Learning

Add code
Feb 09, 2026
Viaarxiv icon

Visual Word Sense Disambiguation with CLIP through Dual-Channel Text Prompting and Image Augmentations

Add code
Feb 06, 2026
Viaarxiv icon

CommCP: Efficient Multi-Agent Coordination via LLM-Based Communication with Conformal Prediction

Add code
Feb 05, 2026
Viaarxiv icon

Do Vision-Language Models Respect Contextual Integrity in Location Disclosure?

Add code
Feb 04, 2026
Viaarxiv icon

"I'm happy even though it's not real": GenAI Photo Editing as a Remembering Experience

Add code
Feb 03, 2026
Viaarxiv icon