Long Bai

More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery

Dec 10, 2025
Via arXiv

EndoIR: Degradation-Agnostic All-in-One Endoscopic Image Restoration via Noise-Aware Routing Diffusion

Nov 11, 2025
Via arXiv

Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge

Jul 22, 2025
Via arXiv

TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast

Jun 16, 2025
Via arXiv

KnowCoder-V2: Deep Knowledge Analysis

Jun 07, 2025
Via arXiv

EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery

Jun 07, 2025
Via arXiv

EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy

May 21, 2025
Via arXiv

Mixture Policy based Multi-Hop Reasoning over N-tuple Temporal Knowledge Graphs

May 19, 2025
Via arXiv

PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition

Apr 07, 2025
Via arXiv

Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery

Apr 02, 2025
Via arXiv