Jinlin Wu

MAT-Cell: A Multi-Agent Tree-Structured Reasoning Framework for Batch-Level Single-Cell Annotation

Apr 07, 2026
Via arXiv

The Eleventh NTIRE 2026 Efficient Super-Resolution Challenge Report

Apr 03, 2026
Via arXiv

UniDA3D: A Unified Domain-Adaptive Framework for Multi-View 3D Object Detection

Mar 30, 2026
Via arXiv

Geometry OR Tracker: Universal Geometric Operating Room Tracking

Feb 28, 2026
Via arXiv

UniSurg: A Video-Native Foundation Model for Universal Understanding of Surgical Videos

Feb 05, 2026
Via arXiv

Anatomy-R1: Enhancing Anatomy Reasoning in Multimodal Large Language Models via Anatomical Similarity Curriculum and Group Diversity Augmentation

Dec 24, 2025
Via arXiv

6DAttack: Backdoor Attacks in the 6DoF Pose Estimation

Dec 22, 2025
Via arXiv

Multimodal Causal-Driven Representation Learning for Generalizable Medical Image Segmentation

Aug 07, 2025
Via arXiv

SurgVisAgent: Multimodal Agentic Model for Versatile Surgical Visual Enhancement

Jul 03, 2025
Via arXiv

SA-Person: Text-Based Person Retrieval with Scene-aware Re-ranking

May 30, 2025
Via arXiv