Picture for Rui Zhao

Rui Zhao

Department of Radiology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China

Unlocking the Power of SAM 2 for Few-Shot Segmentation

Add code
May 21, 2025
Figure 1 for Unlocking the Power of SAM 2 for Few-Shot Segmentation
Figure 2 for Unlocking the Power of SAM 2 for Few-Shot Segmentation
Figure 3 for Unlocking the Power of SAM 2 for Few-Shot Segmentation
Figure 4 for Unlocking the Power of SAM 2 for Few-Shot Segmentation
Viaarxiv icon

Bronchovascular Tree-Guided Weakly Supervised Learning Method for Pulmonary Segment Segmentation

Add code
May 20, 2025
Figure 1 for Bronchovascular Tree-Guided Weakly Supervised Learning Method for Pulmonary Segment Segmentation
Figure 2 for Bronchovascular Tree-Guided Weakly Supervised Learning Method for Pulmonary Segment Segmentation
Figure 3 for Bronchovascular Tree-Guided Weakly Supervised Learning Method for Pulmonary Segment Segmentation
Figure 4 for Bronchovascular Tree-Guided Weakly Supervised Learning Method for Pulmonary Segment Segmentation
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Figure 1 for Seed1.5-VL Technical Report
Figure 2 for Seed1.5-VL Technical Report
Figure 3 for Seed1.5-VL Technical Report
Figure 4 for Seed1.5-VL Technical Report
Viaarxiv icon

Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era

Add code
May 05, 2025
Viaarxiv icon

Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation

Add code
May 04, 2025
Viaarxiv icon

On the Suitability of Reinforcement Fine-Tuning to Visual Tasks

Add code
Apr 08, 2025
Figure 1 for On the Suitability of Reinforcement Fine-Tuning to Visual Tasks
Figure 2 for On the Suitability of Reinforcement Fine-Tuning to Visual Tasks
Figure 3 for On the Suitability of Reinforcement Fine-Tuning to Visual Tasks
Viaarxiv icon

Re-Aligning Language to Visual Objects with an Agentic Workflow

Add code
Mar 30, 2025
Viaarxiv icon

Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors

Add code
Mar 25, 2025
Viaarxiv icon

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Add code
Mar 13, 2025
Viaarxiv icon

Motion Anything: Any to Motion Generation

Add code
Mar 10, 2025
Viaarxiv icon