Picture for Jiahao Xia

Jiahao Xia

Interpretable Zero-shot Referring Expression Comprehension with Query-driven Scene Graphs

Add code
Mar 26, 2026
Viaarxiv icon

Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement

Add code
Mar 20, 2026
Viaarxiv icon

Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images

Add code
Jul 29, 2025
Figure 1 for Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images
Figure 2 for Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images
Figure 3 for Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images
Figure 4 for Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images
Viaarxiv icon

Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints

Add code
Jul 16, 2025
Viaarxiv icon

Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment

Add code
Mar 28, 2025
Figure 1 for Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment
Figure 2 for Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment
Figure 3 for Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment
Figure 4 for Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment
Viaarxiv icon

FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model

Add code
Mar 12, 2025
Viaarxiv icon

Unsupervised Part Discovery via Dual Representation Alignment

Add code
Aug 15, 2024
Figure 1 for Unsupervised Part Discovery via Dual Representation Alignment
Figure 2 for Unsupervised Part Discovery via Dual Representation Alignment
Figure 3 for Unsupervised Part Discovery via Dual Representation Alignment
Figure 4 for Unsupervised Part Discovery via Dual Representation Alignment
Viaarxiv icon

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

Add code
Jul 17, 2024
Figure 1 for Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Figure 2 for Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Figure 3 for Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Figure 4 for Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Viaarxiv icon

Differential Encoding for Improved Representation Learning over Graphs

Add code
Jul 03, 2024
Figure 1 for Differential Encoding for Improved Representation Learning over Graphs
Figure 2 for Differential Encoding for Improved Representation Learning over Graphs
Figure 3 for Differential Encoding for Improved Representation Learning over Graphs
Figure 4 for Differential Encoding for Improved Representation Learning over Graphs
Viaarxiv icon

Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data

Add code
Oct 24, 2023
Figure 1 for Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data
Figure 2 for Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data
Figure 3 for Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data
Figure 4 for Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data
Viaarxiv icon