Hang Zhang

EagleVision: A Dual-Stage Framework with BEV-grounding-based Chain-of-Thought for Spatial Intelligence

Dec 17, 2025
Via arXiv

Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation

Nov 12, 2025
Via arXiv

LLMBisect: Breaking Barriers in Bug Bisection with A Comparative Analysis Pipeline

Oct 30, 2025
Via arXiv

MSRepaint: Multiple Sclerosis Repaint with Conditional Denoising Diffusion Implicit Model for Bidirectional Lesion Filling and Synthesis

Oct 02, 2025
Via arXiv

UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal Data

Sep 26, 2025
Via arXiv

Ideal Registration? Segmentation is All You Need

Sep 19, 2025
Via arXiv

Gaussian Primitive Optimized Deformable Retinal Image Registration

Aug 23, 2025
Via arXiv

PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning

Aug 20, 2025
Via arXiv

Beyond the LUMIR challenge: The pathway to foundational registration models

May 30, 2025
Via arXiv

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models

May 27, 2025
Via arXiv