Picture for Hao Li

Hao Li

Jack

Envision: Embodied Visual Planning via Goal-Imagery Video Diffusion

Add code
Dec 27, 2025
Viaarxiv icon

Multi-AI Agent Framework Reveals the "Oxide Gatekeeper" in Aluminum Nanoparticle Oxidation

Add code
Dec 27, 2025
Viaarxiv icon

Enabling Ultra-Fast Cardiovascular Imaging Across Heterogeneous Clinical Environments with a Generalist Foundation Model and Multimodal Database

Add code
Dec 25, 2025
Viaarxiv icon

Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition

Add code
Dec 24, 2025
Figure 1 for Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition
Figure 2 for Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition
Figure 3 for Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition
Figure 4 for Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition
Viaarxiv icon

Multimodal Sensing for Robot-Assisted Sub-Tissue Feature Detection in Physiotherapy Palpation

Add code
Dec 24, 2025
Viaarxiv icon

Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach

Add code
Dec 23, 2025
Figure 1 for Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach
Figure 2 for Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach
Figure 3 for Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach
Figure 4 for Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach
Viaarxiv icon

Distinguishing Visually Similar Actions: Prompt-Guided Semantic Prototype Modulation for Few-Shot Action Recognition

Add code
Dec 22, 2025
Figure 1 for Distinguishing Visually Similar Actions: Prompt-Guided Semantic Prototype Modulation for Few-Shot Action Recognition
Figure 2 for Distinguishing Visually Similar Actions: Prompt-Guided Semantic Prototype Modulation for Few-Shot Action Recognition
Figure 3 for Distinguishing Visually Similar Actions: Prompt-Guided Semantic Prototype Modulation for Few-Shot Action Recognition
Figure 4 for Distinguishing Visually Similar Actions: Prompt-Guided Semantic Prototype Modulation for Few-Shot Action Recognition
Viaarxiv icon

EndoStreamDepth: Temporally Consistent Monocular Depth Estimation for Endoscopic Video Streams

Add code
Dec 20, 2025
Figure 1 for EndoStreamDepth: Temporally Consistent Monocular Depth Estimation for Endoscopic Video Streams
Figure 2 for EndoStreamDepth: Temporally Consistent Monocular Depth Estimation for Endoscopic Video Streams
Figure 3 for EndoStreamDepth: Temporally Consistent Monocular Depth Estimation for Endoscopic Video Streams
Figure 4 for EndoStreamDepth: Temporally Consistent Monocular Depth Estimation for Endoscopic Video Streams
Viaarxiv icon

Geometric Laplace Neural Operator

Add code
Dec 18, 2025
Figure 1 for Geometric Laplace Neural Operator
Figure 2 for Geometric Laplace Neural Operator
Figure 3 for Geometric Laplace Neural Operator
Figure 4 for Geometric Laplace Neural Operator
Viaarxiv icon

Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video

Add code
Dec 18, 2025
Figure 1 for Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video
Figure 2 for Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video
Figure 3 for Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video
Figure 4 for Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video
Viaarxiv icon