
Yuan Huang

UrbanCraft: Urban View Extrapolation via Hierarchical Sem-Geometric Priors

May 29, 2025
Via arXiv

Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework

Feb 19, 2025
Via arXiv

Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition

Feb 17, 2025
Via arXiv

Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement

Feb 11, 2025
Via arXiv

GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos

Dec 03, 2024
Via arXiv

Empirical curvelet based Fully Convolutional Network for supervised texture image segmentation

Oct 28, 2024
Via arXiv

Review of wavelet-based unsupervised texture segmentation, advantage of adaptive wavelets

Oct 24, 2024
Via arXiv

MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

May 08, 2024
Via arXiv

MMAC-Copilot: Multi-modal Agent Collaboration Operating System Copilot

Apr 28, 2024
Via arXiv

Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

Apr 25, 2024
Via arXiv