Picture for Hao Tang

Hao Tang

ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization

Add code
Jan 17, 2024
Viaarxiv icon

Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation

Add code
Jan 15, 2024
Viaarxiv icon

SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation

Add code
Dec 26, 2023
Figure 1 for SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Figure 2 for SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Figure 3 for SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Figure 4 for SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Viaarxiv icon

Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models

Add code
Dec 26, 2023
Figure 1 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 2 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 3 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 4 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Viaarxiv icon

Enlighten-Your-Voice: When Multimodal Meets Zero-shot Low-light Image Enhancement

Add code
Dec 15, 2023
Figure 1 for Enlighten-Your-Voice: When Multimodal Meets Zero-shot Low-light Image Enhancement
Figure 2 for Enlighten-Your-Voice: When Multimodal Meets Zero-shot Low-light Image Enhancement
Figure 3 for Enlighten-Your-Voice: When Multimodal Meets Zero-shot Low-light Image Enhancement
Figure 4 for Enlighten-Your-Voice: When Multimodal Meets Zero-shot Low-light Image Enhancement
Viaarxiv icon

Learning Contrastive Self-Distillation for Ultra-Fine-Grained Visual Categorization Targeting Limited Samples

Add code
Nov 10, 2023
Viaarxiv icon

Multi-view Information Integration and Propagation for Occluded Person Re-identification

Add code
Nov 09, 2023
Figure 1 for Multi-view Information Integration and Propagation for Occluded Person Re-identification
Figure 2 for Multi-view Information Integration and Propagation for Occluded Person Re-identification
Figure 3 for Multi-view Information Integration and Propagation for Occluded Person Re-identification
Figure 4 for Multi-view Information Integration and Propagation for Occluded Person Re-identification
Viaarxiv icon

Towards High-quality HDR Deghosting with Conditional Diffusion Models

Add code
Nov 02, 2023
Viaarxiv icon

Towards Matching Phones and Speech Representations

Add code
Oct 26, 2023
Figure 1 for Towards Matching Phones and Speech Representations
Figure 2 for Towards Matching Phones and Speech Representations
Figure 3 for Towards Matching Phones and Speech Representations
Figure 4 for Towards Matching Phones and Speech Representations
Viaarxiv icon

Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data

Add code
Oct 24, 2023
Figure 1 for Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data
Figure 2 for Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data
Figure 3 for Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data
Figure 4 for Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data
Viaarxiv icon