Shanshan Zhao

InpaintHuman: Reconstructing Occluded Humans with Multi-Scale UV Mapping and Identity-Preserving Diffusion Inpainting

Jan 05, 2026

Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images

Nov 10, 2025

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

May 05, 2025

JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation

Mar 31, 2025

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Dec 25, 2024

Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation

Sep 04, 2024

Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment

Aug 29, 2024

UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather

Apr 08, 2024

Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis

Mar 17, 2024

When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability

Mar 01, 2024