Picture for Junsong Yuan

Junsong Yuan

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Add code
Jul 15, 2024
Viaarxiv icon

Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images

Add code
Jul 12, 2024
Viaarxiv icon

Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

Add code
Jun 11, 2024
Viaarxiv icon

STAT: Towards Generalizable Temporal Action Localization

Add code
Apr 20, 2024
Figure 1 for STAT: Towards Generalizable Temporal Action Localization
Figure 2 for STAT: Towards Generalizable Temporal Action Localization
Figure 3 for STAT: Towards Generalizable Temporal Action Localization
Figure 4 for STAT: Towards Generalizable Temporal Action Localization
Viaarxiv icon

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Add code
Mar 18, 2024
Figure 1 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 2 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 3 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 4 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Viaarxiv icon

FSC: Few-point Shape Completion

Add code
Mar 13, 2024
Figure 1 for FSC: Few-point Shape Completion
Figure 2 for FSC: Few-point Shape Completion
Figure 3 for FSC: Few-point Shape Completion
Figure 4 for FSC: Few-point Shape Completion
Viaarxiv icon

Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation

Add code
Mar 03, 2024
Figure 1 for Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation
Figure 2 for Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation
Figure 3 for Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation
Figure 4 for Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation
Viaarxiv icon

AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations

Add code
Jan 26, 2024
Figure 1 for AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations
Figure 2 for AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations
Figure 3 for AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations
Figure 4 for AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations
Viaarxiv icon

Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models

Add code
Dec 26, 2023
Figure 1 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 2 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 3 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 4 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Viaarxiv icon

Relit-NeuLF: Efficient Relighting and Novel View Synthesis via Neural 4D Light Field

Add code
Oct 23, 2023
Viaarxiv icon