Junsong Yuan

Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

Jun 11, 2024
Via arXiv

STAT: Towards Generalizable Temporal Action Localization

Apr 20, 2024
Via arXiv

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Mar 18, 2024
Via arXiv

FSC: Few-point Shape Completion

Mar 13, 2024
Via arXiv

Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation

Mar 03, 2024
Via arXiv

AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations

Jan 26, 2024
Via arXiv

Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models

Dec 26, 2023
Via arXiv

Relit-NeuLF: Efficient Relighting and Novel View Synthesis via Neural 4D Light Field

Oct 23, 2023
Via arXiv

NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions

Sep 27, 2023
Via arXiv

SOAR: Scene-debiasing Open-set Action Recognition

Sep 03, 2023
Via arXiv