Picture for Boyuan Jiang

Boyuan Jiang

Oracle Bone Inscriptions Multi-modal Dataset

Add code
Jul 04, 2024
Viaarxiv icon

NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models

Add code
May 31, 2024
Viaarxiv icon

M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition

Add code
Jan 22, 2024
Figure 1 for M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition
Figure 2 for M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition
Figure 3 for M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition
Figure 4 for M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition
Viaarxiv icon

PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization

Add code
Dec 11, 2023
Figure 1 for PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Figure 2 for PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Figure 3 for PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Figure 4 for PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Viaarxiv icon

Dynamic Frame Interpolation in Wavelet Domain

Add code
Sep 21, 2023
Figure 1 for Dynamic Frame Interpolation in Wavelet Domain
Figure 2 for Dynamic Frame Interpolation in Wavelet Domain
Figure 3 for Dynamic Frame Interpolation in Wavelet Domain
Figure 4 for Dynamic Frame Interpolation in Wavelet Domain
Viaarxiv icon

Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation

Add code
Sep 09, 2023
Figure 1 for Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
Figure 2 for Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
Figure 3 for Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
Figure 4 for Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
Viaarxiv icon

Pose-aware Attention Network for Flexible Motion Retargeting by Body Part

Add code
Jun 13, 2023
Viaarxiv icon

IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation

Add code
May 29, 2022
Figure 1 for IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
Figure 2 for IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
Figure 3 for IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
Figure 4 for IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
Viaarxiv icon

Learning Comprehensive Motion Representation for Action Recognition

Add code
Mar 23, 2021
Figure 1 for Learning Comprehensive Motion Representation for Action Recognition
Figure 2 for Learning Comprehensive Motion Representation for Action Recognition
Figure 3 for Learning Comprehensive Motion Representation for Action Recognition
Figure 4 for Learning Comprehensive Motion Representation for Action Recognition
Viaarxiv icon

Multi-Level Adaptive Region of Interest and Graph Learning for Facial Action Unit Recognition

Add code
Feb 24, 2021
Figure 1 for Multi-Level Adaptive Region of Interest and Graph Learning for Facial Action Unit Recognition
Figure 2 for Multi-Level Adaptive Region of Interest and Graph Learning for Facial Action Unit Recognition
Figure 3 for Multi-Level Adaptive Region of Interest and Graph Learning for Facial Action Unit Recognition
Figure 4 for Multi-Level Adaptive Region of Interest and Graph Learning for Facial Action Unit Recognition
Viaarxiv icon