Picture for Yan Zhang

Yan Zhang

Fellow, IEEE

Raformer: Redundancy-Aware Transformer for Video Wire Inpainting

Add code
Apr 24, 2024
Figure 1 for Raformer: Redundancy-Aware Transformer for Video Wire Inpainting
Figure 2 for Raformer: Redundancy-Aware Transformer for Video Wire Inpainting
Figure 3 for Raformer: Redundancy-Aware Transformer for Video Wire Inpainting
Figure 4 for Raformer: Redundancy-Aware Transformer for Video Wire Inpainting
Viaarxiv icon

Cantor: Inspiring Multimodal Chain-of-Thought of MLLM

Add code
Apr 24, 2024
Viaarxiv icon

Logic Dynamic Movement Primitives for Long-horizon Manipulation Tasks in Dynamic Environments

Add code
Apr 24, 2024
Figure 1 for Logic Dynamic Movement Primitives for Long-horizon Manipulation Tasks in Dynamic Environments
Figure 2 for Logic Dynamic Movement Primitives for Long-horizon Manipulation Tasks in Dynamic Environments
Figure 3 for Logic Dynamic Movement Primitives for Long-horizon Manipulation Tasks in Dynamic Environments
Figure 4 for Logic Dynamic Movement Primitives for Long-horizon Manipulation Tasks in Dynamic Environments
Viaarxiv icon

Multi-Modal Prompt Learning on Blind Image Quality Assessment

Add code
Apr 23, 2024
Figure 1 for Multi-Modal Prompt Learning on Blind Image Quality Assessment
Figure 2 for Multi-Modal Prompt Learning on Blind Image Quality Assessment
Figure 3 for Multi-Modal Prompt Learning on Blind Image Quality Assessment
Figure 4 for Multi-Modal Prompt Learning on Blind Image Quality Assessment
Viaarxiv icon

MoE-TinyMed: Mixture of Experts for Tiny Medical Large Vision-Language Models

Add code
Apr 16, 2024
Figure 1 for MoE-TinyMed: Mixture of Experts for Tiny Medical Large Vision-Language Models
Figure 2 for MoE-TinyMed: Mixture of Experts for Tiny Medical Large Vision-Language Models
Figure 3 for MoE-TinyMed: Mixture of Experts for Tiny Medical Large Vision-Language Models
Figure 4 for MoE-TinyMed: Mixture of Experts for Tiny Medical Large Vision-Language Models
Viaarxiv icon

Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models

Add code
Apr 06, 2024
Figure 1 for Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models
Figure 2 for Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models
Figure 3 for Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models
Figure 4 for Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models
Viaarxiv icon

Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective

Add code
Apr 03, 2024
Viaarxiv icon

RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

Add code
Mar 28, 2024
Figure 1 for RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
Figure 2 for RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
Figure 3 for RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
Figure 4 for RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
Viaarxiv icon

Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

Add code
Mar 26, 2024
Figure 1 for Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
Figure 2 for Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
Figure 3 for Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
Figure 4 for Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
Viaarxiv icon

Graph Neural Networks for Learning Equivariant Representations of Neural Networks

Add code
Mar 20, 2024
Figure 1 for Graph Neural Networks for Learning Equivariant Representations of Neural Networks
Figure 2 for Graph Neural Networks for Learning Equivariant Representations of Neural Networks
Figure 3 for Graph Neural Networks for Learning Equivariant Representations of Neural Networks
Figure 4 for Graph Neural Networks for Learning Equivariant Representations of Neural Networks
Viaarxiv icon