Yiyi Zhou

Image Captioning via Dynamic Path Customization

Jun 01, 2024
Via arXiv

Deep Instruction Tuning for Segment Anything Model

Mar 31, 2024
Via arXiv

Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models

Mar 22, 2024
Via arXiv

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization

Mar 11, 2024
Via arXiv

Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

Mar 05, 2024
Via arXiv

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

Jan 23, 2024
Via arXiv

Towards Omni-supervised Referring Expression Segmentation

Nov 01, 2023
Via arXiv

NICE: Improving Panoptic Narrative Detection and Segmentation with Cascading Collaborative Learning

Oct 23, 2023
Via arXiv

Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models

Sep 06, 2023
Via arXiv

M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce

Aug 22, 2023
Via arXiv