Daqing Liu

Decompose Semantic Shifts for Composed Image Retrieval

Sep 18, 2023
Via arXiv

Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation

Jun 01, 2023
Via arXiv

MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis

May 10, 2023
Via arXiv

ESceme: Vision-and-Language Navigation with Episodic Scene Memory

Mar 07, 2023
Via arXiv

OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System

Mar 01, 2023
Via arXiv

Eliminating Prior Bias for Semantic Image Editing via Dual-Cycle Diffusion

Feb 07, 2023
Via arXiv

Cross-Modal Contrastive Learning for Robust Reasoning in VQA

Nov 21, 2022
Via arXiv

SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders

Jun 25, 2022
Via arXiv

TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer

Jun 14, 2022
Via arXiv

Modeling Image Composition for Complex Scene Generation

Jun 02, 2022
Via arXiv