Cuiling Lan

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Jul 18, 2024
Via arXiv

Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis

May 13, 2024
Via arXiv

Slot-VLM: SlowFast Slots for Video-Language Modeling

Feb 20, 2024
Via arXiv

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement

Feb 15, 2024
Via arXiv

Retrieval-based Video Language Model for Efficient Long Video Question Answering

Dec 08, 2023
Via arXiv

Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer

Oct 04, 2023
Via arXiv

Shatter and Gather: Learning Referring Image Segmentation with Text Supervision

Aug 29, 2023
Via arXiv

Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey

Aug 18, 2023
Via arXiv

Adaptive Frequency Filters As Efficient Global Token Mixers

Jul 26, 2023
Via arXiv

Vector-based Representation is the Key: A Study on Disentanglement and Compositional Generalization

May 29, 2023
Via arXiv