Picture for Minyi Zhao

Minyi Zhao

Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning

Add code
Jan 16, 2026
Viaarxiv icon

Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization

Add code
Sep 22, 2024
Figure 1 for Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization
Figure 2 for Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization
Figure 3 for Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization
Figure 4 for Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization
Viaarxiv icon

One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance

Add code
Sep 22, 2024
Viaarxiv icon

CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning

Add code
Sep 05, 2023
Figure 1 for CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning
Figure 2 for CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning
Figure 3 for CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning
Figure 4 for CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning
Viaarxiv icon

Privacy-Preserving Face Recognition Using Random Frequency Components

Add code
Aug 21, 2023
Figure 1 for Privacy-Preserving Face Recognition Using Random Frequency Components
Figure 2 for Privacy-Preserving Face Recognition Using Random Frequency Components
Figure 3 for Privacy-Preserving Face Recognition Using Random Frequency Components
Figure 4 for Privacy-Preserving Face Recognition Using Random Frequency Components
Viaarxiv icon

HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution

Add code
Jul 31, 2023
Figure 1 for HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Figure 2 for HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Figure 3 for HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Figure 4 for HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Viaarxiv icon

Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer

Add code
May 06, 2023
Figure 1 for Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer
Figure 2 for Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer
Figure 3 for Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer
Figure 4 for Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer
Viaarxiv icon

C3-STISR: Scene Text Image Super-resolution with Triple Clues

Add code
Apr 29, 2022
Figure 1 for C3-STISR: Scene Text Image Super-resolution with Triple Clues
Figure 2 for C3-STISR: Scene Text Image Super-resolution with Triple Clues
Figure 3 for C3-STISR: Scene Text Image Super-resolution with Triple Clues
Figure 4 for C3-STISR: Scene Text Image Super-resolution with Triple Clues
Viaarxiv icon

EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification

Add code
Apr 24, 2022
Figure 1 for EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
Figure 2 for EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
Figure 3 for EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
Figure 4 for EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
Viaarxiv icon

Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction

Add code
Aug 12, 2021
Figure 1 for Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction
Figure 2 for Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction
Figure 3 for Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction
Figure 4 for Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction
Viaarxiv icon