Qiang Zhou

Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval

Sep 30, 2024
Via arXiv

I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing

Aug 26, 2024
Via arXiv

INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model

Jul 23, 2024
Via arXiv

Gaussian Process Model with Tensorial Inputs and Its Application to the Design of 3D Printed Antennas

Jul 19, 2024
Via arXiv

Training LLMs to Better Self-Debug and Explain Code

May 28, 2024
Via arXiv

Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach

Jan 28, 2024
Via arXiv

DMT: Comprehensive Distillation with Multiple Self-supervised Teachers

Dec 19, 2023
Via arXiv

Language-guided Few-shot Semantic Segmentation

Nov 23, 2023
Via arXiv

InfMLLM: A Unified Framework for Visual-Language Tasks

Nov 12, 2023
Via arXiv

PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection

Oct 11, 2023
Via arXiv