Heng Tao Shen

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Sep 09, 2024
Via arXiv

VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization

Sep 02, 2024
Via arXiv

DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion

Aug 13, 2024
Via arXiv

GalleryGPT: Analyzing Paintings with Large Multimodal Models

Aug 01, 2024
Via arXiv

Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning

Aug 01, 2024
Via arXiv

Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization

May 24, 2024
Via arXiv

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning

Mar 15, 2024
Via arXiv

ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement

Dec 20, 2023
Via arXiv

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval

Dec 19, 2023
Via arXiv

Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control

Dec 06, 2023
Via arXiv