Picture for Yifan Li

Yifan Li

Weakly-Supervised 3D Scene Graph Generation via Visual-Linguistic Assisted Pseudo-labeling

Add code
Apr 03, 2024
Viaarxiv icon

Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models

Add code
Mar 14, 2024
Viaarxiv icon

The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative

Add code
Feb 20, 2024
Viaarxiv icon

Repositioning the Subject within Image

Add code
Jan 30, 2024
Figure 1 for Repositioning the Subject within Image
Figure 2 for Repositioning the Subject within Image
Figure 3 for Repositioning the Subject within Image
Figure 4 for Repositioning the Subject within Image
Viaarxiv icon

Temporal Adaptive RGBT Tracking with Modality Prompt

Add code
Jan 02, 2024
Viaarxiv icon

A Novel Tree Model-based DNN to Achieve a High-Resolution DOA Estimation via Massive MIMO receive array

Add code
Nov 30, 2023
Viaarxiv icon

CSGNN: Conquering Noisy Node labels via Dynamic Class-wise Selection

Add code
Nov 20, 2023
Viaarxiv icon

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

Add code
Aug 31, 2023
Viaarxiv icon

MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image Analysis

Add code
Jun 30, 2023
Figure 1 for MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image Analysis
Figure 2 for MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image Analysis
Figure 3 for MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image Analysis
Figure 4 for MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image Analysis
Viaarxiv icon

Evaluating Object Hallucination in Large Vision-Language Models

Add code
May 23, 2023
Figure 1 for Evaluating Object Hallucination in Large Vision-Language Models
Figure 2 for Evaluating Object Hallucination in Large Vision-Language Models
Figure 3 for Evaluating Object Hallucination in Large Vision-Language Models
Figure 4 for Evaluating Object Hallucination in Large Vision-Language Models
Viaarxiv icon