Picture for Wei Lu

Wei Lu

LMI

VoiceCloak: A Multi-Dimensional Defense Framework against Unauthorized Diffusion-based Voice Cloning

Add code
May 18, 2025
Viaarxiv icon

Towards Open-world Generalized Deepfake Detection: General Feature Extraction via Unsupervised Domain Adaptation

Add code
May 18, 2025
Viaarxiv icon

M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection

Add code
May 16, 2025
Viaarxiv icon

Weakly-supervised Audio Temporal Forgery Localization via Progressive Audio-language Co-learning Network

Add code
May 03, 2025
Viaarxiv icon

Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection

Add code
Apr 30, 2025
Viaarxiv icon

Vidi: Large Multimodal Models for Video Understanding and Editing

Add code
Apr 22, 2025
Figure 1 for Vidi: Large Multimodal Models for Video Understanding and Editing
Figure 2 for Vidi: Large Multimodal Models for Video Understanding and Editing
Figure 3 for Vidi: Large Multimodal Models for Video Understanding and Editing
Figure 4 for Vidi: Large Multimodal Models for Video Understanding and Editing
Viaarxiv icon

FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics

Add code
Mar 31, 2025
Figure 1 for FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Figure 2 for FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Figure 3 for FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Figure 4 for FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Viaarxiv icon

LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection

Add code
Mar 18, 2025
Figure 1 for LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection
Figure 2 for LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection
Figure 3 for LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection
Figure 4 for LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection
Viaarxiv icon

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Add code
Mar 10, 2025
Viaarxiv icon

ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models

Add code
Feb 27, 2025
Figure 1 for ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Figure 2 for ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Figure 3 for ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Figure 4 for ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Viaarxiv icon