Nenghai Yu

Detecting Voice Cloning Attacks via Timbre Watermarking

Dec 06, 2023
Via arXiv

Towards More Unified In-context Visual Understanding

Dec 05, 2023
Via arXiv

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Nov 29, 2023
Via arXiv

CMFDFormer: Transformer-based Copy-Move Forgery Detection with Continual Learning

Nov 22, 2023
Via arXiv

Segue: Side-information Guided Generative Unlearnable Examples for Facial Privacy Protection in Real World

Oct 24, 2023
Via arXiv

HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending

Oct 16, 2023
Via arXiv

Exploiting Modality-Specific Features For Multi-Modal Manipulation Detection And Grounding

Sep 22, 2023
Via arXiv

Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting

Aug 22, 2023
Via arXiv

MotionGPT: Finetuned LLMs are General-Purpose Motion Generators

Jun 19, 2023
Via arXiv

EVOPOSE: A Recursive Transformer For 3D Human Pose Estimation With Kinematic Structure Priors

Jun 16, 2023
Via arXiv