Picture for Zhengqi Wen

Zhengqi Wen

$\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection

Add code
May 16, 2025
Figure 1 for $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Figure 2 for $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Figure 3 for $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Figure 4 for $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Viaarxiv icon

Deconfounded Reasoning for Multimodal Fake News Detection via Causal Intervention

Add code
Apr 12, 2025
Figure 1 for Deconfounded Reasoning for Multimodal Fake News Detection via Causal Intervention
Figure 2 for Deconfounded Reasoning for Multimodal Fake News Detection via Causal Intervention
Figure 3 for Deconfounded Reasoning for Multimodal Fake News Detection via Causal Intervention
Figure 4 for Deconfounded Reasoning for Multimodal Fake News Detection via Causal Intervention
Viaarxiv icon

Exploring Modality Disruption in Multimodal Fake News Detection

Add code
Apr 12, 2025
Figure 1 for Exploring Modality Disruption in Multimodal Fake News Detection
Figure 2 for Exploring Modality Disruption in Multimodal Fake News Detection
Figure 3 for Exploring Modality Disruption in Multimodal Fake News Detection
Figure 4 for Exploring Modality Disruption in Multimodal Fake News Detection
Viaarxiv icon

ImViD: Immersive Volumetric Videos for Enhanced VR Engagement

Add code
Mar 18, 2025
Viaarxiv icon

DReSS: Data-driven Regularized Structured Streamlining for Large Language Models

Add code
Jan 29, 2025
Figure 1 for DReSS: Data-driven Regularized Structured Streamlining for Large Language Models
Figure 2 for DReSS: Data-driven Regularized Structured Streamlining for Large Language Models
Figure 3 for DReSS: Data-driven Regularized Structured Streamlining for Large Language Models
Figure 4 for DReSS: Data-driven Regularized Structured Streamlining for Large Language Models
Viaarxiv icon

MTPareto: A MultiModal Targeted Pareto Framework for Fake News Detection

Add code
Jan 12, 2025
Viaarxiv icon

Neural Codec Source Tracing: Toward Comprehensive Attribution in Open-Set Condition

Add code
Jan 11, 2025
Figure 1 for Neural Codec Source Tracing: Toward Comprehensive Attribution in Open-Set Condition
Figure 2 for Neural Codec Source Tracing: Toward Comprehensive Attribution in Open-Set Condition
Figure 3 for Neural Codec Source Tracing: Toward Comprehensive Attribution in Open-Set Condition
Figure 4 for Neural Codec Source Tracing: Toward Comprehensive Attribution in Open-Set Condition
Viaarxiv icon

Mel-Refine: A Plug-and-Play Approach to Refine Mel-Spectrogram in Audio Generation

Add code
Dec 11, 2024
Viaarxiv icon

LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis

Add code
Nov 24, 2024
Figure 1 for LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis
Figure 2 for LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis
Figure 3 for LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis
Figure 4 for LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis
Viaarxiv icon

Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0

Add code
Sep 18, 2024
Figure 1 for Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0
Figure 2 for Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0
Figure 3 for Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0
Figure 4 for Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0
Viaarxiv icon