Picture for Pengfei Zhang

Pengfei Zhang

Instruction Anchors: Dissecting the Causal Dynamics of Modality Arbitration

Add code
Feb 03, 2026
Viaarxiv icon

MedSpeak: A Knowledge Graph-Aided ASR Error Correction Framework for Spoken Medical QA

Add code
Feb 01, 2026
Viaarxiv icon

PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

Add code
Dec 30, 2025
Viaarxiv icon

Image Generation Method Based on Heat Diffusion Models

Add code
Apr 28, 2025
Figure 1 for Image Generation Method Based on Heat Diffusion Models
Figure 2 for Image Generation Method Based on Heat Diffusion Models
Figure 3 for Image Generation Method Based on Heat Diffusion Models
Figure 4 for Image Generation Method Based on Heat Diffusion Models
Viaarxiv icon

Multimodal 3D Genome Pre-training

Add code
Apr 12, 2025
Figure 1 for Multimodal 3D Genome Pre-training
Figure 2 for Multimodal 3D Genome Pre-training
Figure 3 for Multimodal 3D Genome Pre-training
Figure 4 for Multimodal 3D Genome Pre-training
Viaarxiv icon

DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care

Add code
Mar 26, 2025
Figure 1 for DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care
Figure 2 for DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care
Figure 3 for DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care
Figure 4 for DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care
Viaarxiv icon

Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation

Add code
Feb 11, 2025
Viaarxiv icon

Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey

Add code
Dec 09, 2024
Figure 1 for Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey
Figure 2 for Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey
Figure 3 for Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey
Figure 4 for Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey
Viaarxiv icon

Behavior Backdoor for Deep Learning Models

Add code
Dec 02, 2024
Figure 1 for Behavior Backdoor for Deep Learning Models
Figure 2 for Behavior Backdoor for Deep Learning Models
Figure 3 for Behavior Backdoor for Deep Learning Models
Figure 4 for Behavior Backdoor for Deep Learning Models
Viaarxiv icon

Streamlined Federated Unlearning: Unite as One to Be Highly Efficient

Add code
Nov 28, 2024
Figure 1 for Streamlined Federated Unlearning: Unite as One to Be Highly Efficient
Figure 2 for Streamlined Federated Unlearning: Unite as One to Be Highly Efficient
Figure 3 for Streamlined Federated Unlearning: Unite as One to Be Highly Efficient
Figure 4 for Streamlined Federated Unlearning: Unite as One to Be Highly Efficient
Viaarxiv icon