Picture for Zhaoyu Chen

Zhaoyu Chen

Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks for Enhanced Action Understanding

Add code
Aug 10, 2025
Viaarxiv icon

LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops

Add code
Jun 17, 2025
Viaarxiv icon

dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis

Add code
Mar 13, 2025
Viaarxiv icon

MMARD: Improving the Min-Max Optimization Process in Adversarial Robustness Distillation

Add code
Mar 09, 2025
Viaarxiv icon

VideoPure: Diffusion-based Adversarial Purification for Video Recognition

Add code
Jan 25, 2025
Figure 1 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 2 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 3 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 4 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Viaarxiv icon

Pruning for Sparse Diffusion Models based on Gradient Flow

Add code
Jan 16, 2025
Figure 1 for Pruning for Sparse Diffusion Models based on Gradient Flow
Figure 2 for Pruning for Sparse Diffusion Models based on Gradient Flow
Figure 3 for Pruning for Sparse Diffusion Models based on Gradient Flow
Figure 4 for Pruning for Sparse Diffusion Models based on Gradient Flow
Viaarxiv icon

MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration

Add code
Oct 17, 2024
Figure 1 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 2 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 3 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Figure 4 for MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Viaarxiv icon

X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation

Add code
Sep 28, 2024
Figure 1 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 2 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 3 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 4 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Viaarxiv icon

General Compression Framework for Efficient Transformer Object Tracking

Add code
Sep 26, 2024
Figure 1 for General Compression Framework for Efficient Transformer Object Tracking
Figure 2 for General Compression Framework for Efficient Transformer Object Tracking
Figure 3 for General Compression Framework for Efficient Transformer Object Tracking
Figure 4 for General Compression Framework for Efficient Transformer Object Tracking
Viaarxiv icon

KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition

Add code
Sep 14, 2024
Figure 1 for KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition
Figure 2 for KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition
Figure 3 for KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition
Figure 4 for KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition
Viaarxiv icon