Picture for Chen Chen

Chen Chen

Department of Radiology, Zhejiang Cancer Hospital, Hangzhou, 310022, China, Hangzhou Institute of Medicine

CIB-SE-YOLOv8: Optimized YOLOv8 for Real-Time Safety Equipment Detection on Construction Sites

Add code
Oct 28, 2024
Figure 1 for CIB-SE-YOLOv8: Optimized YOLOv8 for Real-Time Safety Equipment Detection on Construction Sites
Figure 2 for CIB-SE-YOLOv8: Optimized YOLOv8 for Real-Time Safety Equipment Detection on Construction Sites
Figure 3 for CIB-SE-YOLOv8: Optimized YOLOv8 for Real-Time Safety Equipment Detection on Construction Sites
Figure 4 for CIB-SE-YOLOv8: Optimized YOLOv8 for Real-Time Safety Equipment Detection on Construction Sites
Viaarxiv icon

A Survey of Deep Graph Learning under Distribution Shifts: from Graph Out-of-Distribution Generalization to Adaptation

Add code
Oct 25, 2024
Viaarxiv icon

Learning Load Balancing with GNN in MPTCP-Enabled Heterogeneous Networks

Add code
Oct 22, 2024
Figure 1 for Learning Load Balancing with GNN in MPTCP-Enabled Heterogeneous Networks
Figure 2 for Learning Load Balancing with GNN in MPTCP-Enabled Heterogeneous Networks
Figure 3 for Learning Load Balancing with GNN in MPTCP-Enabled Heterogeneous Networks
Figure 4 for Learning Load Balancing with GNN in MPTCP-Enabled Heterogeneous Networks
Viaarxiv icon

AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition

Add code
Oct 21, 2024
Figure 1 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 2 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 3 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Figure 4 for AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Viaarxiv icon

Self-Comparison for Dataset-Level Membership Inference in Large (Vision-)Language Models

Add code
Oct 16, 2024
Figure 1 for Self-Comparison for Dataset-Level Membership Inference in Large (Vision-)Language Models
Figure 2 for Self-Comparison for Dataset-Level Membership Inference in Large (Vision-)Language Models
Figure 3 for Self-Comparison for Dataset-Level Membership Inference in Large (Vision-)Language Models
Figure 4 for Self-Comparison for Dataset-Level Membership Inference in Large (Vision-)Language Models
Viaarxiv icon

Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models

Add code
Oct 14, 2024
Figure 1 for Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models
Figure 2 for Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models
Figure 3 for Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models
Figure 4 for Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models
Viaarxiv icon

ControlMM: Controllable Masked Motion Generation

Add code
Oct 14, 2024
Viaarxiv icon

3D Vision-Language Gaussian Splatting

Add code
Oct 10, 2024
Figure 1 for 3D Vision-Language Gaussian Splatting
Figure 2 for 3D Vision-Language Gaussian Splatting
Figure 3 for 3D Vision-Language Gaussian Splatting
Figure 4 for 3D Vision-Language Gaussian Splatting
Viaarxiv icon

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Add code
Oct 03, 2024
Viaarxiv icon

Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective

Add code
Sep 29, 2024
Figure 1 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 2 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 3 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Figure 4 for Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Viaarxiv icon