Picture for Wentao Zhu

Wentao Zhu

Space-time Reinforcement Network for Video Object Segmentation

May 07, 2024
Viaarxiv icon

Efficient Action Counting with Dynamic Queries

Add code
Mar 05, 2024
Figure 1 for Efficient Action Counting with Dynamic Queries
Figure 2 for Efficient Action Counting with Dynamic Queries
Figure 3 for Efficient Action Counting with Dynamic Queries
Figure 4 for Efficient Action Counting with Dynamic Queries
Viaarxiv icon

OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine

Add code
Mar 04, 2024
Figure 1 for OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine
Figure 2 for OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine
Figure 3 for OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine
Figure 4 for OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine
Viaarxiv icon

Language Models Represent Beliefs of Self and Others

Add code
Feb 29, 2024
Viaarxiv icon

Real-time Holistic Robot Pose Estimation with Unknown States

Add code
Feb 08, 2024
Viaarxiv icon

Deformable Audio Transformer for Audio Event Detection

Jan 08, 2024
Viaarxiv icon

Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification

Jan 08, 2024
Viaarxiv icon

TPC-ViT: Token Propagation Controller for Efficient Vision Transformer

Jan 08, 2024
Viaarxiv icon

Efficient Selective Audio Masked Multimodal Bottleneck Transformer for Audio-Video Classification

Jan 08, 2024
Viaarxiv icon

Social Motion Prediction with Cognitive Hierarchies

Add code
Nov 08, 2023
Viaarxiv icon