Shentong Mo

A Large-scale Medical Visual Task Adaptation Benchmark

Apr 19, 2024
Shentong Mo, Xufang Luo, Yansen Wang, Dongsheng Li

DailyMAE: Towards Pretraining Masked Autoencoders in One Day

Mar 31, 2024
Jiantao Wu, Shentong Mo, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais

Text-to-Audio Generation Synchronized with Videos

Mar 08, 2024
Shentong Mo, Jing Shi, Yapeng Tian

Audio-Synchronized Visual Animation

Mar 08, 2024
Lin Zhang, Shentong Mo, Yijing Zhang, Pedro Morgado

LSPT: Long-term Spatial Prompt Tuning for Visual Representation Learning

Feb 27, 2024
Shentong Mo, Yansen Wang, Xufang Luo, Dongsheng Li

We Choose to Go to Space: Agent-driven Human and Multi-Robot Collaboration in Microgravity

Feb 22, 2024
Miao Xin, Zhongrui You, Zihan Zhang, Taoran Jiang, Tingjia Xu, Haotian Liang, Guojing Ge, Yuchen Ji, Shentong Mo, Jian Cheng

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Dec 12, 2023
Shentong Mo, Enze Xie, Yue Wu, Junsong Chen, Matthias Nießner, Zhenguo Li

Beyond Accuracy: Statistical Measures and Benchmark for Evaluation of Representation from Self-Supervised Learning

Dec 02, 2023
Jiantao Wu, Shentong Mo, Sara Atito, Josef Kittler, Zhenhua Feng, Muhammad Awais

Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling

Dec 02, 2023
Shentong Mo, Pedro Morgado

Weakly-Supervised Audio-Visual Segmentation

Nov 25, 2023
Shentong Mo, Bhiksha Raj
