Alert button
Picture for Mengmeng Xu

Mengmeng Xu

Alert button

Move Anything with Layered Scene Diffusion

Add code
Bookmark button
Alert button
Apr 10, 2024
Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu, Ziwei Liu, Tao Xiang, Antoine Toisoul

Viaarxiv icon

Hyper-VolTran: Fast and Generalizable One-Shot Image to 3D Object Structure via HyperNetworks

Add code
Bookmark button
Alert button
Jan 05, 2024
Christian Simon, Sen He, Juan-Manuel Perez-Rua, Mengmeng Xu, Amine Benhalloum, Tao Xiang

Viaarxiv icon

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

Add code
Bookmark button
Alert button
Dec 07, 2023
Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Perez-Rua

Figure 1 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 2 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 3 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 4 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Viaarxiv icon

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing

Add code
Bookmark button
Alert button
Oct 09, 2023
Yuren Cong, Mengmeng Xu, Christian Simon, Shoufa Chen, Jiawei Ren, Yanping Xie, Juan-Manuel Perez-Rua, Bodo Rosenhahn, Tao Xiang, Sen He

Figure 1 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 2 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 3 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 4 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Viaarxiv icon

Mindstorms in Natural Language-Based Societies of Mind

Add code
Bookmark button
Alert button
May 26, 2023
Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-Ping Fan, Bernard Ghanem, Jürgen Schmidhuber

Figure 1 for Mindstorms in Natural Language-Based Societies of Mind
Figure 2 for Mindstorms in Natural Language-Based Societies of Mind
Figure 3 for Mindstorms in Natural Language-Based Societies of Mind
Figure 4 for Mindstorms in Natural Language-Based Societies of Mind
Viaarxiv icon

Boundary-Denoising for Video Activity Localization

Add code
Bookmark button
Alert button
Apr 06, 2023
Mengmeng Xu, Mattia Soldan, Jialin Gao, Shuming Liu, Juan-Manuel Pérez-Rúa, Bernard Ghanem

Figure 1 for Boundary-Denoising for Video Activity Localization
Figure 2 for Boundary-Denoising for Video Activity Localization
Figure 3 for Boundary-Denoising for Video Activity Localization
Figure 4 for Boundary-Denoising for Video Activity Localization
Viaarxiv icon

Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation

Add code
Bookmark button
Alert button
Nov 27, 2022
Sauradip Nag, Mengmeng Xu, Xiatian Zhu, Juan-Manuel Perez-Rua, Bernard Ghanem, Yi-Zhe Song, Tao Xiang

Figure 1 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 2 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 3 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 4 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Viaarxiv icon

Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization

Add code
Bookmark button
Alert button
Nov 18, 2022
Mengmeng Xu, Yanghao Li, Cheng-Yang Fu, Bernard Ghanem, Tao Xiang, Juan-Manuel Perez-Rua

Figure 1 for Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Figure 2 for Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Figure 3 for Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Figure 4 for Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Viaarxiv icon

Negative Frames Matter in Egocentric Visual Query 2D Localization

Add code
Bookmark button
Alert button
Aug 03, 2022
Mengmeng Xu, Cheng-Yang Fu, Yanghao Li, Bernard Ghanem, Juan-Manuel Perez-Rua, Tao Xiang

Figure 1 for Negative Frames Matter in Egocentric Visual Query 2D Localization
Figure 2 for Negative Frames Matter in Egocentric Visual Query 2D Localization
Figure 3 for Negative Frames Matter in Egocentric Visual Query 2D Localization
Figure 4 for Negative Frames Matter in Egocentric Visual Query 2D Localization
Viaarxiv icon

ETAD: A Unified Framework for Efficient Temporal Action Detection

Add code
Bookmark button
Alert button
May 14, 2022
Shuming Liu, Mengmeng Xu, Chen Zhao, Xu Zhao, Bernard Ghanem

Figure 1 for ETAD: A Unified Framework for Efficient Temporal Action Detection
Figure 2 for ETAD: A Unified Framework for Efficient Temporal Action Detection
Figure 3 for ETAD: A Unified Framework for Efficient Temporal Action Detection
Figure 4 for ETAD: A Unified Framework for Efficient Temporal Action Detection
Viaarxiv icon