Xiaojian Ma

Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting

Mar 22, 2024
Jun Guo, Xiaojian Ma, Yue Fan, Huaping Liu, Qing Li

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Mar 18, 2024
Yue Fan, Xiaojian Ma, Rujie Wu, Yuntao Du, Jiaqi Li, Zhi Gao, Qing Li

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

Mar 08, 2024
Zihao Wang, Anji Liu, Haowei Lin, Jiaqi Li, Xiaojian Ma, Yitao Liang

CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update

Dec 18, 2023
Zhi Gao, Yuntao Du, Xintong Zhang, Xiaojian Ma, Wenjuan Han, Song-Chun Zhu, Qing Li

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Nov 30, 2023
Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang

An Embodied Generalist Agent in 3D World

Nov 18, 2023
Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang

Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World

Oct 16, 2023
Rujie Wu, Xiaojian Ma, Qing Li, Wei Wang, Zhenliang Zhang, Song-Chun Zhu, Yizhou Wang

GROOT: Learning to Follow Instructions by Watching Gameplay Videos

Oct 12, 2023
Shaofei Cai, Bowei Zhang, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang

Learning Energy-Based Prior Model with Diffusion-Amortized MCMC

Oct 05, 2023
Peiyu Yu, Yaxuan Zhu, Sirui Xie, Xiaojian Ma, Ruiqi Gao, Song-Chun Zhu, Ying Nian Wu
