Alert button
Picture for Wenxuan Xie

Wenxuan Xie

Alert button

Slot-VLM: SlowFast Slots for Video-Language Modeling

Add code
Bookmark button
Alert button
Feb 20, 2024
Jiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu

Viaarxiv icon

Retrieval-based Video Language Model for Efficient Long Video Question Answering

Add code
Bookmark button
Alert button
Dec 08, 2023
Jiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu

Figure 1 for Retrieval-based Video Language Model for Efficient Long Video Question Answering
Figure 2 for Retrieval-based Video Language Model for Efficient Long Video Question Answering
Figure 3 for Retrieval-based Video Language Model for Efficient Long Video Question Answering
Figure 4 for Retrieval-based Video Language Model for Efficient Long Video Question Answering
Viaarxiv icon

Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API

Add code
Bookmark button
Alert button
Oct 07, 2023
Zhizheng Zhang, Wenxuan Xie, Xiaoyi Zhang, Yan Lu

Figure 1 for Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API
Figure 2 for Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API
Figure 3 for Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API
Figure 4 for Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API
Viaarxiv icon

Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators

Add code
Bookmark button
Alert button
Jun 02, 2023
Zhizheng Zhang, Xiaoyi Zhang, Wenxuan Xie, Yan Lu

Figure 1 for Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators
Figure 2 for Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators
Figure 3 for Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators
Figure 4 for Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators
Viaarxiv icon

Unifying Layout Generation with a Decoupled Diffusion Model

Add code
Bookmark button
Alert button
Mar 09, 2023
Mude Hui, Zhizheng Zhang, Xiaoyi Zhang, Wenxuan Xie, Yuwang Wang, Yan Lu

Figure 1 for Unifying Layout Generation with a Decoupled Diffusion Model
Figure 2 for Unifying Layout Generation with a Decoupled Diffusion Model
Figure 3 for Unifying Layout Generation with a Decoupled Diffusion Model
Figure 4 for Unifying Layout Generation with a Decoupled Diffusion Model
Viaarxiv icon

A Semi-supervised Sensing Rate Learning based CMAB Scheme to Combat COVID-19 by Trustful Data Collection in the Crowd

Add code
Bookmark button
Alert button
Jan 17, 2023
Jianheng Tang, Kejia Fan, Wenxuan Xie, Luomin Zeng, Feijiang Han, Guosheng Huang, Tian Wang, Anfeng Liu, Shaobo Zhang

Figure 1 for A Semi-supervised Sensing Rate Learning based CMAB Scheme to Combat COVID-19 by Trustful Data Collection in the Crowd
Figure 2 for A Semi-supervised Sensing Rate Learning based CMAB Scheme to Combat COVID-19 by Trustful Data Collection in the Crowd
Figure 3 for A Semi-supervised Sensing Rate Learning based CMAB Scheme to Combat COVID-19 by Trustful Data Collection in the Crowd
Figure 4 for A Semi-supervised Sensing Rate Learning based CMAB Scheme to Combat COVID-19 by Trustful Data Collection in the Crowd
Viaarxiv icon

Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?

Add code
Bookmark button
Alert button
Sep 12, 2021
Chuanxin Tang, Yucheng Zhao, Guangting Wang, Chong Luo, Wenxuan Xie, Wenjun Zeng

Figure 1 for Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Figure 2 for Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Figure 3 for Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Figure 4 for Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Viaarxiv icon

Unsupervised Visual Representation Learning by Tracking Patches in Video

Add code
Bookmark button
Alert button
May 06, 2021
Guangting Wang, Yizhou Zhou, Chong Luo, Wenxuan Xie, Wenjun Zeng, Zhiwei Xiong

Figure 1 for Unsupervised Visual Representation Learning by Tracking Patches in Video
Figure 2 for Unsupervised Visual Representation Learning by Tracking Patches in Video
Figure 3 for Unsupervised Visual Representation Learning by Tracking Patches in Video
Figure 4 for Unsupervised Visual Representation Learning by Tracking Patches in Video
Viaarxiv icon

Detect or Track: Towards Cost-Effective Video Object Detection/Tracking

Add code
Bookmark button
Alert button
Nov 13, 2018
Hao Luo, Wenxuan Xie, Xinggang Wang, Wenjun Zeng

Figure 1 for Detect or Track: Towards Cost-Effective Video Object Detection/Tracking
Figure 2 for Detect or Track: Towards Cost-Effective Video Object Detection/Tracking
Figure 3 for Detect or Track: Towards Cost-Effective Video Object Detection/Tracking
Figure 4 for Detect or Track: Towards Cost-Effective Video Object Detection/Tracking
Viaarxiv icon

Learning to Update for Object Tracking

Add code
Bookmark button
Alert button
Jun 19, 2018
Bi Li, Wenxuan Xie, Wenjun Zeng, Wenyu Liu

Figure 1 for Learning to Update for Object Tracking
Figure 2 for Learning to Update for Object Tracking
Figure 3 for Learning to Update for Object Tracking
Figure 4 for Learning to Update for Object Tracking
Viaarxiv icon