Xin Chen

AppAgent: Multimodal Agents as Smartphone Users

Dec 22, 2023
Chi Zhang, Zhao Yang, Jiaxuan Liu, Yucheng Han, Xin Chen, Zebiao Huang, Bin Fu, Gang Yu

DoDo-Code: a Deep Levenshtein Distance Embedding-based Code for IDS Channel and DNA Storage

Dec 20, 2023
Alan J. X. Guo, Sihan Sun, Xiang Wei, Mengyi Wei, Xin Chen

OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers

Dec 18, 2023
Han Liang, Jiacheng Bao, Ruichi Zhang, Sihan Ren, Yuecheng Xu, Sibei Yang, Xin Chen, Jingyi Yu, Lan Xu

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts

Dec 17, 2023
Mingsheng Li, Xin Chen, Chi Zhang, Sijin Chen, Hongyuan Zhu, Fukun Yin, Gang Yu, Tao Chen

HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models

Dec 08, 2023
Pei Lin, Sihang Xu, Hongdi Yang, Yiran Liu, Xin Chen, Jingya Wang, Jingyi Yu, Lan Xu

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model

Dec 01, 2023
Fukun Yin, Xin Chen, Chi Zhang, Biao Jiang, Zibo Zhao, Jiayuan Fan, Gang Yu, Taihao Li, Tao Chen

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Nov 30, 2023
Sijin Chen, Xin Chen, Chi Zhang, Mingsheng Li, Gang Yu, Hao Fei, Hongyuan Zhu, Jiayuan Fan, Tao Chen

ChartLlama: A Multimodal LLM for Chart Understanding and Generation

Nov 27, 2023
Yucheng Han, Chi Zhang, Xin Chen, Xu Yang, Zhibin Wang, Gang Yu, Bin Fu, Hanwang Zhang

PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation

Nov 03, 2023
Yuhan Ding, Fukun Yin, Jiayuan Fan, Hui Li, Xin Chen, Wen Liu, Chongshan Lu, Gang Yu, Tao Chen
