Alert button
Picture for Jiwen Zhang

Jiwen Zhang

Alert button

DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning

Add code
Bookmark button
Alert button
Apr 02, 2024
Mengfei Du, Binhao Wu, Jiwen Zhang, Zhihao Fan, Zejun Li, Ruipu Luo, Xuanjing Huang, Zhongyu Wei

Viaarxiv icon

Android in the Zoo: Chain-of-Action-Thought for GUI Agents

Add code
Bookmark button
Alert button
Mar 05, 2024
Jiwen Zhang, Jihao Wu, Yihua Teng, Minghui Liao, Nuo Xu, Xiao Xiao, Zhongyu Wei, Duyu Tang

Figure 1 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 2 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 3 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 4 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Viaarxiv icon

ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks

Add code
Bookmark button
Alert button
Oct 17, 2023
Zejun Li, Ye Wang, Mengfei Du, Qingwen Liu, Binhao Wu, Jiwen Zhang, Chengxing Zhou, Zhihao Fan, Jie Fu, Jingjing Chen, Xuanjing Huang, Zhongyu Wei

Figure 1 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 2 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 3 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Figure 4 for ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Viaarxiv icon

Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making

Add code
Bookmark button
Alert button
Jul 16, 2023
Ruipu Luo, Jiwen Zhang, Zhongyu Wei

Figure 1 for Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Figure 2 for Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Figure 3 for Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Figure 4 for Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Viaarxiv icon

Robotic Assembly Control Reconfiguration Based on Transfer Reinforcement Learning for Objects with Different Geometric Features

Add code
Bookmark button
Alert button
Nov 04, 2022
Yuhang Gai, Bing Wang, Jiwen Zhang, Dan Wu, Ken Chen

Figure 1 for Robotic Assembly Control Reconfiguration Based on Transfer Reinforcement Learning for Objects with Different Geometric Features
Figure 2 for Robotic Assembly Control Reconfiguration Based on Transfer Reinforcement Learning for Objects with Different Geometric Features
Figure 3 for Robotic Assembly Control Reconfiguration Based on Transfer Reinforcement Learning for Objects with Different Geometric Features
Figure 4 for Robotic Assembly Control Reconfiguration Based on Transfer Reinforcement Learning for Objects with Different Geometric Features
Viaarxiv icon

Local Connection Reinforcement Learning Method for Efficient Control of Robotic Peg-in-Hole Assembly

Add code
Bookmark button
Alert button
Oct 24, 2022
Yuhang Gai, Jiwen Zhang, Dan Wu, Ken Chen

Figure 1 for Local Connection Reinforcement Learning Method for Efficient Control of Robotic Peg-in-Hole Assembly
Figure 2 for Local Connection Reinforcement Learning Method for Efficient Control of Robotic Peg-in-Hole Assembly
Figure 3 for Local Connection Reinforcement Learning Method for Efficient Control of Robotic Peg-in-Hole Assembly
Figure 4 for Local Connection Reinforcement Learning Method for Efficient Control of Robotic Peg-in-Hole Assembly
Viaarxiv icon

Curriculum Learning for Vision-and-Language Navigation

Add code
Bookmark button
Alert button
Nov 14, 2021
Jiwen Zhang, Zhongyu Wei, Jianqing Fan, Jiajie Peng

Figure 1 for Curriculum Learning for Vision-and-Language Navigation
Figure 2 for Curriculum Learning for Vision-and-Language Navigation
Figure 3 for Curriculum Learning for Vision-and-Language Navigation
Figure 4 for Curriculum Learning for Vision-and-Language Navigation
Viaarxiv icon