Xiang Deng

Hardware Neural Control of CartPole and F1TENTH Race Car

Jul 11, 2024

Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL

Jun 08, 2024

RoboMP$^2$: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models

Apr 07, 2024

Dual-View Visual Contextualization for Web Navigation

Feb 06, 2024

GMTalker: Gaussian Mixture based Emotional talking video Portraits

Dec 12, 2023

LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

Nov 26, 2023

AgentBench: Evaluating LLMs as Agents

Aug 07, 2023

Roll Up Your Sleeves: Working with a Collaborative and Engaging Task-Oriented Dialogue System

Jul 29, 2023

Mind2Web: Towards a Generalist Agent for the Web

Jun 15, 2023

Exploring Chain-of-Thought Style Prompting for Text-to-SQL

May 23, 2023