Picture for Jian Tang

Jian Tang

Baidu

SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects

Add code
Jan 17, 2024
Figure 1 for SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects
Figure 2 for SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects
Figure 3 for SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects
Figure 4 for SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects
Viaarxiv icon

Visual Robotic Manipulation with Depth-Aware Pretraining

Add code
Jan 17, 2024
Figure 1 for Visual Robotic Manipulation with Depth-Aware Pretraining
Figure 2 for Visual Robotic Manipulation with Depth-Aware Pretraining
Figure 3 for Visual Robotic Manipulation with Depth-Aware Pretraining
Figure 4 for Visual Robotic Manipulation with Depth-Aware Pretraining
Viaarxiv icon

SWBT: Similarity Weighted Behavior Transformer with the Imperfect Demonstration for Robotic Manipulation

Add code
Jan 17, 2024
Viaarxiv icon

LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model

Add code
Jan 15, 2024
Viaarxiv icon

Object-Centric Instruction Augmentation for Robotic Manipulation

Add code
Jan 05, 2024
Figure 1 for Object-Centric Instruction Augmentation for Robotic Manipulation
Figure 2 for Object-Centric Instruction Augmentation for Robotic Manipulation
Figure 3 for Object-Centric Instruction Augmentation for Robotic Manipulation
Figure 4 for Object-Centric Instruction Augmentation for Robotic Manipulation
Viaarxiv icon

Cross-Modal Reasoning with Event Correlation for Video Question Answering

Add code
Dec 20, 2023
Viaarxiv icon

Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering

Add code
Dec 20, 2023
Figure 1 for Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering
Figure 2 for Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering
Figure 3 for Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering
Figure 4 for Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering
Viaarxiv icon

Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective

Add code
Dec 18, 2023
Viaarxiv icon

Universal Deoxidation of Semiconductor Substrates Assisted by Machine-Learning and Real-Time-Feedback-Control

Add code
Dec 04, 2023
Viaarxiv icon

PDB-Struct: A Comprehensive Benchmark for Structure-based Protein Design

Add code
Nov 30, 2023
Figure 1 for PDB-Struct: A Comprehensive Benchmark for Structure-based Protein Design
Figure 2 for PDB-Struct: A Comprehensive Benchmark for Structure-based Protein Design
Figure 3 for PDB-Struct: A Comprehensive Benchmark for Structure-based Protein Design
Figure 4 for PDB-Struct: A Comprehensive Benchmark for Structure-based Protein Design
Viaarxiv icon