Picture for Jian Tang

Jian Tang

Baidu

RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence

Add code
Dec 31, 2025
Viaarxiv icon

Real-world Reinforcement Learning from Suboptimal Interventions

Add code
Dec 30, 2025
Viaarxiv icon

Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning

Add code
Dec 26, 2025
Viaarxiv icon

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation

Add code
Sep 30, 2025
Figure 1 for MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Figure 2 for MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Figure 3 for MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Figure 4 for MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
Viaarxiv icon

WoW: Towards a World omniscient World model Through Embodied Interaction

Add code
Sep 26, 2025
Viaarxiv icon

HumanoidVerse: A Versatile Humanoid for Vision-Language Guided Multi-Object Rearrangement

Add code
Aug 23, 2025
Viaarxiv icon

Humanoid Occupancy: Enabling A Generalized Multimodal Occupancy Perception System on Humanoid Robots

Add code
Jul 27, 2025
Viaarxiv icon

Self-Supervised Multi-Part Articulated Objects Modeling via Deformable Gaussian Splatting and Progressive Primitive Segmentation

Add code
Jun 11, 2025
Viaarxiv icon

FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency

Add code
Jun 10, 2025
Viaarxiv icon

RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks

Add code
Jun 07, 2025
Viaarxiv icon