Jianye Hao

Few-Shot Vision-Language Action-Incremental Policy Learning

Apr 22, 2025
Via arXiv

ViMo: A Generative Visual GUI World Model for App Agent

Apr 15, 2025
Via arXiv

From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models

Mar 20, 2025
Via arXiv

AhaRobot: A Low-Cost Open-Source Bimanual Mobile Manipulator for Embodied AI

Mar 13, 2025
Via arXiv

Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation

Mar 13, 2025
Via arXiv

Apollo-MILP: An Alternating Prediction-Correction Neural Solving Framework for Mixed-Integer Linear Programming

Mar 03, 2025
Via arXiv

3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds

Feb 27, 2025
Via arXiv

Generative Models in Decision Making: A Survey

Feb 25, 2025
Via arXiv

Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation

Feb 20, 2025
Via arXiv

VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning

Feb 11, 2025
Via arXiv