Picture for Yitao Liang

Yitao Liang

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Add code
Mar 20, 2025
Figure 1 for JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Figure 2 for JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Figure 3 for JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Figure 4 for JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Viaarxiv icon

A Neural Symbolic Model for Space Physics

Add code
Mar 11, 2025
Figure 1 for A Neural Symbolic Model for Space Physics
Figure 2 for A Neural Symbolic Model for Space Physics
Figure 3 for A Neural Symbolic Model for Space Physics
Figure 4 for A Neural Symbolic Model for Space Physics
Viaarxiv icon

ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment

Add code
Mar 04, 2025
Figure 1 for ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment
Figure 2 for ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment
Figure 3 for ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment
Figure 4 for ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment
Viaarxiv icon

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Add code
Feb 28, 2025
Viaarxiv icon

Tractable Transformers for Flexible Conditional Generation

Add code
Feb 11, 2025
Figure 1 for Tractable Transformers for Flexible Conditional Generation
Figure 2 for Tractable Transformers for Flexible Conditional Generation
Figure 3 for Tractable Transformers for Flexible Conditional Generation
Figure 4 for Tractable Transformers for Flexible Conditional Generation
Viaarxiv icon

TFG-Flow: Training-free Guidance in Multimodal Generative Flow

Add code
Jan 24, 2025
Figure 1 for TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Figure 2 for TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Figure 3 for TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Figure 4 for TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Viaarxiv icon

MineStudio: A Streamlined Package for Minecraft AI Agent Development

Add code
Dec 25, 2024
Figure 1 for MineStudio: A Streamlined Package for Minecraft AI Agent Development
Figure 2 for MineStudio: A Streamlined Package for Minecraft AI Agent Development
Viaarxiv icon

MinsStudio: A Streamlined Package for Minecraft AI Agent Development

Add code
Dec 24, 2024
Figure 1 for MinsStudio: A Streamlined Package for Minecraft AI Agent Development
Figure 2 for MinsStudio: A Streamlined Package for Minecraft AI Agent Development
Viaarxiv icon

Proposing and solving olympiad geometry with guided tree search

Add code
Dec 14, 2024
Viaarxiv icon

Optimizing Latent Goal by Learning from Trajectory Preference

Add code
Dec 03, 2024
Figure 1 for Optimizing Latent Goal by Learning from Trajectory Preference
Figure 2 for Optimizing Latent Goal by Learning from Trajectory Preference
Figure 3 for Optimizing Latent Goal by Learning from Trajectory Preference
Figure 4 for Optimizing Latent Goal by Learning from Trajectory Preference
Viaarxiv icon