Picture for Jing Huo

Jing Huo

Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning

Add code
Jan 08, 2026
Viaarxiv icon

ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation

Add code
Dec 18, 2025
Figure 1 for ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
Figure 2 for ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
Figure 3 for ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
Figure 4 for ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon Manipulation
Viaarxiv icon

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

Add code
Oct 10, 2025
Viaarxiv icon

MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems

Add code
May 27, 2025
Figure 1 for MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems
Figure 2 for MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems
Figure 3 for MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems
Figure 4 for MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems
Viaarxiv icon

SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving

Add code
May 18, 2025
Figure 1 for SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving
Figure 2 for SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving
Figure 3 for SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving
Figure 4 for SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving
Viaarxiv icon

DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation

Add code
May 01, 2025
Viaarxiv icon

Robust Dataset Distillation by Matching Adversarial Trajectories

Add code
Mar 15, 2025
Viaarxiv icon

RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation

Add code
Jan 15, 2025
Figure 1 for RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation
Figure 2 for RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation
Figure 3 for RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation
Figure 4 for RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation
Viaarxiv icon

Distractor-free Generalizable 3D Gaussian Splatting

Add code
Nov 26, 2024
Figure 1 for Distractor-free Generalizable 3D Gaussian Splatting
Figure 2 for Distractor-free Generalizable 3D Gaussian Splatting
Figure 3 for Distractor-free Generalizable 3D Gaussian Splatting
Figure 4 for Distractor-free Generalizable 3D Gaussian Splatting
Viaarxiv icon

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment

Add code
Oct 22, 2024
Figure 1 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 2 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 3 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 4 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Viaarxiv icon