Picture for Yang Gao

Yang Gao

Harry

Extrapolation Merging: Keep Improving With Extrapolation and Merging

Add code
Mar 05, 2025
Viaarxiv icon

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon

SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation

Add code
Jan 24, 2025
Viaarxiv icon

RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation

Add code
Jan 15, 2025
Figure 1 for RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation
Figure 2 for RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation
Figure 3 for RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation
Figure 4 for RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation
Viaarxiv icon

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Add code
Nov 29, 2024
Viaarxiv icon

Distractor-free Generalizable 3D Gaussian Splatting

Add code
Nov 26, 2024
Figure 1 for Distractor-free Generalizable 3D Gaussian Splatting
Figure 2 for Distractor-free Generalizable 3D Gaussian Splatting
Figure 3 for Distractor-free Generalizable 3D Gaussian Splatting
Figure 4 for Distractor-free Generalizable 3D Gaussian Splatting
Viaarxiv icon

SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation

Add code
Nov 21, 2024
Viaarxiv icon

Reviving Dormant Memories: Investigating Catastrophic Forgetting in Language Models through Rationale-Guidance Difficulty

Add code
Nov 18, 2024
Viaarxiv icon

PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment

Add code
Nov 18, 2024
Figure 1 for PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment
Figure 2 for PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment
Figure 3 for PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment
Figure 4 for PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment
Viaarxiv icon

METEOR: Evolutionary Journey of Large Language Models from Guidance to Self-Growth

Add code
Nov 18, 2024
Viaarxiv icon