Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Albert Wu

Test-Time Scaling Makes Overtraining Compute-Optimal

Apr 01, 2026

Nicholas Roberts, Sungjun Cho, Zhiqi Gao, Tzu-Heng Huang, Albert Wu, Gabriel Orlanski, Avi Trost, Kelly Buchanan, Aws Albarghouthi, Frederic Sala

Abstract:Modern LLMs scale at test-time, e.g. via repeated sampling, where inference cost grows with model size and the number of samples. This creates a trade-off that pretraining scaling laws, such as Chinchilla, do not address. We present Train-to-Test ($T^2$) scaling laws that jointly optimize model size, training tokens, and number of inference samples under fixed end-to-end budgets. $T^2$ modernizes pretraining scaling laws with pass@$k$ modeling used for test-time scaling, then jointly optimizes pretraining and test-time decisions. Forecasts from $T^2$ are robust over distinct modeling approaches: measuring joint scaling effect on the task loss and modeling impact on task accuracy. Across eight downstream tasks, we find that when accounting for inference cost, optimal pretraining decisions shift radically into the overtraining regime, well-outside of the range of standard pretraining scaling suites. We validate our results by pretraining heavily overtrained models in the optimal region that $T^2$ scaling forecasts, confirming their substantially stronger performance compared to pretraining scaling alone. Finally, as frontier LLMs are post-trained, we show that our findings survive the post-training stage, making $T^2$ scaling meaningful in modern deployments.

Via

Access Paper or Ask Questions

One-Shot Transfer of Long-Horizon Extrinsic Manipulation Through Contact Retargeting

Apr 11, 2024

Albert Wu, Ruocheng Wang, Sirui Chen, Clemens Eppner, C. Karen Liu

Figure 1 for One-Shot Transfer of Long-Horizon Extrinsic Manipulation Through Contact Retargeting

Figure 2 for One-Shot Transfer of Long-Horizon Extrinsic Manipulation Through Contact Retargeting

Figure 3 for One-Shot Transfer of Long-Horizon Extrinsic Manipulation Through Contact Retargeting

Figure 4 for One-Shot Transfer of Long-Horizon Extrinsic Manipulation Through Contact Retargeting

Abstract:Extrinsic manipulation, the use of environment contacts to achieve manipulation objectives, enables strategies that are otherwise impossible with a parallel jaw gripper. However, orchestrating a long-horizon sequence of contact interactions between the robot, object, and environment is notoriously challenging due to the scene diversity, large action space, and difficult contact dynamics. We observe that most extrinsic manipulation are combinations of short-horizon primitives, each of which depend strongly on initializing from a desirable contact configuration to succeed. Therefore, we propose to generalize one extrinsic manipulation trajectory to diverse objects and environments by retargeting contact requirements. We prepare a single library of robust short-horizon, goal-conditioned primitive policies, and design a framework to compose state constraints stemming from contacts specifications of each primitive. Given a test scene and a single demo prescribing the primitive sequence, our method enforces the state constraints on the test scene and find intermediate goal states using inverse kinematics. The goals are then tracked by the primitive policies. Using a 7+1 DoF robotic arm-gripper system, we achieved an overall success rate of 80.5% on hardware over 4 long-horizon extrinsic manipulation tasks, each with up to 4 primitives. Our experiments cover 10 objects and 6 environment configurations. We further show empirically that our method admits a wide range of demonstrations, and that contact retargeting is indeed the key to successfully combining primitives for long-horizon extrinsic manipulation. Code and additional details are available at stanford-tml.github.io/extrinsic-manipulation.

* 8 pages, 6 figures

Via

Access Paper or Ask Questions

Synthesize Dexterous Nonprehensile Pregrasp for Ungraspable Objects

May 08, 2023

Sirui Chen, Albert Wu, C. Karen Liu

Figure 1 for Synthesize Dexterous Nonprehensile Pregrasp for Ungraspable Objects

Figure 2 for Synthesize Dexterous Nonprehensile Pregrasp for Ungraspable Objects

Figure 3 for Synthesize Dexterous Nonprehensile Pregrasp for Ungraspable Objects

Figure 4 for Synthesize Dexterous Nonprehensile Pregrasp for Ungraspable Objects

Abstract:Daily objects embedded in a contextual environment are often ungraspable initially. Whether it is a book sandwiched by other books on a fully packed bookshelf or a piece of paper lying flat on the desk, a series of nonprehensile pregrasp maneuvers is required to manipulate the object into a graspable state. Humans are proficient at utilizing environmental contacts to achieve manipulation tasks that are otherwise impossible, but synthesizing such nonprehensile pregrasp behaviors is challenging to existing methods. We present a novel method that combines graph search, optimal control, and a learning-based objective function to synthesize physically realistic and diverse nonprehensile pre-grasp motions that leverage the external contacts. Since the ``graspability'' of an object in context with its surrounding is difficult to define, we utilize a dataset of dexterous grasps to learn a metric which implicitly takes into account the exposed surface of the object and the finger tip locations. Our method can efficiently discover hand and object trajectories that are certified to be physically feasible by the simulation and kinematically achievable by the dexterous hand. We evaluate our method on eight challenging scenarios where nonprehensile pre-grasps are required to succeed. We also show that our method can be applied to unseen objects different from those in the training dataset. Finally, we report quantitative analyses on generalization and robustness of our method, as well as an ablation study.

* ACM SIGGRAPH Conference Proceedings 2023
* 11 pages, 9 figures, SIGGRAPH Conference Proceedings 2023

Via

Access Paper or Ask Questions

Learning Diverse and Physically Feasible Dexterous Grasps with Generative Model and Bilevel Optimization

Jul 01, 2022

Albert Wu, Michelle Guo, C. Karen Liu

Figure 1 for Learning Diverse and Physically Feasible Dexterous Grasps with Generative Model and Bilevel Optimization

Figure 2 for Learning Diverse and Physically Feasible Dexterous Grasps with Generative Model and Bilevel Optimization

Figure 3 for Learning Diverse and Physically Feasible Dexterous Grasps with Generative Model and Bilevel Optimization

Figure 4 for Learning Diverse and Physically Feasible Dexterous Grasps with Generative Model and Bilevel Optimization

Abstract:To fully utilize the versatility of a multi-finger dexterous robotic hand for object grasping, one must satisfy complex physical constraints introduced by hand-object interaction and object geometry during grasp planning. We propose an integrative approach of combining a generative model and a bilevel optimization to compute diverse grasps for novel unseen objects. First, a grasp prediction is obtained from a conditional variational autoencoder trained on merely six YCB objects. The prediction is then projected onto the manifold of kinematically and dynamically feasible grasps by jointly solving collision-aware inverse kinematics, force closure, and friction constraints as one nonconvex bilevel optimization. We demonstrate the effectiveness of our method on hardware by successfully grasping a wide range of unseen household objects, including adversarial shapes challenging to other types of robotic grippers. A video summary of our results is available at https://youtu.be/9DTrImbN99I.

* 12 pages, 4 figures

Via

Access Paper or Ask Questions

Robust-RRT: Probabilistically-Complete Motion Planning for Uncertain Nonlinear Systems

May 16, 2022

Albert Wu, Thomas Lew, Kiril Solovey, Edward Schmerling, Marco Pavone

Figure 1 for Robust-RRT: Probabilistically-Complete Motion Planning for Uncertain Nonlinear Systems

Figure 2 for Robust-RRT: Probabilistically-Complete Motion Planning for Uncertain Nonlinear Systems

Figure 3 for Robust-RRT: Probabilistically-Complete Motion Planning for Uncertain Nonlinear Systems

Figure 4 for Robust-RRT: Probabilistically-Complete Motion Planning for Uncertain Nonlinear Systems

Abstract:Robust motion planning entails computing a global motion plan that is safe under all possible uncertainty realizations, be it in the system dynamics, the robot's initial position, or with respect to external disturbances. Current approaches for robust motion planning either lack theoretical guarantees, or make restrictive assumptions on the system dynamics and uncertainty distributions. In this paper, we address these limitations by proposing the robust rapidly-exploring random-tree (Robust-RRT) algorithm, which integrates forward reachability analysis directly into sampling-based control trajectory synthesis. We prove that Robust-RRT is probabilistically complete (PC) for nonlinear Lipschitz continuous dynamical systems with bounded uncertainty. In other words, Robust-RRT eventually finds a robust motion plan that is feasible under all possible uncertainty realizations assuming such a plan exists. Our analysis applies even to unstable systems that admit only short-horizon feasible plans; this is because we explicitly consider the time evolution of reachable sets along control trajectories. Thanks to the explicit consideration of time dependency in our analysis, PC applies to unstabilizable systems. To the best of our knowledge, this is the most general PC proof for robust sampling-based motion planning, in terms of the types of uncertainties and dynamical systems it can handle. Considering that an exact computation of reachable sets can be computationally expensive for some dynamical systems, we incorporate sampling-based reachability analysis into Robust-RRT and demonstrate our robust planner on nonlinear, underactuated, and hybrid systems.

* 16 pages of main text + 5 pages of appendix, 5 figures, submitted to the 2022 International Symposium on Robotics Research

Via

Access Paper or Ask Questions