Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tenny Yin

Video Generation Models in Robotics -- Applications, Research Challenges, Future Directions

Jan 12, 2026

Zhiting Mei, Tenny Yin, Ola Shorinwa, Apurva Badithela, Zhonghe Zheng, Joseph Bruno, Madison Bland, Lihan Zha, Asher Hancock, Jaime Fernández Fisac(+2 more)

Abstract:Video generation models have emerged as high-fidelity models of the physical world, capable of synthesizing high-quality videos capturing fine-grained interactions between agents and their environments conditioned on multi-modal user inputs. Their impressive capabilities address many of the long-standing challenges faced by physics-based simulators, driving broad adoption in many problem domains, e.g., robotics. For example, video models enable photorealistic, physically consistent deformable-body simulation without making prohibitive simplifying assumptions, which is a major bottleneck in physics-based simulation. Moreover, video models can serve as foundation world models that capture the dynamics of the world in a fine-grained and expressive way. They thus overcome the limited expressiveness of language-only abstractions in describing intricate physical interactions. In this survey, we provide a review of video models and their applications as embodied world models in robotics, encompassing cost-effective data generation and action prediction in imitation learning, dynamics and rewards modeling in reinforcement learning, visual planning, and policy evaluation. Further, we highlight important challenges hindering the trustworthy integration of video models in robotics, which include poor instruction following, hallucinations such as violations of physics, and unsafe content generation, in addition to fundamental limitations such as significant data curation, training, and inference costs. We present potential future directions to address these open research challenges to motivate research and ultimately facilitate broader applications, especially in safety-critical settings.

Via

Access Paper or Ask Questions

Online Resynthesis of High-Level Collaborative Tasks for Robots with Changing Capabilities

Sep 09, 2024

Amy Fang, Tenny Yin, Hadas Kress-Gazit

Abstract:Given a collaborative high-level task and a team of heterogeneous robots and behaviors to satisfy it, this work focuses on the challenge of automatically, at runtime, adjusting the individual robot behaviors such that the task is still satisfied, when robots encounter changes to their abilities--either failures or additional actions they can perform. We consider tasks encoded in LTL^\psi and minimize global teaming reassignments (and as a result, local resynthesis) when robots' capabilities change. We also increase the expressivity of LTL^\psi by including additional types of constraints on the overall teaming assignment that the user can specify, such as the minimum number of robots required for each assignment. We demonstrate the framework in a simulated warehouse scenario.

* Under review in IEEE Robotics and Automation Letters

Via

Access Paper or Ask Questions

Continuous Execution of High-Level Collaborative Tasks for Heterogeneous Robot Teams

Jun 26, 2024

Amy Fang, Tenny Yin, Jiawei Lin, Hadas Kress-Gazit

Figure 1 for Continuous Execution of High-Level Collaborative Tasks for Heterogeneous Robot Teams

Figure 2 for Continuous Execution of High-Level Collaborative Tasks for Heterogeneous Robot Teams

Figure 3 for Continuous Execution of High-Level Collaborative Tasks for Heterogeneous Robot Teams

Figure 4 for Continuous Execution of High-Level Collaborative Tasks for Heterogeneous Robot Teams

Abstract:We propose a control synthesis framework for a heterogeneous multi-robot system to satisfy collaborative tasks, where actions may take varying duration of time to complete. We encode tasks using the discrete logic LTL^\psi, which uses the concept of bindings to interleave robot actions and express information about relationship between specific task requirements and robot assignments. We present a synthesis approach to automatically generate a teaming assignment and corresponding discrete behavior that is correct-by-construction for continuous execution, while also implementing synchronization policies to ensure collaborative portions of the task are satisfied. We demonstrate our approach on a physical multi-robot system.

* Under review in IEEE Transactions on Robotics

Via

Access Paper or Ask Questions