Picture for Pavel Tokmakov

Pavel Tokmakov

AnyView: Synthesizing Any Novel View in Dynamic Scenes

Add code
Jan 23, 2026
Viaarxiv icon

Walk through Paintings: Egocentric World Models from Internet Priors

Add code
Jan 21, 2026
Viaarxiv icon

AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis

Add code
Dec 12, 2025
Viaarxiv icon

Video Generators are Robot Policies

Add code
Aug 01, 2025
Viaarxiv icon

Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors

Add code
Jun 15, 2025
Viaarxiv icon

AllTracker: Efficient Dense Point Tracking at High Resolution

Add code
Jun 08, 2025
Viaarxiv icon

Understanding Complexity in VideoQA via Visual Program Generation

Add code
May 19, 2025
Viaarxiv icon

ReferEverything: Towards Segmenting Everything We Can Speak of in Videos

Add code
Oct 30, 2024
Figure 1 for ReferEverything: Towards Segmenting Everything We Can Speak of in Videos
Figure 2 for ReferEverything: Towards Segmenting Everything We Can Speak of in Videos
Figure 3 for ReferEverything: Towards Segmenting Everything We Can Speak of in Videos
Figure 4 for ReferEverything: Towards Segmenting Everything We Can Speak of in Videos
Viaarxiv icon

GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion

Add code
Sep 15, 2024
Figure 1 for GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Figure 2 for GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Figure 3 for GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Figure 4 for GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Viaarxiv icon

Dreamitate: Real-World Visuomotor Policy Learning via Video Generation

Add code
Jun 24, 2024
Figure 1 for Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Figure 2 for Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Figure 3 for Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Figure 4 for Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Viaarxiv icon