Picture for Angtian Wang

Angtian Wang

Envision: Embodied Visual Planning via Goal-Imagery Video Diffusion

Add code
Dec 27, 2025
Viaarxiv icon

StoryMem: Multi-shot Long Video Storytelling with Memory

Add code
Dec 22, 2025
Viaarxiv icon

VIVA: VLM-Guided Instruction-Based Video Editing with Reward Optimization

Add code
Dec 18, 2025
Viaarxiv icon

MAGREF: Masked Guidance for Any-Reference Video Generation

Add code
May 29, 2025
Viaarxiv icon

ATI: Any Trajectory Instruction for Controllable Video Generation

Add code
May 28, 2025
Viaarxiv icon

PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation

Add code
May 27, 2025
Viaarxiv icon

CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

Add code
Mar 13, 2025
Viaarxiv icon

Causal Image Modeling for Efficient Visual Understanding

Add code
Oct 10, 2024
Figure 1 for Causal Image Modeling for Efficient Visual Understanding
Figure 2 for Causal Image Modeling for Efficient Visual Understanding
Figure 3 for Causal Image Modeling for Efficient Visual Understanding
Figure 4 for Causal Image Modeling for Efficient Visual Understanding
Viaarxiv icon

iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning

Add code
Jul 12, 2024
Viaarxiv icon

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

Add code
Jun 02, 2024
Figure 1 for Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering
Figure 2 for Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering
Figure 3 for Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering
Figure 4 for Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering
Viaarxiv icon