Xinyi Xie

Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision

Apr 22, 2025

VidTwin: Video VAE with Decoupled Structure and Dynamics

Dec 23, 2024

Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?

Nov 27, 2024