Changti Wu

LangForce: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

Jan 27, 2026
Via arXiv

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

Jan 21, 2026
Via arXiv

TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers

Jan 20, 2026
Via arXiv

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

Dec 18, 2025
Via arXiv

DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry

Oct 25, 2025
Via arXiv