Picture for Songyan Zhang

Songyan Zhang

AutoMoT: A Unified Vision-Language-Action Model with Asynchronous Mixture-of-Transformers for End-to-End Autonomous Driving

Add code
Mar 16, 2026
Viaarxiv icon

POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction

Add code
Apr 08, 2025
Viaarxiv icon

WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model

Add code
Dec 13, 2024
Figure 1 for WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model
Figure 2 for WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model
Figure 3 for WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model
Figure 4 for WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model
Viaarxiv icon

Digging Into Normal Incorporated Stereo Matching

Add code
Feb 28, 2024
Figure 1 for Digging Into Normal Incorporated Stereo Matching
Figure 2 for Digging Into Normal Incorporated Stereo Matching
Figure 3 for Digging Into Normal Incorporated Stereo Matching
Figure 4 for Digging Into Normal Incorporated Stereo Matching
Viaarxiv icon

RGM: A Robust Generalist Matching Model

Add code
Oct 19, 2023
Figure 1 for RGM: A Robust Generalist Matching Model
Figure 2 for RGM: A Robust Generalist Matching Model
Figure 3 for RGM: A Robust Generalist Matching Model
Figure 4 for RGM: A Robust Generalist Matching Model
Viaarxiv icon

DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation

Add code
May 24, 2021
Figure 1 for DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation
Figure 2 for DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation
Figure 3 for DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation
Figure 4 for DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation
Viaarxiv icon

EDNet: Efficient Disparity Estimation with Combination Volume and Spatial Attention based Residual Learning

Add code
Nov 08, 2020
Figure 1 for EDNet: Efficient Disparity Estimation with Combination Volume and Spatial Attention based Residual Learning
Figure 2 for EDNet: Efficient Disparity Estimation with Combination Volume and Spatial Attention based Residual Learning
Figure 3 for EDNet: Efficient Disparity Estimation with Combination Volume and Spatial Attention based Residual Learning
Figure 4 for EDNet: Efficient Disparity Estimation with Combination Volume and Spatial Attention based Residual Learning
Viaarxiv icon