Alert button
Picture for Shan Yang

Shan Yang

Alert button

EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization

Feb 21, 2024
Zhendong Xiao, Changhao Chen, Shan Yang, Wu Wei

Viaarxiv icon

MLLMReID: Multimodal Large Language Model-based Person Re-identification

Jan 24, 2024
Shan Yang, Yongfei Zhang

Viaarxiv icon

VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering

Dec 13, 2023
Xijun Wang, Junbang Liang, Chun-Kai Wang, Kenan Deng, Yu Lou, Ming Lin, Shan Yang

Viaarxiv icon

A High Fidelity and Low Complexity Neural Audio Coding

Oct 17, 2023
Wenzhe Liu, Wei Xiao, Meng Wang, Shan Yang, Yupeng Shi, Yuyong Kang, Dan Su, Shidong Shang, Dong Yu

Viaarxiv icon

MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation

Oct 06, 2023
Muhammad Osama Khan, Junbang Liang, Chun-Kai Wang, Shan Yang, Yu Lou

Figure 1 for MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Figure 2 for MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Figure 3 for MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Figure 4 for MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Viaarxiv icon

ICAR: Image-based Complementary Auto Reasoning

Aug 17, 2023
Xijun Wang, Anqi Liang, Junbang Liang, Ming Lin, Yu Lou, Shan Yang

Figure 1 for ICAR: Image-based Complementary Auto Reasoning
Figure 2 for ICAR: Image-based Complementary Auto Reasoning
Figure 3 for ICAR: Image-based Complementary Auto Reasoning
Figure 4 for ICAR: Image-based Complementary Auto Reasoning
Viaarxiv icon

RoSI: Recovering 3D Shape Interiors from Few Articulation Images

Apr 13, 2023
Akshay Gadi Patil, Yiming Qian, Shan Yang, Brian Jackson, Eric Bennett, Hao Zhang

Figure 1 for RoSI: Recovering 3D Shape Interiors from Few Articulation Images
Figure 2 for RoSI: Recovering 3D Shape Interiors from Few Articulation Images
Figure 3 for RoSI: Recovering 3D Shape Interiors from Few Articulation Images
Figure 4 for RoSI: Recovering 3D Shape Interiors from Few Articulation Images
Viaarxiv icon

Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction

Mar 18, 2023
Jiayang Bai, Zhen He, Shan Yang, Jie Guo, Zhenyu Chen, Yan Zhang, Yanwen Guo

Figure 1 for Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
Figure 2 for Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
Figure 3 for Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
Figure 4 for Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
Viaarxiv icon

Aligning Multi-Sequence CMR Towards Fully Automated Myocardial Pathology Segmentation

Feb 07, 2023
Wangbin Ding, Lei Li, Junyi Qiu, Sihan Wang, Liqin Huang, Yinyin Chen, Shan Yang, Xiahai Zhuang

Figure 1 for Aligning Multi-Sequence CMR Towards Fully Automated Myocardial Pathology Segmentation
Figure 2 for Aligning Multi-Sequence CMR Towards Fully Automated Myocardial Pathology Segmentation
Figure 3 for Aligning Multi-Sequence CMR Towards Fully Automated Myocardial Pathology Segmentation
Figure 4 for Aligning Multi-Sequence CMR Towards Fully Automated Myocardial Pathology Segmentation
Viaarxiv icon

UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis

Dec 06, 2022
Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, Dan Su

Figure 1 for UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Figure 2 for UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Figure 3 for UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Figure 4 for UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis
Viaarxiv icon