Picture for Shan Yang

Shan Yang

TokSing: Singing Voice Synthesis based on Discrete Tokens

Add code
Jun 12, 2024
Viaarxiv icon

TotalSegmentator MRI: Sequence-Independent Segmentation of 59 Anatomical Structures in MR images

Add code
May 29, 2024
Viaarxiv icon

EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization

Add code
Feb 21, 2024
Viaarxiv icon

MLLMReID: Multimodal Large Language Model-based Person Re-identification

Add code
Jan 24, 2024
Viaarxiv icon

VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering

Add code
Dec 13, 2023
Figure 1 for VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering
Figure 2 for VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering
Figure 3 for VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering
Figure 4 for VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering
Viaarxiv icon

A High Fidelity and Low Complexity Neural Audio Coding

Add code
Oct 17, 2023
Viaarxiv icon

MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation

Add code
Oct 06, 2023
Figure 1 for MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Figure 2 for MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Figure 3 for MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Figure 4 for MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Viaarxiv icon

ICAR: Image-based Complementary Auto Reasoning

Add code
Aug 17, 2023
Figure 1 for ICAR: Image-based Complementary Auto Reasoning
Figure 2 for ICAR: Image-based Complementary Auto Reasoning
Figure 3 for ICAR: Image-based Complementary Auto Reasoning
Figure 4 for ICAR: Image-based Complementary Auto Reasoning
Viaarxiv icon

RoSI: Recovering 3D Shape Interiors from Few Articulation Images

Add code
Apr 13, 2023
Figure 1 for RoSI: Recovering 3D Shape Interiors from Few Articulation Images
Figure 2 for RoSI: Recovering 3D Shape Interiors from Few Articulation Images
Figure 3 for RoSI: Recovering 3D Shape Interiors from Few Articulation Images
Figure 4 for RoSI: Recovering 3D Shape Interiors from Few Articulation Images
Viaarxiv icon

Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction

Add code
Mar 18, 2023
Figure 1 for Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
Figure 2 for Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
Figure 3 for Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
Figure 4 for Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
Viaarxiv icon