Ying Shan

Advances in 3D Generation: A Survey

Jan 31, 2024
Xiaoyu Li, Qi Zhang, Di Kang, Weihao Cheng, Yiming Gao, Jingbo Zhang, Zhihao Liang, Jing Liao, Yan-Pei Cao, Ying Shan

YOLO-World: Real-Time Open-Vocabulary Object Detection

Jan 30, 2024
Tianheng Cheng, Lin Song, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan

RecDCL: Dual Contrastive Learning for Recommendation

Jan 28, 2024
Dan Zhang, Yangliao Geng, Wenwen Gong, Zhongang Qi, Zhiyu Chen, Xing Tang, Ying Shan, Yuxiao Dong, Jie Tang

TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts

Jan 26, 2024
Jingyu Zhuang, Di Kang, Yan-Pei Cao, Guanbin Li, Liang Lin, Ying Shan

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Jan 25, 2024
Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue

Supervised Fine-tuning in turn Improves Visual Foundation Models

Jan 18, 2024
Xiaohu Jiang, Yixiao Ge, Yuying Ge, Chun Yuan, Ying Shan

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Jan 17, 2024
Haoxin Chen, Yong Zhang, Xiaodong Cun, Menghan Xia, Xintao Wang, Chao Weng, Ying Shan

Towards A Better Metric for Text-to-Video Generation

Jan 15, 2024
Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou

LLaMA Pro: Progressive LLaMA with Block Expansion

Jan 04, 2024
Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ping Luo, Ying Shan

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

Dec 14, 2023
Jinguo Zhu, Xiaohan Ding, Yixiao Ge, Yuying Ge, Sijie Zhao, Hengshuang Zhao, Xiaohua Wang, Ying Shan
