Alert button

"Text": models, code, and papers
Alert button

MotionScript: Natural Language Descriptions for Expressive 3D Human Motions

Dec 19, 2023
Payam Jome Yazdian, Eric Liu, Li Cheng, Angelica Lim

Viaarxiv icon

VILA: On Pre-training for Visual Language Models

Dec 14, 2023
Ji Lin, Hongxu Yin, Wei Ping, Yao Lu, Pavlo Molchanov, Andrew Tao, Huizi Mao, Jan Kautz, Mohammad Shoeybi, Song Han

Figure 1 for VILA: On Pre-training for Visual Language Models
Figure 2 for VILA: On Pre-training for Visual Language Models
Figure 3 for VILA: On Pre-training for Visual Language Models
Figure 4 for VILA: On Pre-training for Visual Language Models
Viaarxiv icon

ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors

Nov 09, 2023
Jingwen Chen, Yingwei Pan, Ting Yao, Tao Mei

Viaarxiv icon

CLDR: Contrastive Learning Drug Response Models from Natural Language Supervision

Dec 17, 2023
Kun Li, Wenbin Hu

Viaarxiv icon

Mono3DVG: 3D Visual Grounding in Monocular Images

Dec 13, 2023
Yang Zhan, Yuan Yuan, Zhitong Xiong

Figure 1 for Mono3DVG: 3D Visual Grounding in Monocular Images
Figure 2 for Mono3DVG: 3D Visual Grounding in Monocular Images
Figure 3 for Mono3DVG: 3D Visual Grounding in Monocular Images
Figure 4 for Mono3DVG: 3D Visual Grounding in Monocular Images
Viaarxiv icon

LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching

Nov 19, 2023
Yixun Liang, Xin Yang, Jiantao Lin, Haodong Li, Xiaogang Xu, Yingcong Chen

Figure 1 for LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Figure 2 for LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Figure 3 for LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Figure 4 for LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Viaarxiv icon

Breathing Life Into Sketches Using Text-to-Video Priors

Nov 21, 2023
Rinon Gal, Yael Vinker, Yuval Alaluf, Amit H. Bermano, Daniel Cohen-Or, Ariel Shamir, Gal Chechik

Viaarxiv icon

Towards Robust Text Retrieval with Progressive Learning

Nov 20, 2023
Tong Wu, Yulei Qin, Enwei Zhang, Zihan Xu, Yuting Gao, Ke Li, Xing Sun

Figure 1 for Towards Robust Text Retrieval with Progressive Learning
Figure 2 for Towards Robust Text Retrieval with Progressive Learning
Figure 3 for Towards Robust Text Retrieval with Progressive Learning
Figure 4 for Towards Robust Text Retrieval with Progressive Learning
Viaarxiv icon

What Large Language Models Bring to Text-rich VQA?

Nov 13, 2023
Xuejing Liu, Wei Tang, Xinzhe Ni, Jinghui Lu, Rui Zhao, Zechao Li, Fei Tan

Figure 1 for What Large Language Models Bring to Text-rich VQA?
Figure 2 for What Large Language Models Bring to Text-rich VQA?
Figure 3 for What Large Language Models Bring to Text-rich VQA?
Figure 4 for What Large Language Models Bring to Text-rich VQA?
Viaarxiv icon

Learning Subject-Aware Cropping by Outpainting Professional Photos

Dec 19, 2023
James Hong, Lu Yuan, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian

Viaarxiv icon