"Text": models, code, and papers

AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval

Nov 27, 2023
Shicheng Xu, Danyang Hou, Liang Pang, Jingcheng Deng, Jun Xu, Huawei Shen, Xueqi Cheng

Text-Driven Image Editing via Learnable Regions

Nov 28, 2023
Yuanze Lin, Yi-Wen Chen, Yi-Hsuan Tsai, Lu Jiang, Ming-Hsuan Yang

P2M2-Net: Part-Aware Prompt-Guided Multimodal Point Cloud Completion

Dec 29, 2023
Linlian Jiang, Pan Chen, Ye Wang, Tieru Wu, Rui Ma

Hardware Resilience Properties of Text-Guided Image Classifiers

Nov 23, 2023
Syed Talal Wasim, Kabila Haile Saboka, Abdulrahman Mahmoud, Salman Khan, David Brooks, Gu-Yeon Wei

UniHuman: A Unified Model for Editing Human Images in the Wild

Dec 22, 2023
Nannan Li, Qing Liu, Krishna Kumar Singh, Yilin Wang, Jianming Zhang, Bryan A. Plummer, Zhe Lin

kNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels

Dec 21, 2023
Jiaming Zhou, Shiwan Zhao, Yaqi Liu, Wenjia Zeng, Yong Chen, Yong Qin

MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training

Nov 28, 2023
Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel

ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

Dec 04, 2023
Dar-Yen Chen, Hamish Tennent, Ching-Wen Hsu

Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

Nov 23, 2023
Jiahao Li, Hao Tan, Kai Zhang, Zexiang Xu, Fujun Luan, Yinghao Xu, Yicong Hong, Kalyan Sunkavalli, Greg Shakhnarovich, Sai Bi

Spatial-Related Sensors Matters: 3D Human Motion Reconstruction Assisted with Textual Semantics

Dec 27, 2023
Xueyuan Yang, Chao Yao, Xiaojuan Ban
