Alert button

"Image": models, code, and papers
Alert button

VILA: On Pre-training for Visual Language Models

Add code
Bookmark button
Alert button
Dec 14, 2023
Ji Lin, Hongxu Yin, Wei Ping, Yao Lu, Pavlo Molchanov, Andrew Tao, Huizi Mao, Jan Kautz, Mohammad Shoeybi, Song Han

Figure 1 for VILA: On Pre-training for Visual Language Models
Figure 2 for VILA: On Pre-training for Visual Language Models
Figure 3 for VILA: On Pre-training for Visual Language Models
Figure 4 for VILA: On Pre-training for Visual Language Models
Viaarxiv icon

Exploring the Naturalness of AI-Generated Images

Add code
Bookmark button
Alert button
Dec 14, 2023
Zijian Chen, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

Viaarxiv icon

Brain-optimized inference improves reconstructions of fMRI brain activity

Dec 12, 2023
Reese Kneeland, Jordyn Ojeda, Ghislain St-Yves, Thomas Naselaris

Viaarxiv icon

VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering

Dec 13, 2023
Xijun Wang, Junbang Liang, Chun-Kai Wang, Kenan Deng, Yu Lou, Ming Lin, Shan Yang

Viaarxiv icon

Diffusing More Objects for Semi-Supervised Domain Adaptation with Less Labeling

Dec 19, 2023
Leander van den Heuvel, Gertjan Burghouts, David W. Zhang, Gwenn Englebienne, Sabina B. van Rooij

Viaarxiv icon

Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation

Add code
Bookmark button
Alert button
Dec 15, 2023
YoungJoon Yoo, Jongwon Choi

Figure 1 for Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Figure 2 for Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Figure 3 for Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Figure 4 for Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Viaarxiv icon

Multiscale Vision Transformer With Deep Clustering-Guided Refinement for Weakly Supervised Object Localization

Dec 15, 2023
David Kim, Sinhae Cha, Byeongkeun Kang

Viaarxiv icon

Enhancing Neural Training via a Correlated Dynamics Model

Dec 20, 2023
Jonathan Brokman, Roy Betser, Rotem Turjeman, Tom Berkov, Ido Cohen, Guy Gilboa

Viaarxiv icon

On the Quantification of Image Reconstruction Uncertainty without Training Data

Nov 16, 2023
Sirui Bi, Victor Fung, Jiaxin Zhang

Viaarxiv icon

Enhancing Instance-Level Image Classification with Set-Level Labels

Nov 09, 2023
Renyu Zhang, Aly A. Khan, Yuxin Chen, Robert L. Grossman

Viaarxiv icon