"Image": models, code, and papers

OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition

Feb 29, 2024
Yuchen Pan, Junjun Jiang, Kui Jiang, Zhihao Wu, Keyuan Yu, Xianming Liu

Privacy-Preserving Autoencoder for Collaborative Object Detection

Feb 29, 2024
Bardia Azizian, Ivan Bajic

Continuous Sign Language Recognition Based on Motor attention mechanism and frame-level Self-distillation

Feb 29, 2024
Qidan Zhu, Jing Li, Fei Yuan, Quan Gan

Artwork Explanation in Large-scale Vision Language Models

Feb 29, 2024
Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

Outline-Guided Object Inpainting with Diffusion Models

Feb 26, 2024
Markus Pobitzer, Filip Janicki, Mattia Rigotti, Cristiano Malossi

BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM

Feb 26, 2024
Li Zhang, Youwei Liang, Pengtao Xie

SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model

Feb 28, 2024
Bin Cao, Jianhao Yuan, Yexin Liu, Jian Li, Shuyang Sun, Jing Liu, Bo Zhao

OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction

Feb 28, 2024
Jian Liu, Sipeng Zhang, Chuixin Kong, Wenyuan Zhang, Yuhang Wu, Yikang Ding, Borun Xu, Ruibo Ming, Donglai Wei, Xianming Liu

Downstream Task Guided Masking Learning in Masked Autoencoders Using Multi-Level Optimization

Feb 28, 2024
Han Guo, Ramtin Hosseini, Ruiyi Zhang, Sai Ashish Somayajula, Ranak Roy Chowdhury, Rajesh K. Gupta, Pengtao Xie

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Feb 06, 2024
Ling Yang, Zhaochen Yu, Chenlin Meng, Minkai Xu, Stefano Ermon, Bin Cui
