Alert button

"Image": models, code, and papers
Alert button

EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Add code
Bookmark button
Alert button
Jan 07, 2024
Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen

Viaarxiv icon

Describing Differences in Image Sets with Natural Language

Dec 05, 2023
Lisa Dunlap, Yuhui Zhang, Xiaohan Wang, Ruiqi Zhong, Trevor Darrell, Jacob Steinhardt, Joseph E. Gonzalez, Serena Yeung-Levy

Viaarxiv icon

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Dec 01, 2023
Yunyang Xiong, Bala Varadarajan, Lemeng Wu, Xiaoyu Xiang, Fanyi Xiao, Chenchen Zhu, Xiaoliang Dai, Dilin Wang, Fei Sun, Forrest Iandola, Raghuraman Krishnamoorthi, Vikas Chandra

Viaarxiv icon

Prototype-Based Approach for One-Shot Segmentation of Brain Tumors using Few-Shot Learning

Jan 09, 2024
Ahmed Ayman

Viaarxiv icon

ESDMR-Net: A Lightweight Network With Expand-Squeeze and Dual Multiscale Residual Connections for Medical Image Segmentation

Dec 17, 2023
Tariq M Khan, Syed S. Naqvi, Erik Meijering

Viaarxiv icon

Stable Messenger: Steganography for Message-Concealed Image Generation

Dec 03, 2023
Quang Nguyen, Truong Vu, Cuong Pham, Anh Tran, Khoi Nguyen

Figure 1 for Stable Messenger: Steganography for Message-Concealed Image Generation
Figure 2 for Stable Messenger: Steganography for Message-Concealed Image Generation
Figure 3 for Stable Messenger: Steganography for Message-Concealed Image Generation
Figure 4 for Stable Messenger: Steganography for Message-Concealed Image Generation
Viaarxiv icon

Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation

Jan 02, 2024
Renshuai Liu, Bowen Ma, Wei Zhang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Xuan Cheng

Viaarxiv icon

Fair Text-to-Image Diffusion via Fair Mapping

Nov 29, 2023
Jia Li, Lijie Hu, Jingfeng Zhang, Tianhang Zheng, Hua Zhang, Di Wang

Figure 1 for Fair Text-to-Image Diffusion via Fair Mapping
Figure 2 for Fair Text-to-Image Diffusion via Fair Mapping
Figure 3 for Fair Text-to-Image Diffusion via Fair Mapping
Figure 4 for Fair Text-to-Image Diffusion via Fair Mapping
Viaarxiv icon

Large Language Models as Visual Cross-Domain Learners

Jan 06, 2024
Shuhao Chen, Yulong Zhang, Weisen Jiang, Jiangang Lu, Yu Zhang

Viaarxiv icon

UGGNet: Bridging U-Net and VGG for Advanced Breast Cancer Diagnosis

Jan 06, 2024
Tran Cao Minh, Nguyen Kim Quoc, Phan Cong Vinh, Dang Nhu Phu, Vuong Xuan Chi, Ha Minh Tan

Viaarxiv icon