"Information": models, code, and papers

Backchannel Detection and Agreement Estimation from Video with Transformer Networks

Jun 02, 2023
Ahmed Amer, Chirag Bhuvaneshwara, Gowtham K. Addluri, Mohammed M. Shaik, Vedant Bonde, Philipp Müller

Via arXiv

Leveraging the Triple Exponential Moving Average for Fast-Adaptive Moment Estimation

Jun 02, 2023
Roi Peleg, Roi Weiss, Assaf Hoogi

Via arXiv

Video Colorization with Pre-trained Text-to-Image Diffusion Models

Jun 02, 2023
Hanyuan Liu, Minshan Xie, Jinbo Xing, Chengze Li, Tien-Tsin Wong

Via arXiv

Using Caterpillar to Nibble Small-Scale Images

May 28, 2023
Jin Sun, Xiaoshuang Shi, Zhiyuan Weng, Kaidi Xu, Heng Tao Shen, Xiaofeng Zhu

Via arXiv

KAFA: Rethinking Image Ad Understanding with Knowledge-Augmented Feature Adaptation of Vision-Language Models

May 28, 2023
Zhiwei Jia, Pradyumna Narayana, Arjun R. Akula, Garima Pruthi, Hao Su, Sugato Basu, Varun Jampani

Via arXiv

Retrieving Multimodal Information for Augmented Generation: A Survey

Mar 20, 2023
Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Xuan Long Do, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, Shafiq Joty

Via arXiv

TEC-Net: Vision Transformer Embrace Convolutional Neural Networks for Medical Image Segmentation

Jun 07, 2023
Tao Lei, Rui Sun, Yong Wan, Yong Xia, Xiaogang Du, Asoke K. Nandi

Via arXiv

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Jun 07, 2023
George Stein, Jesse C. Cresswell, Rasa Hosseinzadeh, Yi Sui, Brendan Leigh Ross, Valentin Villecroze, Zhaoyan Liu, Anthony L. Caterini, J. Eric T. Taylor, Gabriel Loaiza-Ganem

Via arXiv

GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?

Jun 07, 2023
Ning Ding, Yehui Tang, Zhongqian Fu, Chao Xu, Kai Han, Yunhe Wang

Via arXiv

A bioinspired three-stage model for camouflaged object detection

May 22, 2023
Tianyou Chen, Jin Xiao, Xiaoguang Hu, Guofeng Zhang, Shaojie Wang

Via arXiv