Alert button

"Image": models, code, and papers
Alert button

Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences

Add code
Bookmark button
Alert button
Jul 31, 2023
Dingyi Yang, Hongyu Chen, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin

Figure 1 for Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Figure 2 for Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Figure 3 for Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Figure 4 for Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Viaarxiv icon

Lightweight Super-Resolution Head for Human Pose Estimation

Add code
Bookmark button
Alert button
Jul 31, 2023
Haonan Wang, Jie Liu, Jie Tang, Gangshan Wu

Figure 1 for Lightweight Super-Resolution Head for Human Pose Estimation
Figure 2 for Lightweight Super-Resolution Head for Human Pose Estimation
Figure 3 for Lightweight Super-Resolution Head for Human Pose Estimation
Figure 4 for Lightweight Super-Resolution Head for Human Pose Estimation
Viaarxiv icon

Multi-layer Aggregation as a key to feature-based OOD detection

Jul 28, 2023
Benjamin Lambert, Florence Forbes, Senan Doyle, Michel Dojat

Figure 1 for Multi-layer Aggregation as a key to feature-based OOD detection
Figure 2 for Multi-layer Aggregation as a key to feature-based OOD detection
Figure 3 for Multi-layer Aggregation as a key to feature-based OOD detection
Viaarxiv icon

Supervised Homography Learning with Realistic Dataset Generation

Add code
Bookmark button
Alert button
Jul 28, 2023
Hai Jiang, Haipeng Li, Songchen Han, Haoqiang Fan, Bing Zeng, Shuaicheng Liu

Figure 1 for Supervised Homography Learning with Realistic Dataset Generation
Figure 2 for Supervised Homography Learning with Realistic Dataset Generation
Figure 3 for Supervised Homography Learning with Realistic Dataset Generation
Figure 4 for Supervised Homography Learning with Realistic Dataset Generation
Viaarxiv icon

Beyond Reality: The Pivotal Role of Generative AI in the Metaverse

Jul 28, 2023
Vinay Chamola, Gaurang Bansal, Tridib Kumar Das, Vikas Hassija, Naga Siva Sai Reddy, Jiacheng Wang, Sherali Zeadally, Amir Hussain, F. Richard Yu, Mohsen Guizani, Dusit Niyato

Figure 1 for Beyond Reality: The Pivotal Role of Generative AI in the Metaverse
Figure 2 for Beyond Reality: The Pivotal Role of Generative AI in the Metaverse
Figure 3 for Beyond Reality: The Pivotal Role of Generative AI in the Metaverse
Figure 4 for Beyond Reality: The Pivotal Role of Generative AI in the Metaverse
Viaarxiv icon

Medical Image Deidentification, Cleaning and Compression Using Pylogik

May 03, 2023
Adrienne Kline, Vinesh Appadurai, Yuan Luo, Sanjiv Shah

Figure 1 for Medical Image Deidentification, Cleaning and Compression Using Pylogik
Figure 2 for Medical Image Deidentification, Cleaning and Compression Using Pylogik
Figure 3 for Medical Image Deidentification, Cleaning and Compression Using Pylogik
Figure 4 for Medical Image Deidentification, Cleaning and Compression Using Pylogik
Viaarxiv icon

Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination

Add code
Bookmark button
Alert button
May 25, 2023
Hao Fei, Qian Liu, Meishan Zhang, Min Zhang, Tat-Seng Chua

Figure 1 for Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
Figure 2 for Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
Figure 3 for Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
Figure 4 for Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
Viaarxiv icon

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

Add code
Bookmark button
Alert button
Jul 03, 2023
Rui Sun, Zhecan Wang, Haoxuan You, Noel Codella, Kai-Wei Chang, Shih-Fu Chang

Figure 1 for UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
Figure 2 for UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
Figure 3 for UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
Figure 4 for UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
Viaarxiv icon

Boundary-weighted logit consistency improves calibration of segmentation networks

Jul 16, 2023
Neerav Karani, Neel Dey, Polina Golland

Viaarxiv icon

Multimodal Machine Learning for Extraction of Theorems and Proofs in the Scientific Literature

Add code
Bookmark button
Alert button
Jul 18, 2023
Shrey Mishra, Antoine Gauquier, Pierre Senellart

Figure 1 for Multimodal Machine Learning for Extraction of Theorems and Proofs in the Scientific Literature
Figure 2 for Multimodal Machine Learning for Extraction of Theorems and Proofs in the Scientific Literature
Figure 3 for Multimodal Machine Learning for Extraction of Theorems and Proofs in the Scientific Literature
Figure 4 for Multimodal Machine Learning for Extraction of Theorems and Proofs in the Scientific Literature
Viaarxiv icon