Alert button

"Image": models, code, and papers
Alert button

Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation

Jun 23, 2023
Qianji Di, Wenxi Ma, Zhongang Qi, Tianxiang Hou, Ying Shan, Hanzi Wang

Figure 1 for Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation
Figure 2 for Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation
Figure 3 for Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation
Figure 4 for Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation
Viaarxiv icon

On the Fly Neural Style Smoothing for Risk-Averse Domain Generalization

Add code
Bookmark button
Alert button
Jul 17, 2023
Akshay Mehra, Yunbei Zhang, Bhavya Kailkhura, Jihun Hamm

Figure 1 for On the Fly Neural Style Smoothing for Risk-Averse Domain Generalization
Figure 2 for On the Fly Neural Style Smoothing for Risk-Averse Domain Generalization
Figure 3 for On the Fly Neural Style Smoothing for Risk-Averse Domain Generalization
Figure 4 for On the Fly Neural Style Smoothing for Risk-Averse Domain Generalization
Viaarxiv icon

Robust Single-view Cone-beam X-ray Pose Estimation with Neural Tuned Tomography (NeTT) and Masked Neural Radiance Fields (mNeRF)

Aug 01, 2023
Chaochao Zhou, Syed Hasib Akhter Faruqui, Abhinav Patel, Ramez N. Abdalla, Michael C. Hurley, Ali Shaibani, Matthew B. Potts, Babak S. Jahromi, Leon Cho, Sameer A. Ansari, Donald R. Cantrell

Figure 1 for Robust Single-view Cone-beam X-ray Pose Estimation with Neural Tuned Tomography (NeTT) and Masked Neural Radiance Fields (mNeRF)
Figure 2 for Robust Single-view Cone-beam X-ray Pose Estimation with Neural Tuned Tomography (NeTT) and Masked Neural Radiance Fields (mNeRF)
Figure 3 for Robust Single-view Cone-beam X-ray Pose Estimation with Neural Tuned Tomography (NeTT) and Masked Neural Radiance Fields (mNeRF)
Figure 4 for Robust Single-view Cone-beam X-ray Pose Estimation with Neural Tuned Tomography (NeTT) and Masked Neural Radiance Fields (mNeRF)
Viaarxiv icon

Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding

Aug 01, 2023
Runyu Ding, Jihan Yang, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi

Figure 1 for Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Figure 2 for Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Figure 3 for Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Figure 4 for Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Viaarxiv icon

Target Detection on Hyperspectral Images Using MCMC and VI Trained Bayesian Neural Networks

Aug 11, 2023
Daniel Ries, Jason Adams, Joshua Zollweg

Figure 1 for Target Detection on Hyperspectral Images Using MCMC and VI Trained Bayesian Neural Networks
Figure 2 for Target Detection on Hyperspectral Images Using MCMC and VI Trained Bayesian Neural Networks
Figure 3 for Target Detection on Hyperspectral Images Using MCMC and VI Trained Bayesian Neural Networks
Figure 4 for Target Detection on Hyperspectral Images Using MCMC and VI Trained Bayesian Neural Networks
Viaarxiv icon

Evidence of Human-Like Visual-Linguistic Integration in Multimodal Large Language Models During Predictive Language Processing

Add code
Bookmark button
Alert button
Aug 11, 2023
Viktor Kewenig, Christopher Edwards, Quitterie Lacome DEstalenx, Akilles Rechardt, Jeremy I Skipper, Gabriella Vigliocco

Figure 1 for Evidence of Human-Like Visual-Linguistic Integration in Multimodal Large Language Models During Predictive Language Processing
Figure 2 for Evidence of Human-Like Visual-Linguistic Integration in Multimodal Large Language Models During Predictive Language Processing
Figure 3 for Evidence of Human-Like Visual-Linguistic Integration in Multimodal Large Language Models During Predictive Language Processing
Figure 4 for Evidence of Human-Like Visual-Linguistic Integration in Multimodal Large Language Models During Predictive Language Processing
Viaarxiv icon

Collaborative Auto-encoding for Blind Image Quality Assessment

Add code
Bookmark button
Alert button
May 24, 2023
Zehong Zhou, Fei Zhou, Guoping Qiu

Figure 1 for Collaborative Auto-encoding for Blind Image Quality Assessment
Figure 2 for Collaborative Auto-encoding for Blind Image Quality Assessment
Figure 3 for Collaborative Auto-encoding for Blind Image Quality Assessment
Figure 4 for Collaborative Auto-encoding for Blind Image Quality Assessment
Viaarxiv icon

MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation

May 24, 2023
Marco Bellagente, Manuel Brack, Hannah Teufel, Felix Friedrich, Björn Deiseroth, Constantin Eichenberg, Andrew Dai, Robert Baldock, Souradeep Nanda, Koen Oostermeijer, Andres Felipe Cruz-Salinas, Patrick Schramowski, Kristian Kersting, Samuel Weinbach

Figure 1 for MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Figure 2 for MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Figure 3 for MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Figure 4 for MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Viaarxiv icon

Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity

May 14, 2023
Raman Dutt, Linus Ericsson, Pedro Sanchez, Sotirios A. Tsaftaris, Timothy Hospedales

Figure 1 for Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Figure 2 for Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Figure 3 for Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Figure 4 for Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Viaarxiv icon

Hierarchical Spatiotemporal Transformers for Video Object Segmentation

Jul 17, 2023
Jun-Sang Yoo, Hongjae Lee, Seung-Won Jung

Figure 1 for Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Figure 2 for Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Figure 3 for Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Figure 4 for Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Viaarxiv icon