Alert button

"Image": models, code, and papers
Alert button

Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models

Add code
Bookmark button
Alert button
Oct 26, 2023
Tsun-Hsuan Wang, Alaa Maalouf, Wei Xiao, Yutong Ban, Alexander Amini, Guy Rosman, Sertac Karaman, Daniela Rus

Figure 1 for Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
Figure 2 for Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
Figure 3 for Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
Figure 4 for Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
Viaarxiv icon

VLIS: Unimodal Language Models Guide Multimodal Language Generation

Add code
Bookmark button
Alert button
Oct 15, 2023
Jiwan Chung, Youngjae Yu

Viaarxiv icon

Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation

Sep 12, 2023
Ling Huang, Su Ruan, Pierre Decazes, Thierry Denoeux

Figure 1 for Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation
Figure 2 for Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation
Figure 3 for Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation
Figure 4 for Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation
Viaarxiv icon

Contrastive Grouping with Transformer for Referring Image Segmentation

Add code
Bookmark button
Alert button
Sep 02, 2023
Jiajin Tang, Ge Zheng, Cheng Shi, Sibei Yang

Figure 1 for Contrastive Grouping with Transformer for Referring Image Segmentation
Figure 2 for Contrastive Grouping with Transformer for Referring Image Segmentation
Figure 3 for Contrastive Grouping with Transformer for Referring Image Segmentation
Figure 4 for Contrastive Grouping with Transformer for Referring Image Segmentation
Viaarxiv icon

The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models

Add code
Bookmark button
Alert button
Oct 23, 2023
Xinyi Chen, Raquel Fernández, Sandro Pezzelle

Viaarxiv icon

Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond

Add code
Bookmark button
Alert button
Oct 23, 2023
Zhecan Wang, Long Chen, Haoxuan You, Keyang Xu, Yicheng He, Wenhao Li, Noal Codella, Kai-Wei Chang, Shih-Fu Chang

Figure 1 for Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Figure 2 for Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Figure 3 for Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Figure 4 for Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Viaarxiv icon

XTSC-Bench: Quantitative Benchmarking for Explainers on Time Series Classification

Add code
Bookmark button
Alert button
Oct 23, 2023
Jacqueline Höllig, Steffen Thoma, Florian Grimm

Viaarxiv icon

Joint Non-Linear MRI Inversion with Diffusion Priors

Oct 23, 2023
Moritz Erlacher, Martin Zach

Viaarxiv icon

Meaning Representations from Trajectories in Autoregressive Models

Oct 23, 2023
Tian Yu Liu, Matthew Trager, Alessandro Achille, Pramuditha Perera, Luca Zancato, Stefano Soatto

Viaarxiv icon

LLM4SGG: Large Language Model for Weakly Supervised Scene Graph Generation

Add code
Bookmark button
Alert button
Oct 20, 2023
Kibum Kim, Kanghoon Yoon, Jaehyeong Jeon, Yeonjun In, Jinyoung Moon, Donghyun Kim, Chanyoung Park

Viaarxiv icon