"Information": models, code, and papers

TraveLER: A Multi-LMM Agent Framework for Video Question-Answering

Apr 01, 2024
Chuyi Shang, Amos You, Sanjay Subramanian, Trevor Darrell, Roei Herzig

Multi-Agent Team Access Monitoring: Environments that Benefit from Target Information Sharing

Mar 28, 2024
Andrew Dudash, Scott James, Ryan Rubel

Attribution Regularization for Multimodal Paradigms

Apr 02, 2024
Sahiti Yerramilli, Jayant Sravan Tamarapalli, Jonathan Francis, Eric Nyberg

Contextual Embedding Learning to Enhance 2D Networks for Volumetric Image Segmentation

Apr 02, 2024
Zhuoyuan Wang, Dong Sun, Xiangyun Zeng, Ruodai Wu, Yi Wang

An Active Perception Game for Robust Autonomous Exploration

Mar 31, 2024
Siming He, Yuezhan Tao, Igor Spasojevic, Vijay Kumar, Pratik Chaudhari

TSOM: Small Object Motion Detection Neural Network Inspired by Avian Visual Circuit

Apr 01, 2024
Pignge Hu, Xiaoteng Zhang, Mengmeng Li, Yingjie Zhu, Li Shi

Dynamic Demonstration Retrieval and Cognitive Understanding for Emotional Support Conversation

Apr 03, 2024
Zhe Xu, Daoyuan Chen, Jiayi Kuang, Zihao Yi, Yaliang Li, Ying Shen

Cohort-Individual Cooperative Learning for Multimodal Cancer Survival Analysis

Apr 03, 2024
Huajun Zhou, Fengtao Zhou, Hao Chen

Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation

Apr 03, 2024
Xiaoshuang Huang, Hongxiang Li, Meng Cao, Long Chen, Chenyu You, Dong An

SalFoM: Dynamic Saliency Prediction with Video Foundation Models

Apr 03, 2024
Morteza Moradi, Mohammad Moradi, Francesco Rundo, Concetto Spampinato, Ali Borji, Simone Palazzo
