Alert button

"Image": models, code, and papers
Alert button

Unsupervised Audio-Visual Segmentation with Modality Alignment

Mar 21, 2024
Swapnil Bhosale, Haosen Yang, Diptesh Kanojia, Jiangkang Deng, Xiatian Zhu

Figure 1 for Unsupervised Audio-Visual Segmentation with Modality Alignment
Figure 2 for Unsupervised Audio-Visual Segmentation with Modality Alignment
Figure 3 for Unsupervised Audio-Visual Segmentation with Modality Alignment
Figure 4 for Unsupervised Audio-Visual Segmentation with Modality Alignment
Viaarxiv icon

EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition

Mar 21, 2024
Xu Zheng, Lin Wang

Figure 1 for EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition
Figure 2 for EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition
Figure 3 for EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition
Figure 4 for EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition
Viaarxiv icon

VidLA: Video-Language Alignment at Scale

Mar 21, 2024
Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi

Viaarxiv icon

ReGround: Improving Textual and Spatial Grounding at No Cost

Add code
Bookmark button
Alert button
Mar 20, 2024
Yuseung Lee, Minhyuk Sung

Figure 1 for ReGround: Improving Textual and Spatial Grounding at No Cost
Figure 2 for ReGround: Improving Textual and Spatial Grounding at No Cost
Figure 3 for ReGround: Improving Textual and Spatial Grounding at No Cost
Figure 4 for ReGround: Improving Textual and Spatial Grounding at No Cost
Viaarxiv icon

D-YOLO a robust framework for object detection in adverse weather conditions

Mar 20, 2024
Zihan Chu

Figure 1 for D-YOLO a robust framework for object detection in adverse weather conditions
Figure 2 for D-YOLO a robust framework for object detection in adverse weather conditions
Figure 3 for D-YOLO a robust framework for object detection in adverse weather conditions
Figure 4 for D-YOLO a robust framework for object detection in adverse weather conditions
Viaarxiv icon

Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat?

Mar 18, 2024
Melanie Mathys, Marco Willi, Michael Graber, Raphael Meier

Figure 1 for Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat?
Figure 2 for Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat?
Figure 3 for Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat?
Figure 4 for Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat?
Viaarxiv icon

Differentially Private Representation Learning via Image Captioning

Mar 04, 2024
Tom Sander, Yaodong Yu, Maziar Sanjabi, Alain Durmus, Yi Ma, Kamalika Chaudhuri, Chuan Guo

Figure 1 for Differentially Private Representation Learning via Image Captioning
Figure 2 for Differentially Private Representation Learning via Image Captioning
Figure 3 for Differentially Private Representation Learning via Image Captioning
Figure 4 for Differentially Private Representation Learning via Image Captioning
Viaarxiv icon

Modality-Agnostic Structural Image Representation Learning for Deformable Multi-Modality Medical Image Registration

Feb 29, 2024
Tony C. W. Mok, Zi Li, Yunhao Bai, Jianpeng Zhang, Wei Liu, Yan-Jie Zhou, Ke Yan, Dakai Jin, Yu Shi, Xiaoli Yin, Le Lu, Ling Zhang

Viaarxiv icon

Ultra Low-Cost Two-Stage Multimodal System for Non-Normative Behavior Detection

Mar 24, 2024
Albert Lu, Stephen Cranefield

Viaarxiv icon

Towards Two-Stream Foveation-based Active Vision Learning

Mar 24, 2024
Timur Ibrayev, Amitangshu Mukherjee, Sai Aparna Aketi, Kaushik Roy

Viaarxiv icon