"Image": models, code, and papers

Autonomous damage assessment of structural columns using low-cost micro aerial vehicles and multi-view computer vision

Aug 30, 2023
Sina Tavasoli, Xiao Pan, T. Y. Yang, Saudah Gazi, Mohsen Azimi

Via arXiv

Diving into Darkness: A Dual-Modulated Framework for High-Fidelity Super-Resolution in Ultra-Dark Environments

Sep 11, 2023
Jiaxin Gao, Ziyu Yue, Yaohua Liu, Sihan Xie, Xin Fan, Risheng Liu

Via arXiv

Comparative Study of Visual SLAM-Based Mobile Robot Localization Using Fiducial Markers

Sep 08, 2023
Jongwon Lee, Su Yeon Choi, David Hanley, Timothy Bretl

Via arXiv

Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment

Sep 08, 2023
Hongyu Hu, Tiancheng Lin, Jie Wang, Zhenbang Sun, Yi Xu

Via arXiv

Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection

Aug 22, 2023
Bingqing Zhang, Sen Wang, Yifan Liu, Brano Kusy, Xue Li, Jiajun Liu

Via arXiv

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation

Jul 01, 2023
Zhuowei Chen, Shancheng Fang, Wei Liu, Qian He, Mengqi Huang, Yongdong Zhang, Zhendong Mao

Via arXiv

A Multimodal Visual Encoding Model Aided by Introducing Verbal Semantic Information

Aug 29, 2023
Shuxiao Ma, Linyuan Wang, Bin Yan

Via arXiv

Multimodal Foundation Models For Echocardiogram Interpretation

Aug 29, 2023
Matthew Christensen, Milos Vukadinovic, Neal Yuan, David Ouyang

Via arXiv

Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis

Aug 29, 2023
Sotirios Kastanas, Shaomu Tan, Yi He

Via arXiv

VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection

Aug 25, 2023
Peng Wu, Xuerong Zhou, Guansong Pang, Lingru Zhou, Qingsen Yan, Peng Wang, Yanning Zhang

Via arXiv