"Object Detection": models, code, and papers

CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification

Mar 14, 2024
Yiming Ma, Victor Sanchez, Tanaya Guha

MTMMC: A Large-Scale Real-World Multi-Modal Camera Tracking Benchmark

Mar 29, 2024
Sanghyun Woo, Kwanyong Park, Inkyu Shin, Myungchul Kim, In So Kweon

Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction

Mar 12, 2024
Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann, Eric Nalisnick

SeMoLi: What Moves Together Belongs Together

Feb 29, 2024
Jenny Seidenschwarz, Aljoša Ošep, Francesco Ferroni, Simon Lucey, Laura Leal-Taixé

FreeA: Human-object Interaction Detection using Free Annotation Labels

Mar 04, 2024
Yuxiao Wang, Zhenao Wei, Xinyu Jiang, Yu Lei, Weiying Xue, Jinxiu Liu, Qi Liu

MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning

Mar 13, 2024
Jialv Zou, Bencheng Liao, Qian Zhang, Wenyu Liu, Xinggang Wang

Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks

Mar 13, 2024
Khondoker Murad Hossain, Tim Oates

A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product

Mar 13, 2024
Ao Xiang, Zongqing Qi, Han Wang, Qin Yang, Danqing Ma

Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference

Mar 12, 2024
Changmin Jeon, Seonjun Kim, Juheon Yi, Youngki Lee

LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations

Mar 11, 2024
Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong
