"Image": models, code, and papers

Event-based Human Pose Tracking by Spiking Spatiotemporal Transformer

Mar 16, 2023
Shihao Zou, Yuxuan Mu, Xinxin Zuo, Sen Wang, Li Cheng

Via arXiv

From MNIST to ImageNet and Back: Benchmarking Continual Curriculum Learning

Mar 16, 2023
Kamil Faber, Dominik Zurek, Marcin Pietron, Nathalie Japkowicz, Antonio Vergari, Roberto Corizzo

Via arXiv

NeuroDAVIS: A neural network model for data visualization

Apr 01, 2023
Chayan Maitra, Dibyendu B. Seal, Rajat K. De

Via arXiv

Weakly-Supervised HOI Detection from Interaction Labels Only and Language/Vision-Language Priors

Mar 09, 2023
Mesut Erhan Unal, Adriana Kovashka

Via arXiv

Cones: Concept Neurons in Diffusion Models for Customized Generation

Mar 09, 2023
Zhiheng Liu, Ruili Feng, Kai Zhu, Yifei Zhang, Kecheng Zheng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao

Via arXiv

SMR: Satisfied Machine Ratio Modeling for Machine Recognition-Oriented Image and Video Compression

Nov 13, 2022
Qi Zhang, Shanshe Wang, Xinfeng Zhang, Chuanmin Jia, Jingshan Pan, Siwei Ma, Wen Gao

Via arXiv

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models

Mar 12, 2023
Zangwei Zheng, Mingyuan Ma, Kai Wang, Ziheng Qin, Xiangyu Yue, Yang You

Via arXiv

Depth-based 6DoF Object Pose Estimation using Swin Transformer

Mar 03, 2023
Zhujun Li, Ioannis Stamos

Via arXiv

MoRF: Mobile Realistic Fullbody Avatars from a Monocular Video

Mar 17, 2023
Alexey Larionov, Evgeniya Ustinova, Mikhail Sidorenko, David Svitov, Ilya Zakharkin, Victor Lempitsky, Renat Bashirov

Via arXiv

BEL: A Bag Embedding Loss for Transformer enhances Multiple Instance Whole Slide Image Classification

Mar 02, 2023
Daniel Sens, Ario Sadafi, Francesco Paolo Casale, Nassir Navab, Carsten Marr

Via arXiv