Picture for Nanning Zheng

Nanning Zheng

Xi'an Jiaotong University

Finetuning Pretrained Vision-Language Models with Correlation Information Bottleneck for Robust Visual Question Answering

Add code
Sep 14, 2022
Figure 1 for Finetuning Pretrained Vision-Language Models with Correlation Information Bottleneck for Robust Visual Question Answering
Figure 2 for Finetuning Pretrained Vision-Language Models with Correlation Information Bottleneck for Robust Visual Question Answering
Figure 3 for Finetuning Pretrained Vision-Language Models with Correlation Information Bottleneck for Robust Visual Question Answering
Figure 4 for Finetuning Pretrained Vision-Language Models with Correlation Information Bottleneck for Robust Visual Question Answering
Viaarxiv icon

DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection

Add code
Jul 22, 2022
Figure 1 for DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection
Figure 2 for DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection
Figure 3 for DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection
Figure 4 for DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection
Viaarxiv icon

Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization

Add code
Jun 23, 2022
Figure 1 for Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
Figure 2 for Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
Viaarxiv icon

Social Interpretable Tree for Pedestrian Trajectory Prediction

Add code
May 26, 2022
Figure 1 for Social Interpretable Tree for Pedestrian Trajectory Prediction
Figure 2 for Social Interpretable Tree for Pedestrian Trajectory Prediction
Figure 3 for Social Interpretable Tree for Pedestrian Trajectory Prediction
Figure 4 for Social Interpretable Tree for Pedestrian Trajectory Prediction
Viaarxiv icon

Test-time Batch Normalization

Add code
May 20, 2022
Figure 1 for Test-time Batch Normalization
Figure 2 for Test-time Batch Normalization
Figure 3 for Test-time Batch Normalization
Figure 4 for Test-time Batch Normalization
Viaarxiv icon

Visual Concepts Tokenization

Add code
May 20, 2022
Figure 1 for Visual Concepts Tokenization
Figure 2 for Visual Concepts Tokenization
Figure 3 for Visual Concepts Tokenization
Figure 4 for Visual Concepts Tokenization
Viaarxiv icon

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models

Add code
Mar 07, 2022
Figure 1 for Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Figure 2 for Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Figure 3 for Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Figure 4 for Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Viaarxiv icon

Trajectory Forecasting from Detection with Uncertainty-Aware Motion Encoding

Add code
Feb 10, 2022
Viaarxiv icon

LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering

Add code
Nov 30, 2021
Figure 1 for LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering
Figure 2 for LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering
Figure 3 for LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering
Figure 4 for LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering
Viaarxiv icon

Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition

Add code
Nov 07, 2021
Figure 1 for Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Figure 2 for Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Figure 3 for Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Figure 4 for Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Viaarxiv icon