Picture for Ting Yao

Ting Yao

ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding

Add code
Aug 05, 2022
Figure 1 for ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding
Figure 2 for ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding
Figure 3 for ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding
Figure 4 for ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding
Viaarxiv icon

Lightweight and Progressively-Scalable Networks for Semantic Segmentation

Add code
Jul 27, 2022
Figure 1 for Lightweight and Progressively-Scalable Networks for Semantic Segmentation
Figure 2 for Lightweight and Progressively-Scalable Networks for Semantic Segmentation
Figure 3 for Lightweight and Progressively-Scalable Networks for Semantic Segmentation
Figure 4 for Lightweight and Progressively-Scalable Networks for Semantic Segmentation
Viaarxiv icon

Dual Vision Transformer

Add code
Jul 12, 2022
Figure 1 for Dual Vision Transformer
Figure 2 for Dual Vision Transformer
Figure 3 for Dual Vision Transformer
Figure 4 for Dual Vision Transformer
Viaarxiv icon

Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning

Add code
Jul 11, 2022
Figure 1 for Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
Figure 2 for Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
Figure 3 for Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
Figure 4 for Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
Viaarxiv icon

Bi-Calibration Networks for Weakly-Supervised Video Representation Learning

Add code
Jun 21, 2022
Figure 1 for Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
Figure 2 for Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
Figure 3 for Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
Figure 4 for Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
Viaarxiv icon

Stand-Alone Inter-Frame Attention in Video Models

Add code
Jun 14, 2022
Figure 1 for Stand-Alone Inter-Frame Attention in Video Models
Figure 2 for Stand-Alone Inter-Frame Attention in Video Models
Figure 3 for Stand-Alone Inter-Frame Attention in Video Models
Figure 4 for Stand-Alone Inter-Frame Attention in Video Models
Viaarxiv icon

Comprehending and Ordering Semantics for Image Captioning

Add code
Jun 14, 2022
Figure 1 for Comprehending and Ordering Semantics for Image Captioning
Figure 2 for Comprehending and Ordering Semantics for Image Captioning
Figure 3 for Comprehending and Ordering Semantics for Image Captioning
Figure 4 for Comprehending and Ordering Semantics for Image Captioning
Viaarxiv icon

MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing

Add code
Jun 13, 2022
Figure 1 for MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Figure 2 for MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Figure 3 for MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Figure 4 for MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Viaarxiv icon

Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection

Add code
Jun 13, 2022
Figure 1 for Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection
Figure 2 for Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection
Figure 3 for Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection
Figure 4 for Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection
Viaarxiv icon

Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation

Add code
Jun 13, 2022
Figure 1 for Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation
Figure 2 for Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation
Figure 3 for Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation
Figure 4 for Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation
Viaarxiv icon