Yiran Zhong

Deep Laparoscopic Stereo Matching with Transformers

Jul 25, 2022

Audio-Visual Segmentation

Jul 11, 2022

Vicinity Vision Transformer

Jun 21, 2022

Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective

Apr 10, 2022

Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition

Mar 29, 2022

Implicit Motion Handling for Video Camouflaged Object Detection

Mar 15, 2022

cosFormer: Rethinking Softmax in Attention

Feb 17, 2022

Transcribing Natural Languages for The Deaf via Neural Editing Programs

Dec 17, 2021

GETAM: Gradient-weighted Element-wise Transformer Attention Map for Weakly-supervised Semantic segmentation

Dec 06, 2021

MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation

Nov 29, 2021