Picture for Yi Zhu

Yi Zhu

Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue

Add code
Apr 18, 2022
Figure 1 for Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue
Figure 2 for Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue
Figure 3 for Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue
Figure 4 for Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue
Viaarxiv icon

Harnessing Interpretable Machine Learning for Origami Feature Design and Pattern Selection

Add code
Apr 12, 2022
Figure 1 for Harnessing Interpretable Machine Learning for Origami Feature Design and Pattern Selection
Figure 2 for Harnessing Interpretable Machine Learning for Origami Feature Design and Pattern Selection
Figure 3 for Harnessing Interpretable Machine Learning for Origami Feature Design and Pattern Selection
Figure 4 for Harnessing Interpretable Machine Learning for Origami Feature Design and Pattern Selection
Viaarxiv icon

ImpDet: Exploring Implicit Fields for 3D Object Detection

Add code
Mar 31, 2022
Figure 1 for ImpDet: Exploring Implicit Fields for 3D Object Detection
Figure 2 for ImpDet: Exploring Implicit Fields for 3D Object Detection
Figure 3 for ImpDet: Exploring Implicit Fields for 3D Object Detection
Figure 4 for ImpDet: Exploring Implicit Fields for 3D Object Detection
Viaarxiv icon

Prompt-Learning for Short Text Classification

Add code
Mar 31, 2022
Figure 1 for Prompt-Learning for Short Text Classification
Figure 2 for Prompt-Learning for Short Text Classification
Figure 3 for Prompt-Learning for Short Text Classification
Figure 4 for Prompt-Learning for Short Text Classification
Viaarxiv icon

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Add code
Mar 24, 2022
Figure 1 for BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Figure 2 for BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Figure 3 for BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Figure 4 for BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Viaarxiv icon

Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis

Add code
Mar 22, 2022
Figure 1 for Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis
Figure 2 for Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis
Figure 3 for Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis
Figure 4 for Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis
Viaarxiv icon

Contrastive Instruction-Trajectory Learning for Vision-Language Navigation

Add code
Dec 09, 2021
Figure 1 for Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
Figure 2 for Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
Figure 3 for Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
Figure 4 for Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
Viaarxiv icon

Blending Anti-Aliasing into Vision Transformer

Add code
Oct 28, 2021
Figure 1 for Blending Anti-Aliasing into Vision Transformer
Figure 2 for Blending Anti-Aliasing into Vision Transformer
Figure 3 for Blending Anti-Aliasing into Vision Transformer
Figure 4 for Blending Anti-Aliasing into Vision Transformer
Viaarxiv icon

CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations

Add code
Sep 30, 2021
Figure 1 for CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations
Figure 2 for CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations
Figure 3 for CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations
Figure 4 for CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations
Viaarxiv icon

An Unsupervised Method for Building Sentence Simplification Corpora in Multiple Languages

Add code
Sep 01, 2021
Figure 1 for An Unsupervised Method for Building Sentence Simplification Corpora in Multiple Languages
Figure 2 for An Unsupervised Method for Building Sentence Simplification Corpora in Multiple Languages
Figure 3 for An Unsupervised Method for Building Sentence Simplification Corpora in Multiple Languages
Figure 4 for An Unsupervised Method for Building Sentence Simplification Corpora in Multiple Languages
Viaarxiv icon