Jonathan Huang

Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations

May 03, 2023
Via arXiv

Local Metrics for Multi-Object Tracking

Apr 06, 2021
Via arXiv

The surprising impact of mask-head architecture on novel class segmentation

Apr 01, 2021
Via arXiv

PERF-Net: Pose Empowered RGB-Flow Net

Sep 28, 2020
Via arXiv

Compact Speaker Embedding: lrx-vector

Aug 11, 2020
Via arXiv

RetinaTrack: Online Single Stage Joint Detection and Tracking

Mar 30, 2020
Via arXiv

Exploring Context, Attention and Audio Features for Audio Visual Scene-Aware Dialog

Dec 20, 2019
Via arXiv

Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog

Dec 20, 2019
Via arXiv

Long Term Temporal Context for Per-Camera Object Detection

Dec 07, 2019
Via arXiv

Structural sparsification for Far-field Speaker Recognition with GNA

Oct 25, 2019
Via arXiv