Picture for Dilin Wang

Dilin Wang

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Add code
Dec 01, 2023
Viaarxiv icon

Drag View: Generalizable Novel View Synthesis with Unposed Imagery

Add code
Oct 05, 2023
Viaarxiv icon

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models

Add code
Sep 05, 2023
Figure 1 for TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models
Figure 2 for TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models
Figure 3 for TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models
Figure 4 for TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models
Viaarxiv icon

Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts

Add code
Jun 08, 2023
Figure 1 for Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Figure 2 for Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Figure 3 for Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Figure 4 for Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Viaarxiv icon

PathFusion: Path-consistent Lidar-Camera Deep Feature Fusion

Add code
Dec 12, 2022
Viaarxiv icon

Fast Point Cloud Generation with Straight Flows

Add code
Dec 04, 2022
Figure 1 for Fast Point Cloud Generation with Straight Flows
Figure 2 for Fast Point Cloud Generation with Straight Flows
Figure 3 for Fast Point Cloud Generation with Straight Flows
Figure 4 for Fast Point Cloud Generation with Straight Flows
Viaarxiv icon

Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation

Add code
Nov 23, 2021
Figure 1 for Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
Figure 2 for Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
Figure 3 for Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
Figure 4 for Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
Viaarxiv icon

Temporally Consistent Online Depth Estimation in Dynamic Scenes

Add code
Nov 17, 2021
Figure 1 for Temporally Consistent Online Depth Estimation in Dynamic Scenes
Figure 2 for Temporally Consistent Online Depth Estimation in Dynamic Scenes
Figure 3 for Temporally Consistent Online Depth Estimation in Dynamic Scenes
Figure 4 for Temporally Consistent Online Depth Estimation in Dynamic Scenes
Viaarxiv icon

Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet

Add code
Oct 15, 2021
Figure 1 for Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet
Figure 2 for Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet
Figure 3 for Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet
Figure 4 for Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet
Viaarxiv icon

Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

Add code
Oct 07, 2021
Figure 1 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 2 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 3 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Figure 4 for Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Viaarxiv icon