Picture for Wen-mei Hwu

Wen-mei Hwu

EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions

Add code
May 06, 2020
Figure 1 for EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions
Figure 2 for EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions
Figure 3 for EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions
Figure 4 for EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions
Viaarxiv icon

Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation

Add code
Apr 22, 2020
Figure 1 for Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation
Figure 2 for Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation
Figure 3 for Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation
Figure 4 for Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation
Viaarxiv icon

MLModelScope: A Distributed Platform for Model Evaluation and Benchmarking at Scale

Add code
Feb 19, 2020
Figure 1 for MLModelScope: A Distributed Platform for Model Evaluation and Benchmarking at Scale
Figure 2 for MLModelScope: A Distributed Platform for Model Evaluation and Benchmarking at Scale
Figure 3 for MLModelScope: A Distributed Platform for Model Evaluation and Benchmarking at Scale
Figure 4 for MLModelScope: A Distributed Platform for Model Evaluation and Benchmarking at Scale
Viaarxiv icon

DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs

Add code
Nov 20, 2019
Figure 1 for DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs
Figure 2 for DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs
Figure 3 for DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs
Figure 4 for DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs
Viaarxiv icon

The Design and Implementation of a Scalable DL Benchmarking Platform

Add code
Nov 19, 2019
Figure 1 for The Design and Implementation of a Scalable DL Benchmarking Platform
Figure 2 for The Design and Implementation of a Scalable DL Benchmarking Platform
Figure 3 for The Design and Implementation of a Scalable DL Benchmarking Platform
Figure 4 for The Design and Implementation of a Scalable DL Benchmarking Platform
Viaarxiv icon

Benanza: Automatic $μ$Benchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs

Add code
Nov 19, 2019
Figure 1 for Benanza: Automatic $μ$Benchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs
Figure 2 for Benanza: Automatic $μ$Benchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs
Figure 3 for Benanza: Automatic $μ$Benchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs
Figure 4 for Benanza: Automatic $μ$Benchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs
Viaarxiv icon

NAIS: Neural Architecture and Implementation Search and its Applications in Autonomous Driving

Add code
Nov 18, 2019
Figure 1 for NAIS: Neural Architecture and Implementation Search and its Applications in Autonomous Driving
Figure 2 for NAIS: Neural Architecture and Implementation Search and its Applications in Autonomous Driving
Figure 3 for NAIS: Neural Architecture and Implementation Search and its Applications in Autonomous Driving
Figure 4 for NAIS: Neural Architecture and Implementation Search and its Applications in Autonomous Driving
Viaarxiv icon

SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems

Add code
Sep 20, 2019
Figure 1 for SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems
Figure 2 for SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems
Figure 3 for SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems
Figure 4 for SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems
Viaarxiv icon

Across-Stack Profiling and Characterization of Machine Learning Models on GPUs

Add code
Aug 19, 2019
Figure 1 for Across-Stack Profiling and Characterization of Machine Learning Models on GPUs
Figure 2 for Across-Stack Profiling and Characterization of Machine Learning Models on GPUs
Figure 3 for Across-Stack Profiling and Characterization of Machine Learning Models on GPUs
Figure 4 for Across-Stack Profiling and Characterization of Machine Learning Models on GPUs
Viaarxiv icon

SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection

Add code
Jul 09, 2019
Figure 1 for SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection
Figure 2 for SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection
Figure 3 for SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection
Figure 4 for SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection
Viaarxiv icon