Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Andrew Ng

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Dec 08, 2015
Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yogatama, Jun Zhan, Zhenyao Zhu

Figure 1 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Figure 2 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Figure 3 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Figure 4 for Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.

Via

Access Paper or Ask Questions

Driverseat: Crowdstrapping Learning Tasks for Autonomous Driving

Dec 07, 2015
Pranav Rajpurkar, Toki Migimatsu, Jeff Kiske, Royce Cheng-Yue, Sameep Tandon, Tao Wang, Andrew Ng

Figure 1 for Driverseat: Crowdstrapping Learning Tasks for Autonomous Driving

Figure 2 for Driverseat: Crowdstrapping Learning Tasks for Autonomous Driving

Figure 3 for Driverseat: Crowdstrapping Learning Tasks for Autonomous Driving

Figure 4 for Driverseat: Crowdstrapping Learning Tasks for Autonomous Driving

While emerging deep-learning systems have outclassed knowledge-based approaches in many tasks, their application to detection tasks for autonomous technologies remains an open field for scientific exploration. Broadly, there are two major developmental bottlenecks: the unavailability of comprehensively labeled datasets and of expressive evaluation strategies. Approaches for labeling datasets have relied on intensive hand-engineering, and strategies for evaluating learning systems have been unable to identify failure-case scenarios. Human intelligence offers an untapped approach for breaking through these bottlenecks. This paper introduces Driverseat, a technology for embedding crowds around learning systems for autonomous driving. Driverseat utilizes crowd contributions for (a) collecting complex 3D labels and (b) tagging diverse scenarios for ready evaluation of learning systems. We demonstrate how Driverseat can crowdstrap a convolutional neural network on the lane-detection task. More generally, crowdstrapping introduces a valuable paradigm for any technology that can benefit from leveraging the powerful combination of human and computer intelligence.

Via

Access Paper or Ask Questions

Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (2009)

Aug 28, 2014
Jeff Bilmes, Andrew Ng

This is the Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, which was held in Montreal, QC, Canada, June 18 - 21 2009.

Via

Access Paper or Ask Questions

Deep learning for class-generic object detection

Dec 24, 2013
Brody Huval, Adam Coates, Andrew Ng

Figure 1 for Deep learning for class-generic object detection

Figure 2 for Deep learning for class-generic object detection

Figure 3 for Deep learning for class-generic object detection

We investigate the use of deep neural networks for the novel task of class generic object detection. We show that neural networks originally designed for image recognition can be trained to detect objects within images, regardless of their class, including objects for which no bounding box labels have been provided. In addition, we show that bounding box labels yield a 1% performance increase on the ImageNet recognition challenge.

Via

Access Paper or Ask Questions

Tuned Models of Peer Assessment in MOOCs

Jul 09, 2013
Chris Piech, Jonathan Huang, Zhenghao Chen, Chuong Do, Andrew Ng, Daphne Koller

Figure 1 for Tuned Models of Peer Assessment in MOOCs

Figure 2 for Tuned Models of Peer Assessment in MOOCs

Figure 3 for Tuned Models of Peer Assessment in MOOCs

Figure 4 for Tuned Models of Peer Assessment in MOOCs

In massive open online courses (MOOCs), peer grading serves as a critical tool for scaling the grading of complex, open-ended assignments to courses with tens or hundreds of thousands of students. But despite promising initial trials, it does not always deliver accurate results compared to human experts. In this paper, we develop algorithms for estimating and correcting for grader biases and reliabilities, showing significant improvement in peer grading accuracy on real data with 63,199 peer grades from Coursera's HCI course offerings --- the largest peer grading networks analysed to date. We relate grader biases and reliabilities to other student factors such as student engagement, performance as well as commenting style. We also show that our model can lead to more intelligent assignment of graders to gradees.

* Proceedings of The 6th International Conference on Educational Data Mining (EDM 2013)

Via

Access Paper or Ask Questions