【13】Achieving Human Parity in Conversational Speech Recognition.pdf

Uploader: ss19673 | Uploaded: 2021-04-18 07:50:56 | PDF file | 249.19KB | Popularity: 15

Conversational speech recognition has served as a flagship speech recognition task since the release of the DARPA Switchboard corpus in the 1990s. In this paper, we measure the human error rate on the widely used NIST 2000 test set, and find that our latest automated system has reached human parity. The error rate of professional transcriptionists is 5.9% for the Switchboard portion of the data, in which newly acquainted pairs of people discuss an assigned topic, and 11.3% for the CallHome portion where friends and family members have open-ended conversations. In both cases, our automated system establishes a new state of the art, and edges past the human benchmark. This marks the first time that human parity has been reported for conversational speech. The key to our system’s performance is the systematic use of convolutional and LSTM neural networks, combined with a novel spatial smoothing method and lattice-free MMI acoustic training.

Index Terms: Conversational speech recognition, convolutional neural networks, recurrent neural networks, VGG, ResNet, LACE, BLSTM, spatial smoothing.
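The error rates quoted in the abstract compare a transcript against a reference transcript. As a rough illustration only, the sketch below computes a word error rate under the usual definition (substitutions, deletions and insertions divided by the number of reference words). The function name `word_error_rate` and the whitespace tokenization are assumptions made for this example; the abstract does not describe the scoring procedure used in the paper, which in practice also involves text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative sketch only: tokenizes on whitespace and applies no
    text normalization, unlike a real scoring pipeline.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("b" -> "x") and one deletion ("d") over 4 reference words.
    print(word_error_rate("a b c d", "a x c"))
```

Under this definition, an error rate of 5.9% corresponds to roughly 59 word errors per 1,000 reference words.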

More downloads

Automatic Speech and Speaker Recognition.pdf

Size: 0B | 2018-12-09 04:26:45

Deep Learning for NLP and Speech Recognition.pdf

Deep Learning for NLP and Speech Recognition

Size: 0B | 2019-09-03 10:30:52

New Era for Robust Speech Recognition.pdf

Shinji Watanabe • Marc Delcroix • Florian Metze • John R. Her...

Size: 0B | 2019-06-04 07:22:32

LISTEN ATTEND AND SPELL A NEURAL NETWORK FOR SPEECH RECOGNITION.pdf

The LAS architecture for speech recognition

Size: 0B | 2020-06-10 14:42:34

Receipt Recognition.pdf

Inspired by the recent successes of deep learning ...

Size: 845KB | 2021-05-02 15:46:12

Conversational Speech Transcription

Size: 0B | 2019-03-16 18:07:31

bird species recognition.pdf

This paper investigates acoustic modeling for reco...

Size: 1.93MB | 2021-04-18 07:51:02

语音识别基本原理Fundamentals of Speech Recognition.pdf

Basic principles of speech recognition Fundamental...

Size: 0B | 2019-06-25 17:54:01

Neural Networks for Pattern Recognition.pdf

For those entering the field of artificial neural ...

Size: 0B | 2018-12-29 05:04:47

Deep Residual Learning for Image Recognition.pdf

For neural networks, depth matters more than width and can to some extent mimic the human brain, but as depth increases, the vanishing gradient problem appears, ...

Size: 0B | 2019-07-26 14:58:57

speech recognition

This is the first example of speech recognition technology. The concept of speech technology actually comprises two technologies: a synthesizer and a recognizer (see Figure 1). The speech synthesizer...

Size: 0B | 2019-10-01 07:21:38

GradientBased Learning Applied to Document Recognition.pdf

Three deep learning luminaries, Yann LeCun, Y. Bengio, and Patrick Haffner, jointly...

Size: 0B | 2020-06-08 06:07:54

Deep Learningbased Food Image Recognition.pdf

Food recognition with deep learning: deep learning-based food image recognition

Size: 0B | 2019-09-22 22:28:23

Applying OCR Technology for Receipt Recognition.pdf

Applying OCR Technology for Receipt Recognition.pd...

Size: 7.37MB | 2021-03-20 16:33:19

Recurrent attention model for pedestrian attribute recognition.pdf

Recurrent attention model for pedestrian attribute recog...

Size: 0B | 2020-06-14 12:30:16