End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

Chorowski, Jan; Bahdanau, Dzmitry; Cho, Kyunghyun; Bengio, Yoshua

Computer Science > Neural and Evolutionary Computing

arXiv:1412.1602 (cs)

[Submitted on 4 Dec 2014]

Title:End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

Authors:Jan Chorowski, Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio

View PDF

Abstract:We replace the Hidden Markov Model (HMM) which is traditionally used in in continuous speech recognition with a bi-directional recurrent neural network encoder coupled to a recurrent neural network decoder that directly emits a stream of phonemes. The alignment between the input and output sequences is established using an attention mechanism: the decoder emits each symbol based on a context created with a subset of input symbols elected by the attention mechanism. We report initial results demonstrating that this new approach achieves phoneme error rates that are comparable to the state-of-the-art HMM-based decoders, on the TIMIT dataset.

Comments:	As accepted to: Deep Learning and Representation Learning Workshop, NIPS 2014
Subjects:	Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1412.1602 [cs.NE]
	(or arXiv:1412.1602v1 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1412.1602

Submission history

From: Jan Chorowski [view email]
[v1] Thu, 4 Dec 2014 10:00:19 UTC (216 KB)

Computer Science > Neural and Evolutionary Computing

Title:End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators