State-of-the-art speech recognition with sequence-to-sequence models CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 644 | 2018 |
Fast and accurate recurrent neural network acoustic models for speech recognition H Sak, A Senior, K Rao, F Beaufays arXiv preprint arXiv:1507.06947, 2015 | 373 | 2015 |
Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 198 | 2019 |
Federated learning for mobile keyboard prediction A Hard, K Rao, R Mathews, S Ramaswamy, F Beaufays, S Augenstein, ... arXiv preprint arXiv:1811.03604, 2018 | 192 | 2018 |
A Comparison of Sequence-to-Sequence Models for Speech Recognition. R Prabhavalkar, K Rao, TN Sainath, B Li, L Johnson, N Jaitly Interspeech, 939-943, 2017 | 177 | 2017 |
Grapheme-to-phoneme conversion using long short-term memory recurrent neural networks K Rao, F Peng, H Sak, F Beaufays 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 167 | 2015 |
Learning acoustic frame labeling for speech recognition with recurrent neural networks H Sak, A Senior, K Rao, O Irsoy, A Graves, F Beaufays, J Schalkwyk 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 162 | 2015 |
Exploring architectures, data and units for streaming end-to-end speech recognition with rnn-transducer K Rao, H Sak, R Prabhavalkar 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 159 | 2017 |
Personalized speech recognition on mobile devices I McGraw, R Prabhavalkar, R Alvarez, MG Arenas, K Rao, D Rybach, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 121 | 2016 |
Acoustic modelling with cd-ctc-smbr lstm rnns H Sak, F de Chaumont Quitry, T Sainath, K Rao 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 118 | 2015 |
Multilingual speech recognition with a single end-to-end model S Toshniwal, TN Sainath, RJ Weiss, B Li, P Moreno, E Weinstein, K Rao 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 108 | 2018 |
Recurrent Neural Aligner: An Encoder-Decoder Neural Network Model for Sequence to Sequence Mapping. H Sak, M Shannon, K Rao, F Beaufays Interspeech 8, 1298-1302, 2017 | 77 | 2017 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 69 | 2019 |
Large-scale visual speech recognition B Shillingford, Y Assael, MW Hoffman, T Paine, C Hughes, U Prabhu, ... arXiv preprint arXiv:1807.05162, 2018 | 53 | 2018 |
Multi-dialect speech recognition with a single sequence-to-sequence model B Li, TN Sainath, KC Sim, M Bacchiani, E Weinstein, P Nguyen, Z Chen, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 53 | 2018 |
Multi-accent speech recognition with hierarchical grapheme based models K Rao, H Sak 2017 IEEE international conference on acoustics, speech and signal …, 2017 | 46 | 2017 |
Federated learning for emoji prediction in a mobile keyboard S Ramaswamy, R Mathews, K Rao, F Beaufays arXiv preprint arXiv:1906.04329, 2019 | 45 | 2019 |
Streaming small-footprint keyword spotting using sequence-to-sequence models Y He, R Prabhavalkar, K Rao, W Li, A Bakhtin, I McGraw 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 43 | 2017 |
An Analysis of" Attention" in Sequence-to-Sequence Models. R Prabhavalkar, TN Sainath, B Li, K Rao, N Jaitly Interspeech, 3702-3706, 2017 | 37 | 2017 |
Google voice search: faster and more accurate H Sak, A Senior, K Rao, F Beaufays, J Schalkwyk Google Research Blog, 2015, http://googleresearch. blogspot. ch/2015/09 …, 2015 | 25 | 2015 |