Transfer learning from adult to children for speech recognition: Evaluation, analysis and recommendations PG Shivakumar, P Georgiou Computer speech & language 63, 101077, 2020 | 170 | 2020 |
Multimodal and multiresolution depression detection from speech and facial landmark features M Nasir, A Jati, PG Shivakumar, S Nallan Chakravarthula, P Georgiou Proceedings of the 6th international workshop on audio/visual emotion …, 2016 | 152 | 2016 |
Improving speech recognition for children using acoustic adaptation and pronunciation modeling. PG Shivakumar, A Potamianos, S Lee, SS Narayanan WOCCI, 15-19, 2014 | 93 | 2014 |
Perception optimized deep denoising autoencoders for speech enhancement. PG Shivakumar, PG Georgiou Interspeech, 3743-3747, 2016 | 57 | 2016 |
End-to-end neural systems for automatic children speech recognition: An empirical study PG Shivakumar, S Narayanan Computer Speech & Language 72, 101289, 2022 | 49 | 2022 |
Low-rank adaptation of large language model rescoring for parameter-efficient speech recognition Y Yu, CHH Yang, J Kolehmainen, PG Shivakumar, Y Gu, SRR Ren, Q Luo, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 39 | 2023 |
Spoken Language Intent Detection Using Confusion2Vec PG Shivakumar, M Yang, P Georgiou Proc. Interspeech 2019, 819--823, 2019 | 35 | 2019 |
Learning from past mistakes: improving automatic speech recognition output via noisy-clean phrase context modeling PG Shivakumar, H Li, K Knight, P Georgiou APSIPA Transactions on Signal and Information Processing 8, e8, 2019 | 31 | 2019 |
Confusion2vec: Towards enriching vector space word representations with representational ambiguities PG Shivakumar, P Georgiou PeerJ Computer Science 5, e195, 2019 | 26 | 2019 |
Simplified and supervised i-vector modeling for speaker age regression PG Shivakumar, M Li, V Dhandhania, SS Narayanan 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 22 | 2014 |
Multimodal Fusion of Multirate Acoustic, Prosodic, and Lexical Speaker Characteristics for Native Language Identification. PG Shivakumar, SN Chakravarthula, PG Georgiou INTERSPEECH, 2408-2412, 2016 | 14 | 2016 |
Paralinguistics-enhanced large language modeling of spoken dialogue GT Lin, PG Shivakumar, A Gandhe, CHH Yang, Y Gu, S Ghosh, A Stolcke, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 11 | 2024 |
Scaling laws for discriminative speech recognition rescoring models Y Gu, PG Shivakumar, J Kolehmainen, A Gandhe, A Rastrow, I Bulyko arXiv preprint arXiv:2306.15815, 2023 | 8 | 2023 |
Towards ASR robust spoken language understanding through in-context learning with word confusion networks K Everson, Y Gu, H Yang, PG Shivakumar, GT Lin, J Kolehmainen, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |
Distillation strategies for discriminative speech recognition rescoring PG Shivakumar, J Kolehmainen, Y Gu, A Gandhe, A Rastrow, I Bulyko arXiv preprint arXiv:2306.09452, 2023 | 4 | 2023 |
Incremental online spoken language understanding PG Shivakumar, N Kumar, P Georgiou, S Narayanan arXiv preprint arXiv:1910.10287, 2019 | 4 | 2019 |
Discriminative Speech Recognition Rescoring With Pre-Trained Language Models PG Shivakumar, J Kolehmainen, Y Gu, A Gandhe, A Rastrow, I Bulyko 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 3 | 2023 |
Personalization for bert-based discriminative speech recognition rescoring J Kolehmainen, Y Gu, A Gourav, PG Shivakumar, A Gandhe, A Rastrow, ... arXiv preprint arXiv:2307.06832, 2023 | 3 | 2023 |
Rnn based incremental online spoken language understanding PG Shivakumar, N Kumar, P Georgiou, S Narayanan 2021 IEEE Spoken Language Technology Workshop (SLT), 989-996, 2021 | 3 | 2021 |
Behavior gated language models PG Shivakumar, SY Tseng, P Georgiou, S Narayanan arXiv preprint arXiv:1909.00107, 2019 | 3 | 2019 |