Variational image compression with a scale hyperprior J Ballé, D Minnen, S Singh, SJ Hwang, N Johnston arXiv preprint arXiv:1802.01436, 2018 | 1934 | 2018 |
Joint autoregressive and hierarchical priors for learned image compression D Minnen, J Ballé, GD Toderici Advances in neural information processing systems 31, 2018 | 1350 | 2018 |
Full resolution image compression with recurrent neural networks G Toderici, D Vincent, N Johnston, SJ Hwang, D Minnen, J Shor, ... Proceedings of the IEEE conference on Computer Vision and Pattern …, 2017 | 1049 | 2017 |
Variable rate image compression with recurrent neural networks G Toderici, SM O'Malley, SJ Hwang, D Vincent, D Minnen, S Baluja, ... arXiv preprint arXiv:1511.06085, 2015 | 757 | 2015 |
Improved lossy image compression with priming and spatially adaptive bit rates for recurrent networks N Johnston, D Vincent, D Minnen, M Covell, S Singh, T Chinen, ... Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 464 | 2018 |
Channel-wise autoregressive entropy models for learned image compression D Minnen, S Singh 2020 IEEE International Conference on Image Processing (ICIP), 3339-3343, 2020 | 383 | 2020 |
Scale-space flow for end-to-end optimized video compression E Agustsson, D Minnen, N Johnston, J Ballé, SJ Hwang, G Toderici Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 339 | 2020 |
Nonlinear transform coding J Ballé, PA Chou, D Minnen, S Singh, N Johnston, E Agustsson, ... IEEE Journal of Selected Topics in Signal Processing 15 (2), 339-353, 2020 | 220 | 2020 |
Discovering characteristic actions from on-body sensor data D Minnen, T Starner, I Essa, C Isbell 2006 10th IEEE international symposium on wearable computers, 11-18, 2006 | 213 | 2006 |
Propagation networks for recognition of partially ordered sequential action Y Shi, Y Huang, D Minnen, A Bobick, I Essa Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision …, 2004 | 201 | 2004 |
Expectation grammars: Leveraging high-level expectations for activity recognition D Minnen, I Essa, T Starner 2003 IEEE Computer Society Conference on Computer Vision and Pattern …, 2003 | 162 | 2003 |
VideoPoet: A large language model for zero-shot video generation D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, G Schindler, R Hornung, ... arXiv preprint arXiv:2312.14125, 2023 | 145 | 2023 |
Language Model Beats Diffusion--Tokenizer is Key to Visual Generation L Yu, J Lezama, NB Gundavarapu, L Versari, K Sohn, D Minnen, Y Cheng, ... arXiv preprint arXiv:2310.05737, 2023 | 145 | 2023 |
Discovering multivariate motifs using subsequence density estimation and greedy mixture learning D Minnen, CL Isbell, I Essa, T Starner Proceedings of the national conference on artificial intelligence 22 (1), 615, 2007 | 131 | 2007 |
The perceptive workbench: Computer-vision-based gesture tracking, object tracking, and 3D reconstruction for augmented desks T Starner, B Leibe, D Minnen, T Westyn, A Hurst, J Weeks Machine Vision and Applications 14, 59-71, 2003 | 121 | 2003 |
Detecting subdimensional motifs: An efficient algorithm for generalized multivariate pattern discovery D Minnen, C Isbell, I Essa, T Starner Seventh IEEE International Conference on Data Mining (ICDM 2007), 601-606, 2007 | 120 | 2007 |
Recognizing and discovering human actions from on-body sensor data D Minnen, T Starner, JA Ward, P Lukowicz, G Tröster 2005 IEEE International Conference on Multimedia and Expo, 1545-1548, 2005 | 115 | 2005 |
Performance metrics and evaluation issues for continuous activity recognition D Minnen, T Westeyn, T Starner, JA Ward, P Lukowicz Performance Metrics for Intelligent Systems 4, 141-148, 2006 | 99 | 2006 |
VCT: A video compression transformer F Mentzer, G Toderici, D Minnen, SJ Hwang, S Caelles, M Lucic, ... arXiv preprint arXiv:2206.07307, 2022 | 97 | 2022 |
Finite scalar quantization: VQ-VAE made simple F Mentzer, D Minnen, E Agustsson, M Tschannen arXiv preprint arXiv:2309.15505, 2023 | 88 | 2023 |