Follow
Aviv Tamar
Title
Cited by
Cited by
Year
Multi-agent actor-critic for mixed cooperative-competitive environments
R Lowe, YI Wu, A Tamar, J Harb, OAI Pieter Abbeel, I Mordatch
Advances in neural information processing systems 30, 2017
54942017
Constrained policy optimization
J Achiam, D Held, A Tamar, P Abbeel
International conference on machine learning, 22-31, 2017
16052017
Value iteration networks
A Tamar, Y Wu, G Thomas, S Levine, P Abbeel
Advances in neural information processing systems 29, 2016
7772016
Bayesian reinforcement learning: A survey
M Ghavamzadeh, S Mannor, J Pineau, A Tamar
Foundations and Trends® in Machine Learning 8 (5-6), 359-483, 2015
5772015
Model-ensemble trust-region policy optimization
T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel
arXiv preprint arXiv:1802.10592, 2018
5542018
Risk-sensitive and robust decision-making: a cvar optimization approach
Y Chow, A Tamar, S Mannor, M Pavone
Advances in neural information processing systems 28, 2015
3922015
A deep reinforcement learning perspective on internet congestion control
N Jay, N Rotman, B Godfrey, M Schapira, A Tamar
International Conference on Machine Learning, 3050-3059, 2019
3702019
Policy gradients with variance related risk criteria
A Tamar, D Di Castro, S Mannor
Proceedings of the twenty-ninth international conference on machine learning …, 2012
3552012
Learning to route
A Valadarsky, M Schapira, D Shahaf, A Tamar
Proceedings of the 16th ACM workshop on hot topics in networks, 185-191, 2017
2702017
Variance adjusted actor critic algorithms
A Tamar, S Mannor
arXiv preprint arXiv:1310.3697, 2013
2472013
Reinforcement learning on variable impedance controller for high-precision robotic assembly
J Luo, E Solowjow, C Wen, JA Ojea, AM Agogino, A Tamar, P Abbeel
2019 International Conference on Robotics and Automation (ICRA), 3080-3087, 2019
2222019
Optimizing the CVaR via sampling
A Tamar, Y Glassner, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015
2192015
Learning plannable representations with causal infogan
T Kurutach, A Tamar, G Yang, SJ Russell, P Abbeel
Advances in Neural Information Processing Systems 31, 2018
2112018
Policy gradients beyond expectations: Conditional value-at-risk
A Tamar, Y Glassner, S Mannor
AAAI, 2015
1962015
Learning robotic assembly from cad
G Thomas, M Chien, A Tamar, JA Ojea, P Abbeel
2018 IEEE International Conference on Robotics and Automation (ICRA), 3524-3531, 2018
1902018
Scaling up robust MDPs using function approximation
A Tamar, S Mannor, H Xu
International conference on machine learning, 181-189, 2014
1592014
Learning robotic manipulation through visual planning and acting
A Wang, T Kurutach, K Liu, P Abbeel, A Tamar
arXiv preprint arXiv:1905.04411, 2019
1582019
Policy gradient for coherent risk measures
A Tamar, Y Chow, M Ghavamzadeh, S Mannor
Advances in neural information processing systems 28, 2015
1502015
Learning generalized reactive policies using deep neural networks
E Groshev, M Goldstein, A Tamar, S Srivastava, P Abbeel
Proceedings of the International Conference on Automated Planning and …, 2018
1402018
Deep residual flow for out of distribution detection
E Zisselman, A Tamar
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
1222020
The system can't perform the operation now. Try again later.
Articles 1–20