WenLan: Bridging vision and language by large-scale multi-modal pre-training | Y Huo, M Zhang, G Liu, H Lu, Y Gao, G Yang, J Wen, H Zhang, B Xu, ... | arXiv preprint arXiv:2103.06561, 2021 | 117 | 2021 |
Deep fusion network for image completion | X Hong, P Xiong, R Ji, H Fan | Proceedings of the 27th ACM international conference on multimedia, 2033-2042, 2019 | 112 | 2019 |
Attention-driven factor model for explainable personalized recommendation | J Chen, F Zhuang, X Hong, X Ao, X Xie, Q He | The 41st international ACM SIGIR conference on research & development in …, 2018 | 57 | 2018 |
Robust reinforcement learning with Wasserstein constraint | L Hou, L Pang, X Hong, Y Lan, Z Ma, D Yin | arXiv preprint arXiv:2006.00945, 2020 | 20 | 2020 |
Transformation Driven Visual Reasoning | X Hong, Y Lan, L Pang, J Guo, X Cheng | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021 …, 2020 | 17 | 2020 |
Visual Reasoning: From State to Transformation | X Hong, Y Lan, L Pang, J Guo, X Cheng | IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 1 | 2023 |
Visual Transformation Telling | X Hong, Y Lan, L Pang, J Guo, X Cheng | arXiv preprint arXiv:2305.01928, 2023 | | 2023 |
Transformation Driven Visual Reasoning-Supplementary Material | X Hong, Y Lan, L Pang, J Guo, X Cheng | | | |