Referring expression comprehension: A survey of methods and datasets Y Qiao, C Deng, Q Wu IEEE Transactions on Multimedia 23, 4426-4440, 2020 | 78 | 2020 |
HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation Y Qiao, Y Qi, Y Hong, Z Yu, P Wang, Q Wu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 56 | 2022 |
Improving visual question answering using dropout and enhanced question encoder Z Fang, J Liu, Y Li, Y Qiao, H Lu Pattern Recognition 90, 404-414, 2019 | 33 | 2019 |
HOP+: History-enhanced and Order-aware Pre-training for Vision-and-Language Navigation Y Qiao, Y Qi, Y Hong, Z Yu, P Wang, Q Wu IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 24 | 2023 |
R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks Y Qiao, Q Chen, C Deng, N Ding, Y Qi, M Tan, X Ren, Q Wu Proceedings of the 29th ACM International Conference on Multimedia, 2085-2093, 2021 | 17 | 2021 |
March in Chat: Interactive Prompting for Remote Embodied Referring Expression Y Qiao, Y Qi, Z Yu, J Liu, Q Wu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 9 | 2023 |
Rankvqa: Answer re-ranking for visual question answering Y Qiao, Z Yu, J Liu 2020 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2020 | 9 | 2020 |
VL-Mamba: Exploring State Space Models for Multimodal Learning Y Qiao, Z Yu, L Guo, S Chen, Z Zhao, M Sun, Q Wu, J Liu arXiv preprint arXiv:2403.13600, 2024 | 8 | 2024 |
VC-VQA: visual calibration mechanism for visual question answering Y Qiao, Z Yu, J Liu 2020 IEEE International Conference on Image Processing (ICIP), 1481-1485, 2020 | 7 | 2020 |
Enhancing visual question answering using dropout Z Fang, J Liu, Y Qiao, Q Tang, Y Li, H Lu Proceedings of the 26th ACM international conference on Multimedia, 1002-1010, 2018 | 4 | 2018 |
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation Y Qiao, Z Yu, Q Wu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 3 | 2023 |
Multi-modal Adapter for Medical Vision-and-Language Learning Z Yu, Y Qiao, Y Xie, Q Wu International Workshop on Machine Learning in Medical Imaging, 393-402, 2023 | 1 | 2023 |
Improving Online Source-free Domain Adaptation for Object Detection by Unsupervised Data Acquisition X Shi, Y Qiao, Q Wu, L Liu, F Dayoub arXiv preprint arXiv:2310.19258, 2023 | | 2023 |
General Vision and Language Methods in Real Applications: A Focus on Vision-and-Language Navigation Y Qiao | | 2023 |