Macaw-llm: Multi-modal language modeling with image, audio, video, and text integration C Lyu, M Wu, L Wang, X Huang, B Liu, Z Du, S Shi, Z Tu arXiv preprint arXiv:2306.09093, 2023 | 74 | 2023 |
On the cultural gap in text-to-image generation B Liu, L Wang, C Lyu, Y Zhang, J Su, S Shi, Z Tu arXiv preprint arXiv:2307.02971, 2023 | 4 | 2023 |