Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques | Journal of Artificial Intelligence Research#Datasets#Image Recognition#Large Language Models·jair.org·Feb 19, 2022Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques | Journal of Artificial Intelligence Research