Return to Article Details Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques Download Download PDF