Data-Efficient Multimodal Fusion on a Single GPUView PDF#Machine Learning#Computer Vision#Multimodal#Paper#PDF·arxiv.org·May 2, 2024Data-Efficient Multimodal Fusion on a Single GPU
Gong, Y., Rouditchenko, A., Liu, A. H., Harwath, D., Karlinsky, L., Kuehne, H., & Glass, J. (2022). Contrastive audio-visual masked autoencoder. arXiv preprint arXiv:2210.07839.#Machine Learning#Multimodal#Paper#PDF·openreview.net·Jun 10, 2023Gong, Y., Rouditchenko, A., Liu, A. H., Harwath, D., Karlinsky, L., Kuehne, H., & Glass, J. (2022). Contrastive audio-visual masked autoencoder. arXiv preprint arXiv:2210.07839.