Search Test Information Space

Found 10 bookmarks

Custom sorting

Data-Efficient Multimodal Fusion on a Single GPU

View PDF

#Machine Learning #Computer Vision #Multimodal #Paper #PDF

·arxiv.org·May 2, 2024

Data-Efficient Multimodal Fusion on a Single GPU

Gong, Y., Rouditchenko, A., Liu, A. H., Harwath, D., Karlinsky, L., Kuehne, H., & Glass, J. (2022). Contrastive audio-visual masked autoencoder. arXiv preprint arXiv:2210.07839.

#Machine Learning #Multimodal #Paper #PDF

·openreview.net·Jun 10, 2023

Gong, Y., Rouditchenko, A., Liu, A. H., Harwath, D., Karlinsky, L., Kuehne, H., & Glass, J. (2022). Contrastive audio-visual masked autoencoder. arXiv preprint arXiv:2210.07839.

Personalizing Stable Diffusion with Determined

#Machine Learning #Model #API #Multimodal

·determined.ai·Nov 1, 2022

Personalizing Stable Diffusion with Determined

What the new wave of machine learning libraries means for SEO, marketing

#SEO #Machine Learning #Multimodal #MUM #Large Language Models

·searchengineland.com·Oct 6, 2022

What the new wave of machine learning libraries means for SEO, marketing

An AI used medical notes to teach itself to spot disease on chest x-rays

#Multimodal #Medical #Training #Machine Learning

·technologyreview.com·Sep 15, 2022

An AI used medical notes to teach itself to spot disease on chest x-rays

Mapping Urban Trees Across North America with the Auto Arborist Dataset

#Forestry #Machine Learning #Multimodal #Google

·ai.googleblog.com·Jun 23, 2022

Mapping Urban Trees Across North America with the Auto Arborist Dataset

Google AI Introduces 'LIMoE': One Of The First Large-Scale Architecture That Processes Both Images And Text Using A Sparse Mixture Of Experts

#Machine Learning #Multimodal #Subject Matter Experts

·marktechpost.com·Jun 12, 2022

Google AI Introduces 'LIMoE': One Of The First Large-Scale Architecture That Processes Both Images And Text Using A Sparse Mixture Of Experts

Vision Language models: towards multi-modal deep learning | AI Summer

#Multimodal #Machine Learning #Large Language Models #Computer Vision #Natural Language Processing #Transformers #Attention

·theaisummer.com·Mar 4, 2022

Vision Language models: towards multi-modal deep learning | AI Summer

Google, Cambridge U & Alan Turing Institute Propose PolyViT: A Universal Transformer for Image, Video, and Audio Classification | Synced

#Machine Learning #Multimodal

·syncedreview.com·Dec 1, 2021

Google, Cambridge U & Alan Turing Institute Propose PolyViT: A Universal Transformer for Image, Video, and Audio Classification | Synced

Artificial intelligence that understands object relationships

#Machine Learning #Multimodal

·news.mit.edu·Nov 29, 2021

Artificial intelligence that understands object relationships