Search Test Information Space

Found 152 bookmarks

Custom sorting

Microsoft reveals two in-house AI models: MAI-Voice-1 and MAI-1-preview

#Multimodal #Voice #Microsoft

·neowin.net·Aug 29, 2025

Microsoft reveals two in-house AI models: MAI-Voice-1 and MAI-1-preview

Capabilities of GPT-5 on Multimodal Medical Reasoning

#Multimodal #Medical #Reasoning #GPT-5 #Paper #PDF

·arxiv.org·Aug 17, 2025

Capabilities of GPT-5 on Multimodal Medical Reasoning

Scaling Language-Free Visual Representation Learning

View PDF

#Computer Vision #Paper #PDF #Self-Supervised Learning #Questions and Answers #Multimodal

·arxiv.org·Jun 15, 2025

Scaling Language-Free Visual Representation Learning

Multimodal Large Language Models: A Survey

View PDF

#Multimodal #Large Language Models #Survey #Architecture #Transformers #Diffusion #Paper #PDF

·arxiv.org·Jun 14, 2025

Multimodal Large Language Models: A Survey

MMaDA: Multimodal Large Diffusion Language Models

#Multimodal #Diffusion #Large Language Models #Paper #PDF

·arxiv.org·May 22, 2025

MMaDA: Multimodal Large Diffusion Language Models

UniVG-R1: Reasoning Guided Universal Visual Grounding with...

#Reasoning #Reinforcement Learning #Large Language Models #Multimodal #Paper #PDF

·arxiv.org·May 22, 2025

UniVG-R1: Reasoning Guided Universal Visual Grounding with...

AMIE gains vision: A research AI agent for multimodal diagnostic dialogue

work

#Medical #Diagnostics #Multimodal #Dialogue #Google #Research #Paper #PDF #Gemini

·research.google·May 2, 2025

AMIE gains vision: A research AI agent for multimodal diagnostic dialogue

Introducing Embed 4: Multimodal search for business

#Cohere #Multimodal #Search

·cohere.com·Apr 20, 2025

Introducing Embed 4: Multimodal search for business

Bringing multimodal search to AI Mode

#Multimodal #Search #Google #Blog

·blog.google·Apr 7, 2025

Bringing multimodal search to AI Mode

Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks

#Multimodal #Large Language Models #Allen Institute

·arxiv.org·Jun 26, 2022

Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks

Scoop: Meta won't offer future multimodal AI models in EU

#Meta #Europe #Multimodal

·axios.com·Jul 18, 2024

Scoop: Meta won't offer future multimodal AI models in EU

This Advanced Kind Of AI Could Be The Secret To AI Assistants

#Multimodal #AI

·youtube.com·Jun 1, 2024

This Advanced Kind Of AI Could Be The Secret To AI Assistants

From Baby Talk to Baby A.I.

#Linguistics #AI #Multimodal

·nytimes.com·May 5, 2024

From Baby Talk to Baby A.I.

Data-Efficient Multimodal Fusion on a Single GPU

View PDF

#Machine Learning #Computer Vision #Multimodal #Paper #PDF

·arxiv.org·May 2, 2024

Data-Efficient Multimodal Fusion on a Single GPU

Key Consciousness Connections Uncovered - Neuroscience News

(Maybe there is a "collectome" that researchers have in common.)

#Consciousness #Neuroscience #MRI #Multimodal #Connectome

·neurosciencenews.com·May 2, 2024

Key Consciousness Connections Uncovered - Neuroscience News

The Ray-Ban Meta Smart Glasses have multimodal AI now

#Multimodal #AI #Glasses #Meta

·theverge.com·Apr 24, 2024

The Ray-Ban Meta Smart Glasses have multimodal AI now

Google’s Gemini 1.5 Pro can now hear

#Gemini #Audio #Multimodal

·theverge.com·Apr 9, 2024

Google’s Gemini 1.5 Pro can now hear

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

#Large Language Models #Multimodal #Apple #Paper #PDF

·arxiv.org·Mar 17, 2024

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Apple Announces MM1: A Family of Multimodal LLMs Up To 30B Parameters that are SoTA in Pre-Training Metrics and Perform Competitively after Fine-Tuning

#Apple #Large Language Models #Multimodal

·marktechpost.com·Mar 17, 2024

Apple Announces MM1: A Family of Multimodal LLMs Up To 30B Parameters that are SoTA in Pre-Training Metrics and Perform Competitively after Fine-Tuning

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Download PDF

#Apple #Multimodal #Editing #Paper #PDF #Opensource

·arxiv.org·Feb 7, 2024

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Ferret: Refer and Ground Anything Anywhere at Any Granularity

#Apple #Large Language Models #Multimodal #Paper #PDF #Opensource

·arxiv.org·Dec 26, 2023

Ferret: Refer and Ground Anything Anywhere at Any Granularity

StyleDrop: Text-to-image generation in any style

#Style #Multimodal #Google

·blog.research.google·Dec 15, 2023

StyleDrop: Text-to-image generation in any style

Hands-on with Gemini: Interacting with multimodal AI

#Gemini #Multimodal

·youtube.com·Dec 6, 2023

Hands-on with Gemini: Interacting with multimodal AI

Google DeepMind's Demis Hassabis Says Gemini Is a New Breed of AI

#Gemini #Multimodal #DeepMind

·wired.com·Dec 6, 2023

Google DeepMind's Demis Hassabis Says Gemini Is a New Breed of AI

Scaling multimodal understanding to long videos

#Multimodal #Model #Google #Research

·blog.research.google·Nov 15, 2023

Scaling multimodal understanding to long videos

New models and developer products announced at DevDay

#OpenAI #Multimodal #Large Language Models #Development #Conference

·openai.com·Nov 6, 2023

New models and developer products announced at DevDay

Multimodal AI become accessible: new model runs on your laptop

announced

#Multimodal #Edge Computing #Obsidian

·readwrite.com·Nov 1, 2023

Multimodal AI become accessible: new model runs on your laptop

LLaVA

#Large Language Models #LLaVA #Multimodal

·llava-vl.github.io·Oct 7, 2023

ChatGPT’s New Upgrade Teases AI’s Multimodal Future - IEEE Spectrum

#ChatGPT #Multimodal

·spectrum.ieee.org·Oct 2, 2023

ChatGPT’s New Upgrade Teases AI’s Multimodal Future - IEEE Spectrum

NExT-GPT: Any-to-Any Multimodal LLM

#Large Language Models #Multimodal #Paper #PDF

·arxiv.org·Sep 27, 2023

NExT-GPT: Any-to-Any Multimodal LLM