MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
#Large Language Models #Multimodal #Apple #Paper #PDF · arxiv.org · Mar 17, 2024

Guiding Instruction-based Image Editing via Multimodal Large Language Models
#Apple #Multimodal #Editing #Paper #PDF #Opensource · arxiv.org · Feb 7, 2024

Ferret: Refer and Ground Anything Anywhere at Any Granularity
#Apple #Large Language Models #Multimodal #Paper #PDF #Opensource · arxiv.org · Dec 26, 2023