Guiding Instruction-based Image Editing via Multimodal Large Language ModelsDownload PDF#Apple#Multimodal#Editing#Paper#PDF#Opensource·arxiv.org·Feb 7, 2024Guiding Instruction-based Image Editing via Multimodal Large Language Models
Ferret: Refer and Ground Anything Anywhere at Any Granularity#Apple#Large Language Models#Multimodal#Paper#PDF#Opensource·arxiv.org·Dec 26, 2023Ferret: Refer and Ground Anything Anywhere at Any Granularity