Guiding Instruction-based Image Editing via Multimodal Large Language ModelsDownload PDF#Apple#Multimodal#Editing#Paper#PDF#Opensource·arxiv.org·Feb 7, 2024Guiding Instruction-based Image Editing via Multimodal Large Language Models
Ferret: Refer and Ground Anything Anywhere at Any Granularity#Apple#Large Language Models#Multimodal#Paper#PDF#Opensource·arxiv.org·Dec 26, 2023Ferret: Refer and Ground Anything Anywhere at Any Granularity
DeepFloyd IF — DeepFloyd#Text-to-Image#Multimodal#Opensource·deepfloyd.ai·Apr 28, 2023DeepFloyd IF — DeepFloyd
Replicate – Run open-source machine learning models with a cloud API#Multimodal#API#Opensource#Model#Cloud Computing·replicate.com·Sep 12, 2022Replicate – Run open-source machine learning models with a cloud API