Automatic Creative Selection with Cross-Modal Matching · #Search #Computer Vision #Apple #Paper #PDF · arxiv.org · May 2, 2024
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework · #Large Language Models #Opensource #Apple #Paper #PDF · arxiv.org · Apr 24, 2024
ReALM: Reference Resolution As Language Modeling · #Large Language Models #Paper #PDF #Apple · arxiv.org · Apr 1, 2024
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training · #Large Language Models #Multimodal #Apple #Paper #PDF · arxiv.org · Mar 17, 2024
Guiding Instruction-based Image Editing via Multimodal Large Language Models · #Apple #Multimodal #Editing #Paper #PDF #Opensource · arxiv.org · Feb 7, 2024
Ferret: Refer and Ground Anything Anywhere at Any Granularity · #Apple #Large Language Models #Multimodal #Paper #PDF #Opensource · arxiv.org · Dec 26, 2023
LLM in a flash: Efficient Large Language Model Inference with Limited Memory · #Apple #Edge Computing #Large Language Models #Paper #PDF · arxiv.org · Dec 22, 2023