Inference with Reference: Lossless Acceleration of Large Language Models#Inference#Microsoft#Paper#PDF#Large Language Models·arxiv.org·Apr 19, 2023Inference with Reference: Lossless Acceleration of Large Language Models