TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models - Microsoft Research
Text recognition is a long-standing research problem for document digitalization. Existing approaches for text recognition are usually built based on CNN for image understanding and RNN for char-level text generation. In addition, another language model is usually needed to improve the overall accuracy as a post-processing step. In this paper, we propose an end-to-end text […]
A big issue with trying to improve your RAG pipeline is that advanced techniques can require a ton of setup time.We spent this past weekend packaging 7+ advanced techniques in @llama_index, so that you can use all of them through a standardized interface - simply load in your… https://t.co/Pave05z3Ld pic.twitter.com/FnanyoQTfC— Jerry Liu (@jerryjliu0) November 28, 2023
How *YOU* can - and should - build great multimodal AI apps that go viral and scale to millions in a weekend. Featuring the Vercel AI SDK and the new v0.dev AI frontend tool.
Recorded live in San Francisco at the AI Engineer Summit 2023. See the full schedule of talks at https://ai.engineer/summit/schedule & join us at the AI Engineer World's Fair in 2024! Get your tickets today at https://ai.engineer/worlds-fair
About Hassan
Creator of RoomGPT
Secure Your RAG App Against Prompt Injection Attacks
Don't skip securing your RAG app like you skip leg day at the gym! Here's what Prompt Injection is, how it works, and what you can do to secure your LLM-powered application.
Introduction to Augmenting LLMs with Private Data using LlamaIndex
In this post, we're going to take a top-level overview of how LlamaIndex bridges AI and private custom data from multiple sources (APIs, PDF, and more), enabling powerful applications.
Use LangChain, Deepgram, and Mistral 7B to Build a Youtube Video Summarization App - Koyeb
This guide explains how to build a YouTube video summarization using Langchain, Deepgram, and Mistral 7B. Deploy your AI workload on Koyeb to enjoy high-performance microVMs, seamless scaling, and fast global deployments.
Thanks for signing up to The Rundown built by @therundownai and @rowancheung! Enjoy our free Advanced ChatGPT Guide as a warm welcome into the world of AI!
Andrej Baranovskij on X: "Running Starling-7B LLM model on local CPU with @Ollama_ai and getting great results for invoice data extraction, even better than Zephyr, Mistral or Llama2. Prompt: retrieve gross worth value for each invoice item from the table. format response as following {\"gross_worth\":… https://t.co/QPPPrV27JU" / X
Running Starling-7B LLM model on local CPU with @Ollama_ai and getting great results for invoice data extraction, even better than Zephyr, Mistral or Llama2.Prompt: retrieve gross worth value for each invoice item from the table. format response as following {\"gross_worth\":… pic.twitter.com/QPPPrV27JU— Andrej Baranovskij (@andrejusb) November 30, 2023
In this third video of our series on Llama-index, we will explore how to use different vector stores in llama-index while building RAG applications. We will ...
Prompt Engineering for Developers: How AI Can Help With Architecture Decisions
Learn effective prompt engineering techniques for Large Language Models (LLMs) based on generative AI, like ChatGPT, to facilitate software architecture decisions.
Step by step guide to learn Prompt Engineering. We also have resources and short descriptions attached to the roadmap items so you can get everything you want to learn in one place.