LlamaOCR - Building your Own Private OCR System - YouTube
The video demonstrates LlamaOCR, an OCR tool leveraging the Llama 3.2 visual model. It focuses on the tool's ability to convert images and scanned documents into structured Markdown, preserving the original formatting of elements like tables, lists, and spreadsheets. The video covers practical usage examples, offering tutorials and code snippets in both JavaScript and Python within a Colab environment.
For more tutorials on using LLMs and building agents, check out my Patreon
Patreon: https://www.patreon.com/SamWitteveen
Twitter: https://twitter.com/Sam_Witteveen
Colab: https://drp.li/WpdNm
🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes
⏱️Time Stamps:
00:00 LlamaOCR Project
00:56 Demo Using their Site
02:43 Colab Demo
04:40 Together.AI Docs
06:06 Pricing
09:16 Python OCR Version
11:20 Thai OCR Project
16:30 Patreon
Structured extraction - where an LLM helps turn unstructured text (or image content) into structured data - remains one of the most directly useful applications of LLMs. NuExtract is a …
Recraft are a generative AI design tool startup based out of London who released their v3 model a few weeks ago. It's currently sat at the top of the [Artificial …
The Beginner's Guide to Visual Prompt Injections: Invisibility Cloaks, Cannibalistic Adverts, and Robot Women | Lakera – Protecting AI teams that disrupt the world.
Learn about visual prompt injections, their appearance, and top defense strategies against these attacks.
Discover how to supercharge your writing workflow with an AI assistant that helps you craft better content faster - without spending a dime! In this comprehensive tutorial, I'll show you how to set up a powerful local AI writing assistant using Obsidian and Ollama that works alongside you as you write, offering suggestions and helping you maintain creative momentum.
✨ What you'll learn:
• How to set up a free AI writing assistant
• Complete Obsidian + Ollama configuration guide
• Tips for optimizing AI suggestions
• Real-world writing workflow demonstration
Whether you're a content creator, writer, or just someone looking to enhance their writing process, this setup will revolutionize how you work. See how I use this system to write my own scripts and learn how you can implement it too!
#AIWriting #ContentCreation #Productivity #ObsidianMD #techtutorial
Here is the User Prompt I use:
{{#context}}Context:\n\n{{context}}\n\n=================================\n{{/context}}
The following text has been written by the user. You will continue writing the next few words of the text as if you were the original writer. Do not begin the text with '...' and don't summarize the text.
{{last_line}}
My Links 🔗
👉🏻 Subscribe (free): https://www.youtube.com/technovangelist
👉🏻 Join and Support: https://www.youtube.com/channel/UCHaF9kM2wn8C3CLRwLkC2GQ/join
👉🏻 Newsletter: https://technovangelist.substack.com/subscribe
👉🏻 Twitter: https://www.twitter.com/technovangelist
👉🏻 Discord: https://discord.gg/uS4gJMCRH2
👉🏻 Patreon: https://patreon.com/technovangelist
👉🏻 Instagram: https://www.instagram.com/technovangelist/
👉🏻 Threads: https://www.threads.net/@technovangelist?xmt=AQGzoMzVWwEq8qrkEGV8xEpbZ1FIcTl8Dhx9VpF1bkSBQp4
👉🏻 LinkedIn: https://www.linkedin.com/in/technovangelist/
👉🏻 All Source Code: https://github.com/technovangelist/videoprojects
Want to sponsor this channel? Let me know what your plans are here: https://www.technovangelist.com/sponsor
Ollama released version 0.4 [last week](https://github.com/ollama/ollama/releases/tag/v0.4.0) with support for Meta's first Llama vision model, [Llama 3.2](https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/). If you have Ollama installed you can fetch the 11B model (7.9 GB) like …
Qwen2.5-Coder-32B is an LLM that can code well that runs on my Mac
There’s a whole lot of buzz around the new Qwen2.5-Coder Series of open source (Apache 2.0 licensed) LLM releases from Alibaba’s Qwen research team. On first impression it looks like …
Everything I’ve learned so far about running local LLMs
Chris Wellons shares detailed notes on his experience running local LLMs on Windows - though most of these tips apply to other operating systems as well. This is great, there's …
Start building AI agents using natural language queries for precise web and app automation. Scrape web data with ease without worrying about complexities of the modern Web
I'm still on the hunt for good options for running evaluations against prompts. ChainForge offers an interesting approach, calling itself "an open-source visual programming environment for prompt engineering". The interface …
Nicholas Carlini introduced this personal LLM benchmark suite [back in February](https://nicholas.carlini.com/writing/2024/my-benchmark-for-large-language-models.html) as a collection of over 100 automated tests he runs against new LLM models to evaluate their performance against …
MIT licensed document extraction Python library from the Deep Search team at IBM, who released [Docling v2](https://ds4sd.github.io/docling/v2/#changes-in-docling-v2) on October 16th. Here's the [Docling Technical Report](https://arxiv.org/abs/2408.09869) paper from August, which provides …
Creating a LLM-as-a-Judge that drives business results
Hamel Husain's sequel to [Your AI product needs evals](https://hamel.dev/blog/posts/evals/). This is _packed_ with hard-won actionable advice. Hamel warns against using scores on a 1-5 scale, instead promoting an alternative he …
Find out how it all started and connect with our vision BEGINNINGS The idea for OmniBridge was seeded in 2014 when co-founder Adam Munder, a profoundly deaf software engineer at Intel, began developing a system to track information informally passed between fellow engineers. A few years later, Adam began working with a small team of […]
In TED Talk, Deaf engineer debuts AI model that transcribes sign language to text in seconds
Adam Munder is a software engineer. Since 2015, he’s been working to bridge the gap between sign language and spoken word. Now, a decade later, he brought it to the TED stage.