AI/ML

AI/ML

2200 bookmarks
Custom sorting
LlamaOCR - Building your Own Private OCR System - YouTube
LlamaOCR - Building your Own Private OCR System - YouTube
The video demonstrates LlamaOCR, an OCR tool leveraging the Llama 3.2 visual model. It focuses on the tool's ability to convert images and scanned documents into structured Markdown, preserving the original formatting of elements like tables, lists, and spreadsheets. The video covers practical usage examples, offering tutorials and code snippets in both JavaScript and Python within a Colab environment. For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://twitter.com/Sam_Witteveen Colab: https://drp.li/WpdNm 🕵️ Interested in building LLM Agents? Fill out the form below Building LLM Agents Form: https://drp.li/dIMes ⏱️Time Stamps: 00:00 LlamaOCR Project 00:56 Demo Using their Site 02:43 Colab Demo 04:40 Together.AI Docs 06:06 Pricing 09:16 Python OCR Version 11:20 Thai OCR Project 16:30 Patreon
·youtube.com·
LlamaOCR - Building your Own Private OCR System - YouTube
NuExtract 1.5
NuExtract 1.5
Structured extraction - where an LLM helps turn unstructured text (or image content) into structured data - remains one of the most directly useful applications of LLMs. NuExtract is a …
·simonwillison.net·
NuExtract 1.5
Recraft V3
Recraft V3
Recraft are a generative AI design tool startup based out of London who released their v3 model a few weeks ago. It's currently sat at the top of the [Artificial …
·simonwillison.net·
Recraft V3
Revolutionize Your Notes with AI Magic! - YouTube
Revolutionize Your Notes with AI Magic! - YouTube
Discover how to supercharge your writing workflow with an AI assistant that helps you craft better content faster - without spending a dime! In this comprehensive tutorial, I'll show you how to set up a powerful local AI writing assistant using Obsidian and Ollama that works alongside you as you write, offering suggestions and helping you maintain creative momentum. ✨ What you'll learn: • How to set up a free AI writing assistant • Complete Obsidian + Ollama configuration guide • Tips for optimizing AI suggestions • Real-world writing workflow demonstration Whether you're a content creator, writer, or just someone looking to enhance their writing process, this setup will revolutionize how you work. See how I use this system to write my own scripts and learn how you can implement it too! #AIWriting #ContentCreation #Productivity #ObsidianMD #techtutorial Here is the User Prompt I use: {{#context}}Context:\n\n{{context}}\n\n=================================\n{{/context}} The following text has been written by the user. You will continue writing the next few words of the text as if you were the original writer. Do not begin the text with '...' and don't summarize the text. {{last_line}} My Links 🔗 👉🏻 Subscribe (free): https://www.youtube.com/technovangelist 👉🏻 Join and Support: https://www.youtube.com/channel/UCHaF9kM2wn8C3CLRwLkC2GQ/join 👉🏻 Newsletter: https://technovangelist.substack.com/subscribe 👉🏻 Twitter: https://www.twitter.com/technovangelist 👉🏻 Discord: https://discord.gg/uS4gJMCRH2 👉🏻 Patreon: https://patreon.com/technovangelist 👉🏻 Instagram: https://www.instagram.com/technovangelist/ 👉🏻 Threads: https://www.threads.net/@technovangelist?xmt=AQGzoMzVWwEq8qrkEGV8xEpbZ1FIcTl8Dhx9VpF1bkSBQp4 👉🏻 LinkedIn: https://www.linkedin.com/in/technovangelist/ 👉🏻 All Source Code: https://github.com/technovangelist/videoprojects Want to sponsor this channel? Let me know what your plans are here: https://www.technovangelist.com/sponsor
·youtube.com·
Revolutionize Your Notes with AI Magic! - YouTube
Ollama: Llama 3.2 Vision
Ollama: Llama 3.2 Vision
Ollama released version 0.4 [last week](https://github.com/ollama/ollama/releases/tag/v0.4.0) with support for Meta's first Llama vision model, [Llama 3.2](https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/). If you have Ollama installed you can fetch the 11B model (7.9 GB) like …
·simonwillison.net·
Ollama: Llama 3.2 Vision
GitHub Next | GitHub Spark
GitHub Next | GitHub Spark
GitHub Next Project: Can we enable anyone to create or adapt software for themselves, using AI and a fully-managed runtime?
·githubnext.com·
GitHub Next | GitHub Spark
Painless Data Extraction and Web Automation
Painless Data Extraction and Web Automation
Start building AI agents using natural language queries for precise web and app automation. Scrape web data with ease without worrying about complexities of the modern Web
·agentql.com·
Painless Data Extraction and Web Automation
ChainForge
ChainForge
I'm still on the hunt for good options for running evaluations against prompts. ChainForge offers an interesting approach, calling itself "an open-source visual programming environment for prompt engineering". The interface …
·simonwillison.net·
ChainForge
yet-another-applied-llm-benchmark
yet-another-applied-llm-benchmark
Nicholas Carlini introduced this personal LLM benchmark suite [back in February](https://nicholas.carlini.com/writing/2024/my-benchmark-for-large-language-models.html) as a collection of over 100 automated tests he runs against new LLM models to evaluate their performance against …
·simonwillison.net·
yet-another-applied-llm-benchmark
Docling
Docling
MIT licensed document extraction Python library from the Deep Search team at IBM, who released [Docling v2](https://ds4sd.github.io/docling/v2/#changes-in-docling-v2) on October 16th. Here's the [Docling Technical Report](https://arxiv.org/abs/2408.09869) paper from August, which provides …
·simonwillison.net·
Docling
Our Story - OmniBridge
Our Story - OmniBridge
Find out how it all started and connect with our vision BEGINNINGS The idea for OmniBridge was seeded in 2014 when co-founder Adam Munder, a profoundly deaf software engineer at Intel, began developing a system to track information informally passed between fellow engineers. A few years later, Adam began working with a small team of […]
·omnibridge.ai·
Our Story - OmniBridge
Infinite AI Artboard - Recraft
Infinite AI Artboard - Recraft
Premium image generation and editing tool. Store and share your own styles, create, fine-tune, upscale, and perfect your visuals.
·recraft.ai·
Infinite AI Artboard - Recraft
microsoft/OmniParser · Hugging Face
microsoft/OmniParser · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
·huggingface.co·
microsoft/OmniParser · Hugging Face