Document Parsers

12 bookmarks

Custom sorting

Deepseek ocr

#OCR #Rust

·libhunt.com·Nov 13, 2025

Deepseek ocr

Urn:li:ugc post:7351284834956185600

#Document Parsing

·linkedin.com·Jul 22, 2025

Urn:li:ugc post:7351284834956185600

GitHub - AdemBoukhris457/Docs_Parsing_Techniques: Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...)

Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...) - AdemBoukhris457/Docs_Parsing_Techniques

#Document Parsing #Sample-Code #OCR

·github.com·Jul 13, 2025

GitHub - AdemBoukhris457/Docs_Parsing_Techniques: Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...)

Jerry Liu (@jerryjliu0) on X

Here’s how to build an AI agent that auto-generates a company risk report over dozens of public filings 📈📉 Batch analyzing a ton of documents and writing up a memo would take 20+ hours of work. Agents have the potential to automate this but they completely fall apart without

#AI Agents

·x.com·Apr 19, 2025

Jerry Liu (@jerryjliu0) on X

Transformation Agent | Weaviate

This Weaviate Agent is in technical preview.

#LLM #agentic #AI Agents #RAG #Document Parsing #Document Understanding

·weaviate.io·Mar 12, 2025

Transformation Agent | Weaviate

From PDFs to Insights: Structured Outputs from PDFs with Gemini 2.0

Learn how to extract structured data from PDFs with Gemini 2.0 and Pydantic.

#Document Parsing #Document Understanding #LLM

·philschmid.de·Feb 13, 2025

From PDFs to Insights: Structured Outputs from PDFs with Gemini 2.0

GitHub - getomni-ai/zerox: PDF to Markdown with vision models

PDF to Markdown with vision models.

#Document Parsing #Document Understanding #PDF Parsing

·github.com·Feb 3, 2025

GitHub - getomni-ai/zerox: PDF to Markdown with vision models

Qwen2.5-VL/cookbooks at main · QwenLM/Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. - QwenLM/Qwen2.5-VL

#LLM #VLM #Document Parsing #Document Understanding #Cookbook

·github.com·Jan 31, 2025

Qwen2.5-VL/cookbooks at main · QwenLM/Qwen2.5-VL

GitHub - X-PLUG/mPLUG-DocOwl: mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding - X-PLUG/mPLUG-DocOwl

·github.com·Jan 29, 2025

GitHub - X-PLUG/mPLUG-DocOwl: mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

We now support VLMs in smolagents!

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

·huggingface.co·Jan 25, 2025

We now support VLMs in smolagents!

EyeLevel | RAG on-Prem

EyeLevel.ai's GroundX APIs are the fastest way to build enterprise-grade RAG on prem or cloud. Trusted by Air France, Dartmouth, UltraCommerce and hundreds more.

#Document Parsing #Document Understanding

·eyelevel.ai·Jan 23, 2025

EyeLevel | RAG on-Prem

Interactive LLM-Powered Data Processing with DocWrangler

DocWrangler is an IDE that provides instant feedback, visual exploration tools, and AI assistance for building and iterating on LLM-powered data processing pipelines

#Document Understanding #Document Parsing

·data-people-group.github.io·Jan 21, 2025

Interactive LLM-Powered Data Processing with DocWrangler