
Found 13 bookmarks


AI-Powered Content Audits for Local News

How to responsibly use AI to help with understanding your coverage

#text sanitization #text #safety

·generative-ai-newsroom.com·Nov 19, 2024


GitHub - DocumindHQ/documind: Open-source platform for extracting structured data from documents using AI.

Open-source platform for extracting structured data from documents using AI.

#text sanitization #text #data science

·github.com·Nov 19, 2024


Docling

MIT licensed document extraction Python library from the Deep Search team at IBM, who released [Docling v2](https://ds4sd.github.io/docling/v2/#changes-in-docling-v2) on October 16th. Here's the [Docling Technical Report](https://arxiv.org/abs/2408.09869) paper from August, which provides …

#text sanitization #text #agent

·simonwillison.net·Nov 3, 2024


Jaided AI - Distribute the benefits of AI to the world

#OCR #text #image

·jaided.ai·Jun 19, 2023


GitHub - JaidedAI/EasyOCR: Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

#text #OCR #image

·github.com·Jun 19, 2023


socketteer/loom: Multiversal tree writing interface for human-AI collaboration

Multiversal tree writing interface for human-AI collaboration.

#text #writing #creativity #art #story

·github.com·May 6, 2023


Langchain gpt-3.5-turbo models reads files - problem

I am making a really simple (and for fun) LangChain project. The model can read a PDF file, and I can then ask it questions about that specific PDF file. Everything works fine (this is a working example) from P...

#text

·stackoverflow.com·Apr 30, 2023


microsoft/table-transformer: Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev...

#text #text sanitization

·github.com·Apr 29, 2023

How to turn Text into Features

A comprehensive guide to using NLP for Machine Learning

#text

·towardsdatascience.com·Apr 14, 2023


Ask Your PDF

Your gateway to dynamic, interactive, and intelligent conversations with any PDF document

#text #tools

·askyourpdf.com·Apr 6, 2023


Ubisoft Proudly Announces 'AI' Is Helping Write Dialogue

Ubisoft Ghostwriter is described by the company as 'an AI tool'

#text #language #writing

·kotaku.com·Mar 22, 2023


NLP+CSS 201 Tutorials

Tutorials for advanced natural language processing methods designed for computational social science research.

#nlp #text

·nlp-css-201-tutorials.github.io·Mar 9, 2023


NER Powered Semantic Search in Python

Semantic search is a compelling technology that lets us search using abstract concepts and meaning rather than relying on specific words. However, sometimes a simple keyword search can be just as valuable, especially if we know the exact wording of what we're searching for. Pinecone allows you to pair semantic search with a basic keyword filter. If you know that the document you're looking for contains a specific word or set of words, you simply tell Pinecone to restrict the search to only include documents with those keywords. We even support functionality for keyword search using sets of words with AND, OR, NOT logic. In this video, we will explore these features through a start-to-finish example of basic keyword search in Pinecone.

Pinecone Docs Page: https://www.pinecone.io/docs/examples/metadata-filtered-search/

00:00 NER Powered Semantic Search
01:19 Dependencies and Hugging Face Datasets Prep
04:18 Creating NER Entities with Transformers
07:00 Creating Embeddings with Sentence Transformers
07:48 Using Pinecone Vector Database
11:33 Indexing the Full Medium Articles Dataset
15:09 Making Queries to Pinecone
17:01 Final Thoughts

#nlp #text

·youtube.com·Mar 9, 2023
