Dragonfly: A large vision-language model with multi-resolution zoom
GPT-4 Vision API + Puppeteer = Easy Web Scraping
In today's video I do some experimentation with the new GPT-4 Vision API and try to scrape information from web pages using it.GitHub: https://github.com/unc...
$1 Recognizer
2D OCR
Home | Infinigen
Infinigen is a procedural generator of 3D scenes, developed by Princeton Vision & Learning Lab. Infinigen is optimized for computer vision research and generates diverse high-quality 3D training data. Infinigen is based on Blender and is free and open-source (BSD 3-Clause License). Infinigen is being actively developed to expand its capabilities and coverage. Everyone is welcome to contribute.
albumentations-team/albumentations: Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125 -...
Reading thermometer temperatures over time from a video
[Natalie](https://www.instagram.com/natbat.art/) has been experimenting with using a microwave as a kiln for pottery, specifically for [Raku](https://en.wikipedia.org/wiki/Raku_ware).
She wanted to u
Genmo, your creative copilot
Glaze: Protecting Artists from Style Mimicry
ViperGPT: Visual Inference via Python Execution for Reasoning
ViperGPT: Visual Inference via Python Execution for Reasoning.