Recognize Anything: A Strong Image Tagging Model#Computer Vision#Image Recognition#Paper#PDF·arxiv.org·Jun 11, 2023Recognize Anything: A Strong Image Tagging Model
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection#Computer Vision#Baidu#Object Detection#Paper#PDF·arxiv.org·Jun 7, 2023CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
Improving Factuality and Reasoning in Language Models through Multiagent Debate#Reasoning#Large Language Models#Machine Learning#Computer Vision#Paper#PDF·arxiv.org·May 30, 2023Improving Factuality and Reasoning in Language Models through Multiagent Debate
ORCa: Glossy Objects as Radiance Field Cameras#Computer Vision#Pattern Recognition#Sensor#Paper#PDF·arxiv.org·May 29, 2023ORCa: Glossy Objects as Radiance Field Cameras
F-VLM: Open-vocabulary object detection upon frozen vision and language models#Computer Vision#Google#Object Detection#Paper·ai.googleblog.com·May 12, 2023F-VLM: Open-vocabulary object detection upon frozen vision and language models
Random-Access Neural Compression of Material Textures#Graphics#Computer Vision#Games#Nvidia#Paper#PDF·research.nvidia.com·May 6, 2023Random-Access Neural Compression of Material Textures
Scaling Vision Transformers to 22 Billion Parameters#Transformers#Computer Vision#Paper#PDF#Google·arxiv.org·Apr 23, 2023Scaling Vision Transformers to 22 Billion Parameters
DINOv2: Learning Robust Visual Features without Supervision#Computer Vision#Meta#Paper#PDF·arxiv.org·Apr 18, 2023DINOv2: Learning Robust Visual Features without Supervision
Computer Vision Platform and AI Software Company | Landing AI#Computer Vision#Data Model#Platforms#Tools·landing.ai·Mar 30, 2023Computer Vision Platform and AI Software Company | Landing AI
Announcing the ICDAR 2023 Competition on Hierarchical Text Detection and Recognition#OCR#Contest#Computer Vision#Google#Image Recognition#Event·ai.googleblog.com·Mar 7, 2023Announcing the ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-based 3D Object Detection#Computer Vision#Baidu#Paper#PDF·arxiv.org·Mar 7, 2023StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-based 3D Object Detection
A Good Prompt Is Worth Millions of Parameters? Low-resource...#Prompt Engineering#Large Language Models#Computer Vision#PDF#Questions and Answers·arxiv.org·Dec 6, 2021A Good Prompt Is Worth Millions of Parameters? Low-resource...
Google Cloud Introduces Shelf Inventory AI Tool for Retailers#Computer Vision#AI#Inventory#Google·wsj.com·Jan 14, 2023Google Cloud Introduces Shelf Inventory AI Tool for Retailers
Contactless Blood Pressure Estimation System Using a Computer Vision System#Computer Vision#Biology·mdpi.com·Dec 7, 2022Contactless Blood Pressure Estimation System Using a Computer Vision System
Wang, W. and others. (2021). Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions.#Computer Vision#Transformers#CNN·openaccess.thecvf.com·Nov 27, 2022Wang, W. and others. (2021). Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions.
EDGE: Editable Dance Generation From Music#Computer Vision#Dance#Music·arxiv.org·Nov 25, 2022EDGE: Editable Dance Generation From Music
Metaspectral Raises $4.7 Million to Launch Fusion, a Cloud-Based AI Platform - SpaceRef#Hyperspectral Imagery#Computer Vision#Deep Learning·spaceref.com·Nov 25, 2022Metaspectral Raises $4.7 Million to Launch Fusion, a Cloud-Based AI Platform - SpaceRef
A simpler path to better computer vision#Computer Vision#Synthetic Data#Programming#Machine Learning·news.mit.edu·Nov 25, 2022A simpler path to better computer vision
Top Object Detection Algorithms and Libraries in Artificial Intelligence (AI)#Computer Vision#Object Detection#API#Libraries·marktechpost.com·Nov 21, 2022Top Object Detection Algorithms and Libraries in Artificial Intelligence (AI)
Multi-layered Mapping of Brain Tissue via Segmentation Guided Contrastive Learning#Brain Science#Computer Vision#Deep Learning#Self-Supervised Learning·ai.googleblog.com·Nov 11, 2022Multi-layered Mapping of Brain Tissue via Segmentation Guided Contrastive Learning
PI-ARS: Accelerating Evolution-Learned Visual-Locomotion with Predictive Information Representations#Reinforcement Learning#Robotics#Computer Vision#Blog#Google·ai.googleblog.com·Oct 21, 2022PI-ARS: Accelerating Evolution-Learned Visual-Locomotion with Predictive Information Representations
Google reveals what’s next for Cloud AI#Google#Cloud Computing#Computer Vision#Opensource#Machine Learning·venturebeat.com·Oct 12, 2022Google reveals what’s next for Cloud AI
Algorithms predict sports teams' moves with 80% accuracy#Computer Vision#Algorithms#Sports#Prediction·techxplore.com·Oct 6, 2022Algorithms predict sports teams' moves with 80% accuracy
New technique enables on-device training using less than a quarter of a megabyte of memory#Machine Learning#Computer Vision·techxplore.com·Oct 6, 2022New technique enables on-device training using less than a quarter of a megabyte of memory
Getting started with IPUs on Paperspace#Paperspace#Graphcore#Notebooks#Transformers#Computer Vision#Multimodal·graphcore.ai·Sep 30, 2022Getting started with IPUs on Paperspace
MassMIND: Massachusetts Maritime INfrared Dataset#Computer Vision#Oceanography#Massachusetts#Deep Learning#LWIR·arxiv.org·Sep 12, 2022MassMIND: Massachusetts Maritime INfrared Dataset
Researchers create the first artificial vision system for both land and water#Computer Vision#Artificial Eye#Camera·news.mit.edu·Aug 6, 2022Researchers create the first artificial vision system for both land and water
Theator, an AI platform that analyzes surgery videos, closes out its Series A at $39.5M – TechCrunch#Machine Learning#Computer Vision#Medical#Video#Analysis·techcrunch.com·Jul 22, 2022Theator, an AI platform that analyzes surgery videos, closes out its Series A at $39.5M – TechCrunch
Rewriting Image Captions for Visual Question Answering Data Creation#Questions and Answers#Computer Vision#Captions#Image Recognition·ai.googleblog.com·Jul 14, 2022Rewriting Image Captions for Visual Question Answering Data Creation