Recognize Anything: A Strong Image Tagging Model
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
Improving Factuality and Reasoning in Language Models through Multiagent Debate
ORCa: Glossy Objects as Radiance Field Cameras
F-VLM: Open-vocabulary object detection upon frozen vision and language models
Random-Access Neural Compression of Material Textures
Scaling Vision Transformers to 22 Billion Parameters
DINOv2: Learning Robust Visual Features without Supervision
Computer Vision Platform and AI Software Company | Landing AI
Announcing the ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-based 3D Object Detection
A Good Prompt Is Worth Millions of Parameters? Low-resource...
Google Cloud Introduces Shelf Inventory AI Tool for Retailers
Contactless Blood Pressure Estimation System Using a Computer Vision System
Wang, W. and others. (2021). Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions.
EDGE: Editable Dance Generation From Music
Metaspectral Raises $4.7 Million to Launch Fusion, a Cloud-Based AI Platform - SpaceRef
A simpler path to better computer vision
Top Object Detection Algorithms and Libraries in Artificial Intelligence (AI)
Multi-layered Mapping of Brain Tissue via Segmentation Guided Contrastive Learning
PI-ARS: Accelerating Evolution-Learned Visual-Locomotion with Predictive Information Representations
Google reveals what’s next for Cloud AI
Algorithms predict sports teams' moves with 80% accuracy
New technique enables on-device training using less than a quarter of a megabyte of memory
Getting started with IPUs on Paperspace
MassMIND: Massachusetts Maritime INfrared Dataset
Researchers create the first artificial vision system for both land and water
Brickit
Theator, an AI platform that analyzes surgery videos, closes out its Series A at $39.5M – TechCrunch
Rewriting Image Captions for Visual Question Answering Data Creation