Robot umpires are getting their first MLB test during spring training#Sports#Computer Vision·apnews.com·Feb 23, 2025Robot umpires are getting their first MLB test during spring training
Google’s Gemini AI just shattered the rules of visual processing — here’s what that means for you#AnyChat#Gemini#Video#Computer Vision·venturebeat.com·Jan 15, 2025Google’s Gemini AI just shattered the rules of visual processing — here’s what that means for you
AI Godmother Fei-Fei Li Has a Vision for Computer Vision#Trends#Computer Vision·spectrum.ieee.org·Dec 15, 2024AI Godmother Fei-Fei Li Has a Vision for Computer Vision
OS-ATLAS: A Foundation Action Model for Generalist GUI AgentsView PDF#User Interfaces#Graphics#Large Language Models#Computer Vision#Opensource#Paper#PDF·arxiv.org·Nov 5, 2024OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Amazon’s new VAPR tech spotlights packages for easier deliveries#Amazon#Computer Vision#Delivery#Supply Chain·aboutamazon.com·Oct 9, 2024Amazon’s new VAPR tech spotlights packages for easier deliveries
Wimbledon: Line judges to be removed and electronic calling brought in from 2025#AI#Sports#Computer Vision·bbc.com·Oct 9, 2024Wimbledon: Line judges to be removed and electronic calling brought in from 2025
Claude 3.5 Sonnet for vision#Claude#Computer Vision·youtube.com·Jun 21, 2024Claude 3.5 Sonnet for vision
PuLID: Pure and Lightning ID Customization via Contrastive AlignmentView PDF#Computer Vision#Editing#Identification#Paper#PDF#Gradio·arxiv.org·May 2, 2024PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Paint by Inpaint: Learning to Add Image Objects by Removing Them FirstView PDF#Computer Vision#Editing#Paper#PDF·arxiv.org·May 2, 2024Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Automatic Creative Selection with Cross-Modal MatchingView PDF#Search#Computer Vision#Apple#Paper#PDF·arxiv.org·May 2, 2024Automatic Creative Selection with Cross-Modal Matching
STT: Stateful Tracking with Transformers for Autonomous DrivingView PDF#AVs#Transformers#Machine Learning#Computer Vision#Paper#PDF·arxiv.org·May 2, 2024STT: Stateful Tracking with Transformers for Autonomous Driving
Data-Efficient Multimodal Fusion on a Single GPUView PDF#Machine Learning#Computer Vision#Multimodal#Paper#PDF·arxiv.org·May 2, 2024Data-Efficient Multimodal Fusion on a Single GPU
SAGS: Structure-Aware 3D Gaussian SplattingView PDF#Computer Vision#Huawei#Paper#PDF·arxiv.org·May 1, 2024SAGS: Structure-Aware 3D Gaussian Splatting
NHS AI test spots tiny cancers missed by doctors#Medical#AI#Computer Vision·bbc.com·Mar 23, 2024NHS AI test spots tiny cancers missed by doctors
SCIN: A new resource for representative dermatology images#Computer Vision#Datasets#Google#Medical·blog.research.google·Mar 20, 2024SCIN: A new resource for representative dermatology images
Genie: Generative Interactive Environments#Machine Learning#Computer Vision#Google#Research#Paper#PDF#Foundation Models#Games#Robotics·arxiv.org·Feb 26, 2024Genie: Generative Interactive Environments
PIGEON: Predicting Image GeolocationsThe HTML version was okay, though the PDF was not linked. HTML (experimental)#Geography#Machine Learning#Computer Vision#Image Recognition#Paper#Surveillance·arxiv.org·Dec 21, 2023PIGEON: Predicting Image Geolocations
Greg Brockman on X: "ChatGPT Vision for digitizing journal entries:" / X#Transcription#ChatGPT#Computer Vision·twitter.com·Dec 13, 2023Greg Brockman on X: "ChatGPT Vision for digitizing journal entries:" / X
Vision-controlled jetting for composite systems and robots#3D Printing#Robotics#Computer Vision#Biomimetics·nature.com·Nov 25, 2023Vision-controlled jetting for composite systems and robots
Formula One introduces AI 'computer vision' to monitor track breaches#Computer Vision#Motion Tracking#AI·readwrite.com·Nov 24, 2023Formula One introduces AI 'computer vision' to monitor track breaches
Open sourcing Project Guideline: A platform for computer vision accessibility technology#Accessibility#Google#Opensource#Computer Vision·blog.research.google·Nov 21, 2023Open sourcing Project Guideline: A platform for computer vision accessibility technology
SANPO: A Scene understanding, Accessibility, Navigation, Pathfinding, & Obstacle avoidance dataset#Computer Vision#Scene Understanding#Google Research·blog.research.google·Oct 10, 2023SANPO: A Scene understanding, Accessibility, Navigation, Pathfinding, & Obstacle avoidance dataset
Google at ICCV 2023#Computer Vision#Conference#Google·blog.research.google·Oct 2, 2023Google at ICCV 2023
How to Use ChatGPT’s New Image Features#ChatGPT#Computer Vision#Tips·wired.com·Sep 30, 2023How to Use ChatGPT’s New Image Features
DynIBaR: Space-time view synthesis from videos of dynamic scenes#Video#Graphics#Computer Vision·blog.research.google·Sep 29, 2023DynIBaR: Space-time view synthesis from videos of dynamic scenes
GPT-4V(ision) system cardRead paper#GPT-4#Computer Vision#OpenAI#PDF·openai.com·Sep 25, 2023GPT-4V(ision) system card
These new tools could make AI vision systems less biased#Large Language Models#Tools#Computer Vision#Bias·technologyreview.com·Sep 25, 2023These new tools could make AI vision systems less biased
MIME: Human-Aware 3D Scene Generation#Computer Vision#3D#Paper#PDF·arxiv.org·Jun 22, 2023MIME: Human-Aware 3D Scene Generation
Microsoft at CVPR 2023: Pushing the boundaries of computer vision - Microsoft Research#Computer Vision#Microsoft#Paper#Conference·microsoft.com·Jun 21, 2023Microsoft at CVPR 2023: Pushing the boundaries of computer vision - Microsoft Research
Recognize Anything: A Strong Image Tagging Model#Computer Vision#Image Recognition#Paper#PDF·arxiv.org·Jun 11, 2023Recognize Anything: A Strong Image Tagging Model