Computer vision - Wikipedia

Original SourceResource Reference
AI Synthesis
High-Signal Research

Overview

Computer vision is an interdisciplinary field of artificial intelligence focused on enabling computers to derive high-level understanding and actionable insights from digital images and videos by automating the complex functions of the human visual system.

Key Insights

  • Core Objective: The field seeks to transform raw visual data into symbolic information and numerical models using principles from geometry, physics, statistics, and learning theory to facilitate automated decision-making.
  • Multidisciplinary Scope: It encompasses various subfields including scene reconstruction, object detection, video tracking, and 3D pose estimation, often overlapping with "machine vision" in industrial and factory automation contexts.
  • Evolution of Methodology: Historically, the field progressed from 1970s edge-extraction and 1980s mathematical optimization (scale-space, contour models) to 1990s breakthroughs in 3D reconstruction and statistical face recognition.
  • Deep Learning Revolution: Modern computer vision has shifted toward deep learning and complex optimization frameworks, which have significantly increased accuracy across benchmark datasets for tasks such as image classification and segmentation.