[CV] DIP, Image Analysis, and Computer Vision

728x90

Image Processing과 Computer Vision(CV)은
상당 부분 겹쳐있으며, 명확하게 구분하기 어려운 분야들임.

이 두 분야는 연속선상에 있으며, 상당히 경계가 모호함.

공통적으로 두 분야 모두

linear algebra, probability, statistics 등의 수학적 지식에 익숙할수록 깊이 있는 이해와 응용이 가능하며
이론적 개념을 실제로 구현하고 적용하는 것이 필수적이라,
MATLAB, Python, C, C++ 등의 프로그래밍 언어를 다룰 수 있는 능력을 요구함.

프로그래밍 기술은 알고리즘을 구현하고, 실험하며, 결과를 시각화하는 데 핵심적인 역할을 함.

이러한 공통점과 모호한 경계에도 불구하고,
Image Processing, Image Analysis, 그리고 Computer Vision은
각각 고유한 특징과 목표를 좀 더 명확하게 분리해서 이해하고 있을 만한 가치가 있음.

Rafael C. Gonzalez 의 Digital Image Processing (4th ed.) 의 1장의 내용대로

이들 분야를 continuum(연속체)로 간주하고,
저수준에서 고수준 처리로 이어지는 형태로 이해하는 것이 가장 적절하다고 생각됨.

아래 내용은 이를 정리한 것임.

1. Digital Image Processing:

definition: 디지털 컴퓨터를 사용하여 digital images를 처리하는 분야.
- 전통적인 Digital image (raster graphics = bitmaps):
  - 2차원 함수 f(x,y)로 표현되며,
  - x와 y는 spatial coordinates,
  - f는 해당 좌표에서의 intensity 또는 gray level, or vector(3-4개의 요소)을 나타냄.
- Vector Graphics
  - 수학적 공식과 기하학적 도형으로 image를 표현.
  - 점, 선, 곡선, 다각형 등의 수학적 객체들로 구성하고 text형태의 명령어들의 리스트로 이미지 구성.
  - 예: SVG, AI, EPS, PDF
  - 확대해도 화질 손실 없음
- 넓은 의미의 digital image
  - 디지털 형태로 저장되고 처리되는 모든 이미지
  - 실제 많은 현대 시스템에서는 vector graphics를 최종 출력 시 rasterization하여 화면에 표시하며
  - 일부 이미지 처리에서는 raster graphics 와 vector graphics를 함께 사용하기도 함.
범위:
- 입력과 출력이 모두 이미지인 low-level processing 에서
- 이미지에서 attributes를 추출하는 mid-level processing 을 거쳐
- Individual object recognition 까지 포함될 수 있음(단, 상대적으로 낮은 복잡도의 경우를 가리키는 경우가 많음.)
주요 기술:
- Noise reduction, contrast enhancement, image sharpening 등의 preprocessing
- 다음은 CV나 Image Analysis로 분류되기도 함:
  - Detection, Segmentation (이미지를 regions 또는 objects로 분할)
  - Object description 및 classification

https://ds31x.blogspot.com/2023/09/dip-digital-image-processing.html?q=CV

[DIP] Digital Image Processing 이란?

ds31x.blogspot.com

2. Image Analysis:

definition: Digital Image Processing과 Computer Vision 사이의 중간 영역.
특징:
- 입력은 일반적으로 images지만, 출력은 이미지에서 추출된 attributes or symbols 인 경우가 많음.
- 예: Edges, contours, identity of individual objects
주요 기술:
- Object segmentation 및 description
- Feature extraction
- Pattern recognition

3. Computer Vision:

목표: 컴퓨터를 사용하여 human vision을 emulate(모방)하는 것.
특징:
- Learning ability 가 강조됨.
- 시각적 입력을 바탕으로 inference(추론)하고 actions을 취하는 능력
인간의 지능을 모방하는 것을 목표로 하는 Artificial Intelligence (AI)의 한 분야임.

Computer Vision is

Automating Human Visual Processes
- 시각 정보를 이용하는 시스템의 자동화 (가장 본질적인 computer vision)
An Information Processing Task
- computer vision은 일종의 정보 처리라고 볼 수 있음
Inverting Image Formation
- Image가 형성되는 물리적 과정을 역으로 추론하는 것이 computer vision
Inverting Graphics
- Computer Graphics는 scene 에서 image로 변환.
- Vision은 image에서 scene으로의 변환.

Computer Vision이란 인간의 시각 과정을 자동화하는 것으로, 이를 정보 처리 문제로 접근하며, 영상 형성과정 또는 그래픽스 과정을 역으로 추론하는 문제로 이해할 수 있음.

2025.06.22 - [Programming/DIP] - [CV] Scene 과 Image: Computer Graphics 와 Computer Vision

[CV] Scene 과 Image: Computer Graphics 와 Computer Vision

1. Scene 의 세부 종류와 Image:1-1. Physical Scene (물리적 장면): 현실 세계에 존재하는 3차원 구조로, 광원, 물체, 표면 재질 등이 실제로 존재하는 공간우리가 카메라로 촬영하는 실제 세계를 의미함.1-

dsaint31.tistory.com

4. 결론

이 세 영역은 continuum을 형성하며, 다음과 같이 구분할 수 있음:

The continuum from image processing to computer vision

Low-level processes:
- Noise reduction, contrast enhancement, image sharpening 등
- 입력과 출력 모두 images 인 경우가 많으나 항상 그런 것은 아님 (e.g. 특정 영역의 평균값 구하기 )
Mid-level processes:
- Segmentation, object description, classification
- 입력은 images, 출력은 이미지에서 추출된 attributes (예: edges, contours, identity of individual objects)
High-level processes:
- 인식된 objects의 ensemble에 대한 "making sense"(=understanding)
- Human vision과 관련된 cognitive functions 수행

쉽게 생각해서 위의 continuum에서 image processing과 image analysis의 겹치는 영역은

특정 이미지 내 individual regions 또는 objects를 인식(recognition)하는 task라고 할 수 있음.

복잡도에 따라 이는 CV에 속하기도 함.

5. Example

예를 들어, automated analysis of text 분야에서:

DIP:
- image에서 text 영역 sub-image acquisition,
- 해당 image에 대한 preprocessing,
- individual character extraction (~segmentation),
- 컴퓨터 처리에 적합한 형태로 character description 구하기 : feature vector 형태
- individual character recognition
텍스트의 내용(content)의 "이해"는 복잡도에 따라,
- image analysis 또는
- computer vision의 영역에 속함.

'Programming > DIP' 카테고리의 다른 글

[CV] Dynamic Range 란? (2)	2024.09.01
[CV] 공간해상도로 본 광센서와 디스플레이 디바이스 발전사 (2)	2024.09.01
[CV] Ideal Pinhole Size (0)	2024.08.09
[CV] Hungarian Algorithm: Matching on the Bipartite Graph (0)	2024.08.06
[CV] Motion Field vs. Optical Flow (0)	2024.07.18

1. Digital Image Processing:

2. Image Analysis:

3. Computer Vision:

4. 결론

5. Example

'Programming > DIP' 카테고리의 다른 글

티스토리툴바