|
Overview
Digital images are now ubiquitous and easy to aquire. While humans easily recognize the objects and other semantic content in images, it has been much more difficult to do so automatically. Images of the same object can vary significantly because of lighting, slight differences in orientation, shadows and camera parameters. Even when the set of objects is limited and reference images are avaliable, direct comparison of images on a pixel-by-pixel basis is unlikely to yield satisfactory recognition. Higher-level semantics are required.
To obtain higher-level semantics, a set of features must be computed from eash image, and these features are then compared to the features of the reference image. Finding the right set of features by trial and error can be time-consuming and difficult. Therefore, in this project, we will build a system to learn an appropriate set of features from a training set of images, and then apply them to a test set of images. The methodology used in this project has been used successfully for face recognition.
|