We used the digital images of the Leiden 19th-Century Portrait Database
(LCPD)
as image database for our research. This database consists of images
taken from cards (scanned at 300 dpi), containing a picture of a
portrait from 19th-century on one side, and a studio logo or description
on the other. At the time, the LCPD image database contained 3014 images
which were obtained from 1507 cards.
Several methods have been developed for computing a similarity measure of images. The methods differ in pixel domains, feature vectors and metrics. The pixel domains can be categorized in the following groups:
In [
5
] 2D pixel trigrams are introduced. Trigrams are based on statistics of
binary 3x3 texels in a thresholded gradient space. To create a gradient
space, we used the Sobel gradient. Once we threshold the gradient image
into binary gradient images, we can use trigrams. A trigram is a 3x3
texel of 1 bit values that form a 9 bit long string, 9 values of
different powers of 2 (1,2,4,...,256). The idea is to create a histogram
of 3x3 patterns, where there are a possible
binary 3x3 patterns. Every possible 3x3 region in the binary gradient
image is multiplied with the trigram weights to obtain an index into our
histogram. The histogram value at this index is then raised by one. When
all possible 3x3 texels have been visited, a frequency histogram of the
binary image, with respect to all possible 3x3 binary patterns, is
obtained. These feature vectors are normalized before used for matching.
See figure
3
for an example of a computation that results in an index into a
histogram.
Our histogram can be seen as a feature vector of length 512 of the image. We can compute all the feature vectors of our images in the database, and use the vectors for similarity matching. Similarity matching is done by summing the absolute differences of the corresponding feature vector elements for a pair of images (L1-norm). Because this feature vector is rather large (512 different patterns as mentioned before) and contains a relatively large fraction of noise contributions, a bandpass filter is used to suppress all but 75 patterns during the comparison. The resulting distances are stored in a full distance matrix which is used to generate an overview of the Leiden 19th-Century Portrait Database. In [ 6 ] the effectiveness of Trigram feature vectors for image comparison is compared against the performance of projections, Local Binary Patterns and Virage Datablade.
Dr. D.P. Huijsmans