|
|
|
|
Many solutions to image compression have been proposed and they range from bit reduction, image transformation and bit reduction (no loss techniques) to lossy approaches which exploit, after image transformation, scalar or vector quantization ([2]) based on entropic model. While operating solely on bits, the bit-mapped field can be compressed without any loss of information through dedicated algorithms such as run length encoding (RLE). This algorithm allows to describe, in a reduced space, bit sequences in more simple way. For instance a sequence of zeros is substituted by a zero and by the number of its occurrences. Image compression ratios in between 2 and 4 can be obtained. Yet these ratios are not satisfying for wide image and other techniques have to be used. All the advanced compression techniques try to describe the image content in a different domain with respect the color domain by transforming the pixel values in such a way to capture more information in a few coefficients. The objective of any image transform is indeed to decorrelate the pixel values by projecting the original image onto a dual space through a set of basis functions which allows to well localize into reduced set of coefficients the most of the information content. Being the information content of an image localized as in the spatial domain as well in the frequency domain, transforms which are able to recover precise descriptors of these features are requested. They are based on three main step grouping as follows:
JPEG compression, adopting the discrete cosine transform (linear mapping) carried out through Fast Fourier transform, performs a projection in a spatial-like frequency domain. Coding without loss of information the obtained coefficients allows to reach a compression ratio of about 5.5. Nevertheless this approach is not applied to the entire image, which would be very time consuming and really not performant. Let a set of high frequencies with low amplitude be obtained from a DCT transform of an image. Applying DCT to the whole image no spatial localization can be recovered and canceling out this set of frequency can lead to high distortions in image region where also the low frequencies have low amplitude. To overcome this two drawbacks, in JPEG algorithm DCT is applied to 8x8 blocks allowing speeding up the procedure and localize frequency content. Then for each block a set of coefficients are obtained and according to entropy rules the less significant can be excluded. On the other hand the phenomenon of blockiness can not be avoided when image is deeply compressed, i.e. high discontinuities are generated at the edges of each block (see figure 1). Another drawback of Fourier analysis is the phenomenon of splitting the information in the transform domain when localized features are present in the image, i.e. the energy of discontinuous signals is dispersed across many coefficients in the transform domain. Let the image be constituted by a black background with some few white spots. To capture this localized features in dual domain many frequency components are needed. Although very few coefficients are required to encode the image directly in the image domain, much more coefficients are required to encode the image in the frequency domain.
Wavelets ([1], [6], [7]) have been proposed in this field due to the appealing property of representing both smooth areas and discontinuities compactly in transform domain and seems to promise better results with respect of JPEG in term of image distortion and compression ratios.
|
|
|
|