Error-Free Compression
Probability of each run-length vector: hi = Li/Ni, where Li is the length of the
individual vector and Ni is the total number of pixels in the image with graylevel Gi.
If there are J vectors defining the run sequences of value Ga, and the probability of a
vector specifying a particular sequence length Lai of Ga values is hai = NLai/J, then the
entropy Ea of the vectors specifying value Ga is
Ea = -Σi hai·lb(hai),
where lb denotes the base-2 logarithm. A similar expression is obtained for the K vectors
specifying the value Gb:
Eb = -Σi hbi·lb(hbi).
Defining the average run lengths La = (Σi Lai)/J and, analogously, Lb = (Σi Lbi)/K then
gives the entropy for the entire binary image using run-length coding as
Erl = (Ea + Eb)/(La + Lb).
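As a sketch only (my own, not part of the original notes), these definitions translate
directly into Python; the names run_lengths and rl_entropy are illustrative, and both
graylevels are assumed to occur in the image.

from collections import Counter
from math import log2

def run_lengths(rows, value):
    """Lengths of all runs of `value`, scanning each row of a binary image separately."""
    lengths = []
    for row in rows:
        count = 0
        for pixel in row:
            if pixel == value:
                count += 1
            elif count:
                lengths.append(count)
                count = 0
        if count:
            lengths.append(count)
    return lengths

def rl_entropy(image):
    """Run-length entropy estimate Erl = (Ea + Eb) / (La + Lb), in bits per pixel."""
    E, L = {}, {}
    for value in (0, 1):                      # Ga = 0, Gb = 1
        lengths = run_lengths(image, value)
        J = len(lengths)                      # number of run vectors for this value
        counts = Counter(lengths)             # N_Lai: how many runs have each length
        E[value] = -sum((n / J) * log2(n / J) for n in counts.values())
        L[value] = sum(lengths) / J           # average run length for this value
    return (E[0] + E[1]) / (L[0] + L[1])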
An example image is:
0 0 0 0 0 0 0
1 1 1 1 1 1 1
1 1 1 1 1 1 1
1 1 1 1 1 1 1
1 1 1 1 0 0 0
1 1 1 1 0 0 0
0 0 0 0 0 0 0
In uncompressed form, this image takes 49 bits. To perform run-length encoding, first
give the value and the length of every run in each row. The image contains nine such
runs, so with one bit for the value and three bits for the length, the picture requires
9 × 4 = 36 bits, a compression ratio of 1.36 : 1. A zig-zag scan joins runs across row
boundaries, leaving only five runs; with 5 bits for the lengths (6 bits per run), the
requirement drops further to 5 × 6 = 30 bits.
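As an illustration (mine, not from the notes), a short Python sketch reproduces these
counts; the helper name encode_rows and the layout of the example image are taken from
the grid above.

def encode_rows(rows):
    """Return (value, length) pairs for every run, scanning each row left to right."""
    runs = []
    for row in rows:
        value, length = row[0], 1
        for pixel in row[1:]:
            if pixel == value:
                length += 1
            else:
                runs.append((value, length))
                value, length = pixel, 1
        runs.append((value, length))
    return runs

image = [[0, 0, 0, 0, 0, 0, 0],
         [1, 1, 1, 1, 1, 1, 1],
         [1, 1, 1, 1, 1, 1, 1],
         [1, 1, 1, 1, 1, 1, 1],
         [1, 1, 1, 1, 0, 0, 0],
         [1, 1, 1, 1, 0, 0, 0],
         [0, 0, 0, 0, 0, 0, 0]]

runs = encode_rows(image)
print(len(runs), len(runs) * (1 + 3))    # 9 runs -> 36 bits (1 value bit + 3 length bits)

# Zig-zag scan: reverse every other row so runs can continue across row boundaries.
zigzag = [p for i, row in enumerate(image) for p in (row if i % 2 == 0 else row[::-1])]
zz = encode_rows([zigzag])
print(len(zz), len(zz) * (1 + 5))        # 5 runs -> 30 bits (1 value bit + 5 length bits)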
Other approaches find the edges of objects and then encode the edges using Fourier
descriptors or 8-direction chain codes. The Fourier method introduces errors unless
all of the Fourier descriptors are retained.
Predictive coding:
Assume that neighbouring pixels are correlated and encode the differences between
adjacent pixels instead of the pixel values themselves. This requires an estimation
function fest(x,y) that can be compared with the original image f(x,y). Subtracting
gives an error function e(x,y) = f(x,y) - fest(x,y), which ideally takes much smaller
values than the pixels themselves and can therefore be coded with fewer bits. One
simple estimation function (differential coding) is to assume that each pixel has the
same value as its predecessor in the row:
fest(x,y) = f(x-1,y).
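A minimal sketch (my own, not from the notes) of this differential scheme along one
row, showing that the decoder recovers the original values exactly; encode_row and
decode_row are assumed names.

def encode_row(row):
    """Keep the first pixel, then store the differences e(x) = f(x) - f(x-1)."""
    return [row[0]] + [row[x] - row[x - 1] for x in range(1, len(row))]

def decode_row(coded):
    """A running sum undoes the differencing, so no information is lost."""
    row = [coded[0]]
    for e in coded[1:]:
        row.append(row[-1] + e)
    return row

row = [0, 0, 2, 3, 4]                    # first row of the sample image below
assert decode_row(encode_row(row)) == row
print(encode_row(row))                   # [0, 0, 2, 1, 1]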
Sample image:
0 0 2 3 4
1 1 2 3 3
1 1 3 4 4
1 2 3 4 4
1 0 2 4 4
Straight binary coding of this image with a 3-bit code requires 75 bits. Applying the
simple predictor described above, fest(x,y) = f(x-1,y), row by row, and specifying the
initial value of each row, gives (initial graylevel, then the differences):
0: 0, 2, 1, 1
1: 0, 1, 1, 0
1: 0, 2, 1, 0
1: 1, 1, 1, 0
1: -1, 2, 2, 0
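The listed values can be checked with a few lines (a sketch of mine, reading the sample
grid above row by row):

sample = [[0, 0, 2, 3, 4],
          [1, 1, 2, 3, 3],
          [1, 1, 3, 4, 4],
          [1, 2, 3, 4, 4],
          [1, 0, 2, 4, 4]]

for row in sample:
    diffs = [row[x] - row[x - 1] for x in range(1, len(row))]
    print(row[0], diffs)       # first graylevel of the row, then its differences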
Now using 2 bits for each difference and 3 bits for the first graylevel of each row
gives a total of 20 × 2 + 5 × 3 = 55 bits, for a compression ratio of 75/55 = 1.36 : 1.
Zig-zag scanning eliminates the need to give the initial value of every row after the
first and allows longer difference sequences: only one initial value and 24 difference
values are then required. Huffman coding of the differences brings the average down to
1.87 bits per difference value, i.e. about 45 bits for the 24 differences. Adding three
bits for the initial value brings the total storage requirement to 48 bits, for a
compression ratio of 75/48 ≈ 1.56 : 1.
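One consistent reading of this zig-zag variant (a sketch of mine; the scan direction
and the huffman_lengths helper are assumptions, not from the notes) reproduces the
1.87 bits per value and the 48-bit total:

from collections import Counter
import heapq

sample = [[0, 0, 2, 3, 4],
          [1, 1, 2, 3, 3],
          [1, 1, 3, 4, 4],
          [1, 2, 3, 4, 4],
          [1, 0, 2, 4, 4]]

# Zig-zag (boustrophedon) scan, then the 24 successive differences.
scan = [p for i, row in enumerate(sample) for p in (row if i % 2 == 0 else row[::-1])]
diffs = [scan[i] - scan[i - 1] for i in range(1, len(scan))]

def huffman_lengths(symbols):
    """Code length of each symbol in a Huffman code built from the symbol counts."""
    counts = Counter(symbols)
    heap = [(n, i, {s: 0}) for i, (s, n) in enumerate(counts.items())]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        n1, _, d1 = heapq.heappop(heap)
        n2, _, d2 = heapq.heappop(heap)
        heapq.heappush(heap, (n1 + n2, next_id,
                              {s: depth + 1 for s, depth in {**d1, **d2}.items()}))
        next_id += 1
    return heap[0][2]

lengths = huffman_lengths(diffs)
counts = Counter(diffs)
bits = sum(counts[s] * lengths[s] for s in counts)   # 45 bits for the 24 differences
print(bits / len(diffs), bits + 3)                   # 1.875 bits/value, 48 bits in total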
Last modified on April 25, 2001