There are two types of redundancy present in a video signal -- spatial and temporal. Spatial redundancy means that if a pixel is red, the ones near it are quite likely to also be red. Temporal redundancy means that if a pixel is red, the same pixel in the next frame is also likely to be red.
The spatial redundancy is handled exactly as it is in JPEG. You break the signal up into its frequency components and throw away the ones you are not likely to notice. The "brightness" (luminance) information is kept at full resolution, but the "color" and "saturation" (chrominance) values are reduced in resolution, because the eye is not as sensitive to those.
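If you want to see the frequency-component idea in code, here is a minimal Python sketch. The single "coarseness" number is my own stand-in for JPEG's per-frequency quantization tables, and the use of scipy's DCT routines is just for illustration -- nothing here is taken from the MPEG spec:

    import numpy as np
    from scipy.fft import dctn, idctn

    def compress_block(block, coarseness=16):
        # Transform an 8x8 block into its frequency components.
        coeffs = dctn(block.astype(float) - 128, norm='ortho')
        # Quantize: small high-frequency coefficients round to zero,
        # which is how they get "thrown away".
        return np.round(coeffs / coarseness)

    def decompress_block(quantized, coarseness=16):
        # Undo the quantization and the transform.
        return idctn(quantized * coarseness, norm='ortho') + 128

    block = np.random.randint(0, 256, (8, 8))
    restored = decompress_block(compress_block(block))

Most of the quantized array ends up as zeros, and long runs of zeros compress extremely well, which is where the actual size savings come from.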
Here is how temporal redundancy is handled. Out of every nine frames, basic MPEG compression fully compresses one of them JPEG-style (an "I" frame). Another two of the nine are stored as error differences from the preceding full or difference frame ("P" frames). The remaining six can reference either a previous or a future anchor frame and encode only the error ("B" frames). This means that if you have a static shot of a person talking, the background might not move at all, so there would be NO error for those areas. Then, you only need to encode the slight head and lip motion of the actor speaking.
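Here is a toy sketch of the difference-frame idea, assuming frames are plain numpy arrays of brightness values. A real encoder entropy-codes the (mostly zero) difference; this just computes it:

    import numpy as np

    def encode_difference(anchor, new_frame):
        # Static areas produce a difference of exactly zero,
        # which costs almost nothing to store.
        return new_frame.astype(int) - anchor.astype(int)

    def decode_difference(anchor, diff):
        return anchor.astype(int) + diff

    frame0 = np.zeros((64, 64), dtype=np.uint8)  # static background
    frame1 = frame0.copy()
    frame1[30:34, 28:36] += 50                   # slight lip motion
    diff = encode_difference(frame0, frame1)
    print(np.count_nonzero(diff), "of", diff.size, "pixels changed")

Only the handful of pixels around the mouth show up in the difference; everything else is zero.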
Further MPEG improvements involve motion compensation. You look for a block of pixels that moves around between frames, and encode its displacement (a motion vector) plus whatever error remains. This can greatly reduce the size, but it takes a LOT of processing power to search for the movement.
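Here is a brute-force block-matching sketch so you can see why it is so expensive. I am assuming 8x8 blocks, a small search window, and the common SAD (sum of absolute differences) cost; real encoders use much smarter search strategies than this exhaustive loop:

    import numpy as np

    def find_motion_vector(prev_frame, cur_frame, y, x, size=8, radius=4):
        target = cur_frame[y:y+size, x:x+size].astype(int)
        best_cost, best_vec = None, (0, 0)
        # Try every offset in the window: this nested loop over
        # candidate positions is where all the processing power goes.
        for dy in range(-radius, radius + 1):
            for dx in range(-radius, radius + 1):
                yy, xx = y + dy, x + dx
                if (yy < 0 or xx < 0 or yy + size > prev_frame.shape[0]
                        or xx + size > prev_frame.shape[1]):
                    continue
                candidate = prev_frame[yy:yy+size, xx:xx+size].astype(int)
                cost = np.abs(candidate - target).sum()  # SAD
                if best_cost is None or cost < best_cost:
                    best_cost, best_vec = cost, (dy, dx)
        return best_vec  # encode this vector plus the residual error

Even with this tiny 9x9 window you do 81 full block comparisons per block; widen the window and the cost grows quadratically, which is why motion estimation dominates encoding time.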
I hope that this helps.