The Magic of Video Encoding and How it Enhances the Web

With all of the web video streaming services at our disposal that can instantly serve up 1080p video, it is easy to forget how much of a challenge it is to store and encode moving images and sound. When we think about analog motion picture, film comes to mind and it is easy to visualize the difficulties of loading, storing, protecting, and projecting it. Even relatively short films are incredibly bulky and difficult to manage, giving us some rationale for why we needed to digitize video.

However, there is a complication: audio and video information, in their raw form, take up huge amounts of memory. High quality uncompressed digital audio is made up of around 44 thousand samples, each usually 16 or 24 bits per sample, for every second of playback. For even a three minute pop song this adds up quickly. Uncompressed video is a collection of pixel arrays, usually 24 or 32 bits per pixel, and when you multiply the resolutions of the image by the 30-60 frames per second that you can expect from video, you will be faced with a massive amount of storage. Even though storage is very cheap these day and companies like Google deal in petabytes worth of storage at a time, files of that size are impossible to send over a network with any kind of real-time reliability. This is where encoding comes in: it has enabled the rich playback abilities that we enjoy today.

Encoding schemes such as MPEG take advantage of the fact we maintain persistence of vision and are more susceptible to high frequency changes in an image sequence. Ever wonder why you will usually see small blocky regions in an encoded video? This is because the image is given a spatial frequency analysis in blocks that are usually 8×8. Converting something such as an array of pixels or audio samples into a sum of cosines (known mathematically as DCT) leads to a representation that is significantly more compact, and in the case of most video there is little change from pixel to pixel. The most current encoding schemes, on top of this type of analysis, can perform recognition of objects in frames and motion estimation so that only changes in the image that are perceptible to the viewer are part of the output. What this means is orders of magnitude less information that must be transmitted in order to play the video at reasonable quality.

This is just scratching the surface of the abilities of modern video encoding, but we have established that it is an essential part of modern media delivery capability and a core driver to value in many businesses. Contact us to learn more about our efficient, high quality enterprise video encoding services.

You may be interested

How Time To First Byte (TTFB) Impacts Your Site’s Performance
CDN
724 views
CDN
724 views

How Time To First Byte (TTFB) Impacts Your Site’s Performance

Nadia Benslimane - August 6, 2019

How Time To First Byte (TTFB) Impacts Your Site's Performance Let's all agree that your website speed can be subject to many factors. While it's hard to…

Medianova Recognized in Gartner’s 2019 Market Guide for CDN Services
CDN
624 views
CDN
624 views

Medianova Recognized in Gartner’s 2019 Market Guide for CDN Services

Mujde Karakaya - August 1, 2019

Medianova Recognized in Gartner’s 2019 Market Guide for CDN Services We did it again! We are delighted to announce that we have been mentioned in Gartner’s 2019…

Why The Screen Size Matters When Consuming News Content
Tech News
489 views
Tech News
489 views

Why The Screen Size Matters When Consuming News Content

Nadia Benslimane - July 30, 2019

Why The Screen Size Matters When Consuming News Content A new study conducted by researchers at the University of Michigan and Texas A&M University reported that consuming…

Most from this category