Exploring the Differences in Sampling Resolutions: 16-bit, 24-bit and 32-bit

Aug 02, 2023

Digital audio systems employ different sampling resolutions to capture and reproduce audio, with the most commonly used being 16-bit and 24-bit resolutions. The 16-bit resolution, which is the standard for audio CDs and most MP3s, is generally sufficient for the distribution of mixed-down music as it has an adequate dynamic range to accommodate the needs of most musical compositions. The dynamic range, as previously discussed, refers to the ratio between the softest and loudest sound that a system can reproduce without distortion.

On the other hand, the mixing process often benefits from the use of a 24-bit resolution. This resolution has a larger dynamic range than the 16-bit system, meaning it can reproduce a broader range of sound intensities without distortion. While the additional dynamic range may not significantly affect the quality of the finished mixdowns, it can make a crucial difference during the mixing process. The reason for this is that the increased dynamic range provides more "slop room" or buffer for the rounding errors that inevitably occur during digital processing. This buffer allows these errors to happen without significantly impacting the audibility of the audio.

There's another level of resolution known as 32-bit, commonly referred to in the context of "floating point" representation of digital audio. This stands in contrast to the "fixed point" representation used in 16-bit and 24-bit systems. Although the technical details between the two forms of representation are complex, the key point is that 32-bit floating point audio essentially offers the same dynamic range as 24-bit fixed point audio.

The advantage of 32-bit floating point audio lies in its ability to handle audio above the 0dBFS threshold without clipping. In effect, the system can shift bits from the bottom (quieter) end of the scale to the top (louder) end, thereby increasing the maximum representable amplitude. However, this process elevates the level of quantization noise. In practical terms, the overall dynamic range remains unchanged because the increase in noise level balances out with the rise in maximum representable amplitude.

Nevertheless, it's worth noting that it's not generally advisable to exploit this feature of the floating point system by exceeding the 0dBFS ceiling. Despite the theoretical ability of floating point systems to handle louder sounds without clipping, many plugins in Digital Audio Workstations (DAWs) convert their input audio to fixed point format internally. In this conversion process, any audio that exceeds the 0dBFS threshold will clip. Therefore, even when working within a 32-bit floating point system, it is recommended to keep all audio levels below the 0dBFS threshold to prevent potential clipping and the consequent distortion of the audio.