Sample Rate & Bit Depth Explained
Sample rate and bit depth are two values that you've likely noticed in your export settings. The higher these values, the higher the audio quality of your exported file. So why not just set them to their maximum values? The main reason you don't do this is because your file size will be massive. The key is to identify the maximum sample rate and bit depth values that will yield noticeable results, and then export your project with those settings.
For the most part, you'll be bouncing your tracks with a sample rate of 44100 samples/second and a bit depth of 24. Most people can't hear an increase in quality above these values, so in order to keep file sizes down, it's recommended you export your tracks with the following settings:
Sample rate refers to the number of samples (tiny slices of audio information) that are present within one second of audio. Let's assume we are recording a one-second clip of guitar. There's no higher audio quality than the sound coming out of the guitar and directly hitting your ear. When recording the guitar for playback at a later time, things change. A microphone picks up the sound waves coming out of the guitar, and it converts those waves into an electrical signal. That electrical signal is sent through a cable to your audio interface. Your audio interface performs A/D (analog to digital) conversion and at this point, sample rate becomes a factor.
Similar to how films are just pictures being played in succession, digital audio is also just tiny samples being played one after another. If you record one minute of video at a high frame rate, the fluid motion of it will be much greater, making it appear more life-like. This same concept applies to audio as well. The higher your sample rate, the more fluid your audio file will be when it plays back, making it appear more life-like. It's important to keep in mind that you can't upgrade the sample rate of an audio file once it's been recorded.
If you record your guitar at 44100 samples/second, that's all the information you have to work with. Even if you bounce the guitar file out of your DAW at 96000 samples/second, the guitar will still playback as a 44100 samples/second file. This is why it's important to know your destination sample rate before recording.
If you know you're going to be performing creative audio manipulation that involves time stretching audio files past their recorded length, you may choose to record at a higher sample rate (if your audio interface allows for it). Time stretching an audio file that was recorded at 192000 samples/second can maintain its fidelity far past the point of an audio file that was recorded at 44100 samples/second.
Vince Tennant demonstrates the benefits of recording at 192000 samples/second in the following video using a Sanken CO-100k microphone that is capable of recording up to 100 kHz; it appears as though he's actually pitched the recording down (resulting in time expansion) and then applied time compression to re-align the sound with the voice actor's lip movement.
Does this meant that you should always record at the highest sample rate possible? Not necessarily. You should record at the sample rate of your final destination format. If you're recording a vocalist, for a song, this is probably going to be 44100 samples/second, whereas the the regular dialog for a film would likely be 48000 samples/second. There are actually downsides the recording at unnecessarily high sample rates, as is demonstrated by Mark Furneaux in great detail.
Bit depth dictates the number of possible amplitude values of one sample. Pulse-code modulation (PCM) is the standard form of digital audio in computers. The amplitude of an analog signal is sampled at regular intervals (higher sample rate = shorter intervals) to create a digital representation of the sound source. The sampled amplitude is quantized to the nearest value within a given range. The number of values within this range are determined by bit depth. To better explain this concept, I'll use an example:
On a television you can set the volume to it's minimum amplitude (0dB) or to it's maximum amplitude (70dB). If the television only allows you to set 5 different volume levels (ranging from 0dB to 70dB), it would be considered to have a low amplitude resolution. What if you wanted to set the volume between it's 3rd and 4th volume setting? You wouldn't be able to because its amplitude resolution is too low. If the television allowed you to choose from 100 different volume levels (ranging from 0dB to 70dB), it would be considered to have a high amplitude resolution. The minimum and maximum amplitudes haven't changed, but you're now capable of selecting more fine-tuned amplitude levels. This concept can help explain how bit depth works.
You can calculate the number of possible amplitude values for a given bit depth by using the equation 2 to the power of n (n being your bit depth). At a bit depth of 16, each sample's value is one of 65536 (2 to the power of 16) possible values. At a bit depth of 24, each sample's value is one of 16777216 (2 to the power of 24) possible values. The higher bit depth doesn't make your song louder, it just provides a higher quality amplitude resolution (similar to how the television with 100 volume levels lets you set more fine-tuned levels than the television with 5 volume levels).
A higher bit depth will produce a higher resolution sample. The more dynamically accurate your samples, the truer they'll be to the analog sound source they're meant to reproduce. Lower bit depths produce a lower signal-to-noise ratio (which you generally don't want), but will also yield smaller file sizes. Applying dither and noise shaping on a mastering level can help reduce the noise that results from exporting at lower bit depths.
32-Bit Floating Point Digital Audio
In your DAW, information that reaches above 0dB is clipping, right? Actually, this isn't the case; information above 0dB isn't lost until it's truncated by your D/A converter, or exported to a fixed point file format (such as 16 or 24-bit fixed point). Applying a 32-bit limiter to the stereo buss in your DAW will prevent clipping from occurring; even if the signal running into the limiter is peaking well above 0dB. Yes, this means all the individual tracks in your DAW can be peaking above 0dB, as long as you apply a 32-bit limiter on your master buss. No, this isn't going to make your music louder.
If you export a 32-bit floating point file, the points above 0dB will be saved to the file you export. However, if you try to playback the file through your audio interface, your 24-bit fixed point D/A converter will cause the information above 0dB to disappear. This doesn’t mean the information is missing from the digital file on your computer, your D/A converter just can’t reconstruct the digital signal above 0dB in the analog realm.
If you send a 32-bit floating point file that’s peaking above 0dB to your mastering engineer, they can reduce the level of the file when they import it into their DAW. Quite honestly, there’s no real benefit in bouncing 32-bit floating point files, and they won’t make your mix sound any better, but it’s fun to know how they work. Proper gain staging is still your friend.
If you made your way through each section of this guide, you should have a working understanding of sample rate and bit depth. If you'd like to add anything to this guide, make sure to leave a comment below so that fellow music producers, and audio engineers get a chance to read what you have to say. Follow Black Ghost Audio on Facebook, Twitter, Instagram, and YouTube to stay up to date on the latest music production tips and tricks. New content is published weekly, and I don’t want you to miss out. Thanks for reading!