When I look at the waveform of a vocal recording, it looks lop-sided, with the waveform going further above the zero-crossing point than it goes below it. It sounds fine, though, so what's happening here?
Bruno D'Cunha
SOS Technical Editor Hugh Robjohns replies: This kind of asymmetrical waveform is quite natural and normal, and is particularly common on recordings of speech and vocals, brass instruments, and sometimes also closely miked strings. A lot of percussive sounds are also strikingly asymmetrical, of course.
In the 'BC' era (Before Computers) we didn't look at waveforms, we just listened to them, and this kind of asymmetry was generally inaudible and didn't bother us, although some people are sensitive to absolute polarity, and can actually tell if an asymmetrical waveform is inverted. Waveform asymmetry has been known about for a very long time, and in the few areas where it can be an issue (such as in broadcast processing), technology has long been in place to deal with it. However, since the prevalence of the DAW and its ubiquitous waveform display, a lot of people have become aware of it and asked the same question.
This asymmetry is due mainly to two things, the first being the relative phase relationships between the fundamental and different harmonic components in a harmonically complex signal. In combining different frequency signals with differing phase relationships, the result is often a distinctly asymmetrical waveform, and that waveform asymmetry often changes and evolves over time, too. That's just what happens when complex related signals are superimposed.
The other element involved in this is that many acoustic sources inherently have a 'positive air pressure bias' because of the way the sound is generated. To talk or sing, we have to breathe out, and to play a trumpet, we have to blow air through the tubing. So, in these examples, there is inherently more energy available for the compression side of the sound wave than there is for the rarefaction side, and that can also contribute to an asymmetrical waveform.
Confusingly — and erroneously — this natural waveform asymmetry is often attributed to a 'DC offset', but that's not the case at all. A DC offset is a specific fault condition where the varying AC audio signal voltage is offset by a constant DC voltage, and the 'tell-tale' is that, although the waveform might look asymmetrical, a decaying signal waveform settles away (offset) from the centre zero-line.
DC offsets are virtually unheard-of these days, but can occur in hardware analogue electronics under fault conditions. In the digital world, very early multi-bit A-D converters sometimes suffered a problem in the quantiser that essentially resulted in encoding a fixed-level shift or offset onto the audio sample values — the digital equivalent of an analogue DC offset.
However, a DC offset can be very easily corrected by passing the audio through a high-pass filter tuned to a low frequency (typically 10Hz or lower). It is important to correct DC offsets when they do occur, because editing between an audio clip with a DC offset and one without results in a loud thump or plop at the edit point, which is not good!
In contrast, natural waveform asymmetry cannot be 'corrected' with a high-pass filter, and a rather more complicated solution is required called a 'phase rotator'. Generally, there is no need to 'correct' a naturally asymmetrical signal, but occasionally the asymmetry can restrict how much the signal can be amplified because the stronger half of the waveform will reach the clip level before the weaker side. By using a phase rotator process to alter the harmonic phase relationships, a more balanced symmetry can be established, allowing slightly more gain to be applied before both sides reach the clipping level at the same amplitude. Asymmetrical waveforms can also sometimes confuse the side-chain level-detection circuitry (or algorithms) of some compressors, resulting in less effective compression than might be expected.