Moppel's limited synths - The Math behind DSF Synthesis

The Math behind DSF Synthesis

Introduction

Discrete Summation Formulae (DSF) Synthesis, which has been used in all Moppelsynths so far to achieve maximum signal quality, is a synthesis technique that computes a bandlimited, harmonic signal. The generated timbres are quite similar to those of FM synthesis, yet DSF Synthesis doesn't require oversampling and filtering since it can compute directly only the desired harmonics below the Nyquist frequency. Owing to this, the generated signal is of high quality, free of aliasing artefacts and phase distortion.

However, the quality comes at the price of a bit more CPU-intensity and not-so-trivial theory and implementation. Seeing how people get all orgasmic about VST synths with squashed sound with a reverb on top, you may want to ponder if it's worth the effort.

This article describes how to derive the formulas of DSF Synthesis. At the same time, it explains a DSF variant that has been used in the Tetra and Sonitarium synths and that is more general than the one proposed by Moorer in that it generates an additional series of cosines to obtain a quadrature signal, a stereo signal whose harmonics are pairwise out of phase by 90 degrees. In between, a simple introduction to sine and cosine calculation by complex numbers is provided.

The Objective

To generate harmonic sound, a synthesis technique must generate a series of sinusoids such that the frequency difference between two neighbored sines is everywhere the same. In the digital sound synthesis domain, DSF Synthesis attempts to achieve that in a straightforward manner by fast calculations of formulas of the form

s(t) = ∑_k=0..Nw^k sin(2p(f_c + k f_m) t / sampletime),

where

s(t) is the sample to be output at sample step t
f_c is the fundamental frequency
f_m is the "distance frequency" between the sine waves (hence called harmonics)
w^k is the magnitude of the k-th harmonic (ie. for w<1, the magnitudes of the higher harmonics are getting smaller and smaller)
t is the number of the sample step (with." t / sampletime" being the time in seconds at which sample t is output)
N is the number of harmonics plus 1 (for the fundamental)

To get a more visual impression of what the signal s(t) is like, let's describe its spectrum::
There are N+1 partials, the fundamental is f_c, the distance between two subsequent harmonics is f_m, and the harmonics' magnitudes fall off by factor w. On a decibel scale, the partials fall off linearly.

Spectrum of a DSF-generated waveform with N = 8, f_c = 200, f_m = 50, and w = 0.7.

In order to avoid aliasing , the maximum frequency of the sine waves must be smaller than the Nyquist frequency, which is half the sample rate (eg. 22050Hz when you're sampling at 44100Hz).So N should be chosen small enough such that f_c + N f_m is smaller than the Nyquist frequency. Once that is provided, you could calculate samples by that formula to get a harmonic signal that is free of aliasing artefacts.

Computing all those sines would be quite slow, though, and DSF Synthesis fixes that by computing an equivalent formula that doesn't make it necessary to sum up all the sines. This means it aims for nothing less than generating a bandlimited signal without adding all sines separately (unlike Additive Synthesis), and without having to oversample and filtering out the harmonics above the Nyquist frequency (unlike Frequency- or Phase-Modulation Synthesis have to do).
So, how does DSF Synthesis do that?

Sines and Cosines by Phasor Rotations

In order to understand the math behind DSF synthesis, we first have to examine what sines and cosines are all about. Suppose we have an arc rotating counter clockwise around the center of the Cartesian coordinate system in a plane. If that so-called phasor has length A and rotates by an angle of w per second, then after t seconds the x-coordinate of the tip of the phasor is A*cos(w*t) and the y-coordinate is A*sin(w*t):

Not really spectacular, right. But this already gives us a hint that we can compute sines and cosines quickly once we have such a phasor. And this is where complex numbers come into play.

Sines and Cosines by Complex Number Multiplications

In fact, such phasors can be described conveniently by complex numbers such that a rotation can be performed with a couple of simple arithmetic operations. Complex numbers can be viewed as ordered pairs [a,b] or as points in the plane for which addition, subtraction, multiplication and division are defined so we can calculate with them just like with real numbers. The x-coordinate of a complex number is commonly called its real part and its y-coordinate is called its imaginary part. To denote the real and imaginary parts of a complex number, we define:

RE([a,b]) = a and IM([a,b]) = b

Addition and subtraction of two complex numbers are simply defined like this:

[a1,b1] + [a2,b2] = [a1+a2, b1+b2]
[a1,b1] - [a2,b2] = [a1-a2, b1-b2]

The multiplication is defined like this:

[a1,b1] * [a2,b2] = [a1*a2 - b1*b2, a1*b2 + b1*a2]

The division is somewhat more complicated and defined like this:

[a1,b1] / [a2,b2] = [(a1*a2 + b1*b2) / (a2*a2 + b2*b2), (b1*a2 - a1*b2) / (a2*a2 + b2*b2)]

Notice that the real numbers can be viewed as the subset of the complex numbers in that they can be represented by complex numbers with imaginary component 0:
a1 + a2 = [a1,0] + [a2,0] = [a1+a2,0]
a1 * a2 = [a1,0] * [a2,0] = [a1*a2 - 0*0, a1*0 + 0*a2] = [a1*a2,0]
and
x * [a,b] = [x,0] * [a,b] = [x*a, x*b].

Also notice that every complex number can be viewed as an arc and vice versa. That is, for each complex number [a1,b1] there exists an A and an angle w such that:

[a1,b1] = A * [cos(w),sin(w)],

where A is the length of the arc described by [a1,b1].

One feature of complex numbers that is particularly valuable for sound synthesis is that you can use them easily to implement rotating phasors by complex number multiplications, because it can be shown that

(A*[cos(w),sin(w)]) * (B* [cos(g),sin(g)])
= A*B * [cos(w+g),sin(w+g)]

That means, we can rotate a complex number by the angle w by multiplying it with the complex number [cos(w),sin(w)].

From this observation, we can already derive a simple oscillator that quickly generates both a sine and a cosine wave. For each sample, we just have to perform a complex multiplication of a phasor with a suitable complex number of the form [cos(w),sin(w)]. Every now and then, we have to normalize the phasor, ie. reset it to length 1. Because, otherwise, by the roundoff errors introduced by the arithmetic operations, it will not keep the legnth 1 but degenerate. When using double precision floating point variables, this normalization needs to be done only occasionally, eg. every few thousands of samples.

// f is the frequency
double angle = 2.0*Pi*f / sampleRate;
double phaseAdd_re = cos(angle);
double phaseAdd_im = sin(angle);
double phase_re = 1.0;
double phase_im = 0.0;

void cmul(double a1, double b1, double a2, double b2, double* a3, double* b3)
{
double t_re, t_im;

t_re = a1*a2-b1*b2;
t_im = a1*b2+b1*a2;

*a3 = t_re;
*b3 = t_im;
}

void subprocess(int samples)
{
int i;
for(i=0; i<samples;i++)
{
out1[i] = phase_re; // cos wave
out2[i] = phase_im; // sin wave
cmul(phase_re,phase_im,phaseAdd_re,phaseAdd_im,&phase_re,&phase_im);
}
// normalize the phasor
double t = 1.0/sqrt(phase_re*phase_re + phase_im*phase_im);
phase_re = t * phase_re;
phase_im = t * phase_im;
}

This is just to get a rough idea how complex number arithmetic can be used for synthesis. But complex numbers can do much more, they can help us compute the DSF formula.

Deriving the Classic DSF Formula

We will now examine how to rewrite the target formula

s(t) = ∑_k=0..Nw^k sin(2p(f_c + k f_m) t / sampletime)

by the help of complex numbers such that we get a closed-form expression, ie. the sum will vanish (and by that, there will be no need to sum up all the sines when computing the samples).
In fact, we can calculate s(t) by calculating the following complex formula:

c(t) = ∑_k=0..Na * b^k,

where

a = [cos(2p f_c t / sampletime), sin(2p f_c t / sampletime)]
b = [w*cos(2p f_m t / sampletime), w*sin(2p f_m t / sampletime)]

The samples specified by c(t) are complex numbers. Or, to put it another way, c(t) doesn't specify a series of floating point numbers like s(t) but rather a series of ordered pairs of floating point numbers.
Anyway, s(t) can be easily obtained from c(t) since it is true that

s(t) = IM(c(t)),

which means that we can obtain s(t) by picking the second number from each sample couple generated by c(t). To see that s(t) = IM(c(t)), please observe that

c(t) = ∑_k=0..Na * b^k
=  ∑_k=0..N[cos(2p f_c t / sampletime), sin(2p f_c t / sampletime)] * [w*cos(2p f_m t / sampletime), w*sin(2p f_m t / sampletime)]^k
=  ∑_k=0..N[cos(2p f_c t / sampletime), sin(2p f_c t / sampletime)] *[w^k *cos(k * 2p f_m t / sampletime), w^k *sin(k * 2p f_m t / sampletime)]
=  ∑_k=0..N[w^k cos(2p(f_c + k f_m) t / sampletime), w^k sin(2p(f_c + k f_m) t / sampletime)]

= [∑_k=0..Nw^k cos(2p(f_c + k f_m) t / sampletime), ∑_k=0..Nw^k sin(2p(f_c + k f_m) t / sampletime)]

So we can get s(t) by computing c(t) and picking its imaginary part.
c(t), again, resembles a prominent geometric series and can be simplified on complex numbers just like it's done on real numbers:

c(t) = ∑_k=0..Na * b^k = a * (1 - b^N+1) / (1 -b)

Woosh, the sum is gone! So we can simplify s(t) like this:

s(t)

= IM( c(t) )
= IM( a * (1 - b^N+1) / (1 -b))

which is, since IM(x*y) = RE(x)*IM(y)+IM(x)*RE(y) (this follows from the definition of complex multiplication)

= RE(a)*IM((1 - b^N+1) / (1 - b)) + IM(a)*RE((1 - b^N+1) / (1 - b))

which is, since RE(x/y) = (RE(x)*RE(y)+(IM(x)*IM(y)) / (RE(y)*RE(y)+IM(y)*IM(y)) and IM(x/y) = (IM(x)*RE(y)-RE(x)*IM(y)) / (RE(y)*RE(y)+IM(y)*IM(y))

= ( RE(a)*(IM(1-b^N+1)*RE(1-b)-RE(1-b^N+1)*IM(1-b))
+ IM(a)*(RE(1-b^N+1)*RE(1-b)+IM(1-b^N+1)*IM(1-b)) )
/ (RE(1-b)*RE(1-b)+IM(1-b)*IM(1-b))

= ( cos(u)*((-w^N+1*sin((N+1)*v))*(1-w*cos(v))+(1-w^N+1*cos((N+1)*v))*w*sin(v))
+ sin(u)*((1-w^N+1*cos((N+1)*v))*(1-w*cos(v))+w^N+1*sin((N+1)*v)*w*sin(v)) )
/ ((1-w*cos(v))²+(w*sin(v))²)

where u = 2p f_c t / sampletime and v = 2p f_m t / sampletime. Applying further transformations by trigonometric formulas, this becomes after a while:

= { ( w*sin(v-u) + sin(u) ) + w^N+1* (w* sin(u + N*v) - sin(u + (N+1)*v)) } / ( 1 + w²- 2*w*cos(v) )

That's it! In summary, we have just derived the classic DSF formula proposed by Moorer, which is

Classic DSF Synthesis

s(t) = ∑_k=0..Nw^k sin(u + k*v)

= { ( w*sin(v-u) + sin(u) ) + w^N+1* (w* sin(u + N*v) - sin(u + (N+1)*v)) } / ( 1 + w²- 2*w*cos(v) )

where u = 2p f_c t / sampletime and v = 2p f_m t / sampletime.

Classic DSF Synthesis vs. Complex DSF Synthesis

Actually, we have just derived two methods to perform DSF Synthesis. On the one hand, we can use the classic DSF formula to compute

s(t) = ∑_k=0..Nw^k sin(2p(f_c + k f_m) t / sampletime).

On the other hand, why not compute the complex formula c(t) instead of s(t), using complex arithmetic? After all, computing the complex c(t) samples will give us two samples with each sample step, which can be used for stereo output. As shown before, we have
c(t) = ∑_k=0..N[cos(u), sin(u)] * [w*cos(v), w*sin(v)]^k

= [∑_k=0..N w^k cos(u + k*v), ∑_k=0..Nw^k sin(u + k*v)]

where where u = 2p f_c t / sampletime and v = 2p f_m t / sampletime.
So computing c(t) gives us a complex signal, the imaginary part of which is the sum of sines like it is obtained by classic DSF Synthesis, and the real part of which is a sum of cosines. Sending one signal to the left stereo channel and the other to the right channel, we get a quadrature stereo signal, ie. the harmonics of the signal are pairwise out of phase by 90 degrees (since cos(x) = sin(90° - x)).

Complex DSF Synthesis

c(t) = [∑_k=0..Nw^k cos(u + k*v), ∑_k=0..Nw^k sin(u + k*v)]

= [cos(u), sin(u)] * (1 - [w*cos(v), w*sin(v)]^N+1)/(1 - [w*cos(v), w*sin(v)])

where u = 2p f_c t / sampletime and v = 2p f_m t / sampletime.

This is the synthesis technique used in Tetra and Sonitarium. The computation can be made much faster if the expensive sin and cos functions are not called during each sample step but only when a new key is pressed. At that time, phasors can be initialized by using the standard sin and cos functions, and after that, rotating phasors can be used for fast sine and cosine calculation, as has been described before for the simple osciallator example. This optimization is used in Sonitarium.
By the way, when using the formulas presented so far, the harmonics will fall off to the right side of the fundamental. If you want them to fall off to the left side, just let the complex result of the expression "(1 - [w*cos(v), w*sin(v)]^N+1)/(1 - [w*cos(v), w*sin(v)])" be computed as usual, then flip the sign of its imaginary part. This way you will get another complex signal by a simple sign flip. You can then mix it with the first signal or whatever.

Normalization

On issue not mentioned yet is the normalization of the signal. After all, the signal would get much too loud with all those sines and cosines summed up. A simple estimation that has proven useful for both classic and complex DSF synthesis is that the volume of the computed signal is at most
∑_k=0..Nw^k, which is equal to (1 - w^N+1)/(1-w), where w is the weight factor of the harmonics. Divide each sample by that value, and you're done.

Thanks for reading,
Burkhard (moppelsynths@verklagekasper.de)

Literature

James A. Moorer, "The Synthesis of Complex Audio Spectra by Means of Discrete Summation Formulas", http://www.jamminpower.com/main/articles.html, 1976.

Tim Stilson, Julius Smith, "Alias-Free Digital Synthesis of Classic Analog Waveforms", http://ccrma.stanford.edu/~stilti/papers, 1996.

Back to "Moppelsynths"