Definition 6.1. The Fourier transform of a function \(f:\mR \to \mR ^{n_1\times n_2}\) is defined by
\[ \mF (f)=\omega \mapsto \int _\mR \e ^{-i\omega t}f(t)\,dt. \]
We will also use the notation \(\hat {f}\) for \(\mF (f)\).
Remark 6.2. The function \(f\) can be recovered from its Fourier transform \(\hat {f}\) through the inverse Fourier transform,
\[ \mF ^{-1}g=t\mapsto \frac {1}{2\pi }\int _\mR \e ^{i\omega t}g(\omega )\,d\omega , \]
i.e. we have
\(\seteqnumber{0}{6.}{0}\)\begin{equation} \label {eq:Finverse} f(t)=\frac {1}{2\pi }\int _\mR \int _\mR \e ^{i\omega t} \e ^{-i\omega \theta }f(\theta )\,d\theta \,d\omega . \end{equation}
Remark 6.3. The above version of the Fourier transform is the non-unitary angular frequency one. Other normalization conventions are in use; all such Fourier transforms have essentially the same properties, but various multiplicative constants are different.
Remark 6.4. If time \(t\) is measured in seconds, then \(\omega \) has the dimension radians per second. If we define
\[ \xi =\frac {\omega }{2\pi }, \]
then \(\xi \) is the frequency in Hertz which is the dimension more commonly used in applications. Using \(\xi \) rather than \(\omega \) in the Fourier transform gives
\[ \mF (f)=\xi \mapsto \int _\mR \e ^{-i2\pi \xi t}f(t)\,dt, \]
which is another commonly used form of the Fourier transform (the ordinary frequency unitary one).
Remark 6.5. If \(f(t)=0\) for \(t<0\) (and \(\int _\mR |f(t)|\,dt<\infty \)), then both the Laplace transform and the Fourier transform of \(f\) are well-defined and we have \(\mL (f)(i\omega )=\mF (f)(\omega )\). This relation is our main reason for using the non-unitary angular frequency version of the Fourier transform (since then the relation with the Laplace transform is simplest).
Definition 6.6. The frequency response of (3.1) is the \(\mC ^{p\times m}\) valued function
\[ F(\omega )=C(i\omega I-A)^{-1}B+D, \]
which we note equals \(G(i\omega )\) where \(G\) is the transfer function.
In terms of the Fourier transform we have
\[ \hat {x}(\omega )=(i\omega I-A)^{-1}B\hat {u}(\omega ),\qquad \hat {y}(\omega )=F(\omega )\hat {u}(\omega ). \]
Remark 6.7. Since \(A\), \(B\), \(C\) and \(D\) are real-valued we have
\[ F(\omega )^*=\left [C(i\omega I-A)^{-1}B+D\right ]^* =\left [C(-i\omega I-A)^{-1}B+D\right ]^T =F(-\omega )^T. \]
In particular, if \(m=p=1\), we have \(\overline {F(\omega )}=F(-\omega )\). This allows us to restrict to \(\omega \geq 0\) since no information is lost by ignoring \(\omega <0\).
Remark 6.8. The interpretation from Remark 5.5 in terms of complex exponentials is especially attractive in the case where \(s=i\omega \) and we therefore have the frequency response instead of the transfer function. It becomes even more intuitive when we replace the complex exponential by a sinusoid. Using that
\[ \sin (\omega t)=\frac {\e ^{i\omega t}-\e ^{-i\omega t}}{2i}, \]
and linearity we obtain that the output for input \(u(t)=\sin (\omega t)\) (when \(m=p=1\) and \(x^0\) has the appropriate value) is
\[ \frac {F(\omega )\e ^{i\omega t}-F(-\omega )\e ^{-i\omega t}}{2i}. \]
Using Remark 6.7, this equals
\[ \frac {F(\omega )\e ^{i\omega t}-\overline {F(\omega )}\e ^{-i\omega t}}{2i}, \]
which equals
\[ \imag \left (F(\omega )\e ^{i\omega t}\right ). \]
We write the complex number \(F(\omega )\) in polar coordinates as
\[ F(\omega )=|F(\omega )|\,\e ^{i\arg (F(\omega ))}, \]
so that the output equals
\[ \imag \left (|F(\omega )|\,\e ^{i\arg (F(\omega ))}\e ^{i\omega t}\right ) =|F(\omega )|\imag \left (\e ^{i\left [\omega t+\arg (F(\omega ))\right ]}\right ) =|F(\omega )|\sin \left (\omega t+\arg (F(\omega ))\right ). \]
Hence when \(u(t)=\sin (\omega t)\), the output is (when \(m=p=1\) and \(x^0\) has the appropriate value)
\[ y(t)=|F(\omega )|\,\sin (\omega t+\arg (F(\omega ))), \]
so the input is multiplied by \(|F(\omega )|\) and shifted by \(\arg (F(\omega ))\). Since the modulus and argument of a complex number uniquely determine the complex number, this relation uniquely specifies the frequency response.
Remark 6.9. Since the frequency response is a complex-valued function of a real variable, it is not straightforward to plot (unlike for example the step response, which is a real-valued function of a real variable). There are essentially two common ways to plot it. The first is to treat \(\omega \) as a parameter and plot the curve \(\omega \mapsto F(\omega )\) in the complex plane; this is called the Nyquist diagram. The second is to plot \(|F(\omega )|\) and \(\arg (F(\omega ))\) separately as functions of \(\omega \) (since these are real-valued functions of a real variable, this is straightforward); it is common to use a logarithmic scale for \(\omega \) in both plots and in the magnitude plot to use a logarithmic scale on both axes; the combination of these plots are called the Bode diagram and separately they are called the Bode magnitude diagram and the Bode phase diagram. We will mainly be interested in the Bode magnitude diagram.
The frequency response of the first order system
\[ T\dot {y}(t)+y(t)=Ku(t), \]
is
\[ F(\omega )=\frac {K}{Ti\omega +1}. \]
We plot this frequency response in Figures 6.1a (the Nyquist diagram) and 6.1b (the Bode diagram); the shape does not depend on the parameters \(K\) and \(T\) (through scaling as for the step response). In Figures 6.2a and 6.2b we illustrate how the Bode magnitude diagram depend on the steady state gain \(K\) and the time constant \(T\).
The frequency response of the second order system
\[ T^2\ddot {y}(t)+2\zeta T\dot {y}(t)+y(t)=Ku(t), \]
is
\[ F(\omega )=\frac {K}{-T^2\omega ^2+2\zeta Ti\omega +1}=\frac {K\omega _0^2}{-\omega ^2+2\zeta \omega _0i\omega +\omega _0^2}. \]
The Nyquist diagrams for relevant values of \(\zeta \) are given in Figure 6.3a. The bode diagrams for relevant values of \(\zeta \) are given in Figure 6.3b. When \(\zeta \in [0,\frac {1}{\sqrt {2}})\phantom {]}\), \(|F(\omega )|\) has a maximum at \(\omega _{\rm max}:=\omega _0\sqrt {1-2\zeta ^2}\). Instead, when \(\zeta \geq \frac {1}{\sqrt {2}}\), \(|F(\omega )|\) is monotonically decreasing (for \(\omega >0\)). So whereas \(\zeta =1\) is the value for which a qualitative difference in the step response occurs, it is for \(\zeta =\frac {1}{\sqrt {2}}\) that a qualitative difference in the frequency response occurs.
The frequency response of the second order system
\[ T^2\ddot {q}(t)+2\zeta T\dot {q}(t)+q(t)=Ku(t),\qquad y(t)=\dot {q}(t), \]
is
\[ F(\omega )=\frac {Ki\omega }{-T^2\omega ^2+2\zeta Ti\omega +1}=\frac {K\omega _0^2i\omega }{-\omega ^2+2\zeta \omega _0i\omega +\omega _0^2}. \]
The Nyquist diagram is in Figure 6.4a and the Bode diagram in Figure 6.4b.
A microphone operates as follows. Sound is a pressure wave in air. This causes a diaphragm (which is essentially a mass-spring-damper system) to vibrate. This vibration is then converted into an electrical signal. This is done either through an inductor based conversion (this is done in what are called dynamic microphones) or through a capacitor based conversion (this is done in what are called condenser microphones). Behind the diaphragm there is a small cavity (called the air gap) which is connected by tubes to a larger cavity (called the back cavity).
In Figure 6.5 we give the Bode magnitude diagrams of several microphones as provided by their manufacturers. The hearing range of humans is 20 Hz to 20,000 Hz (which is why this is the range on the horizontal axis). Typically for a microphone we want the Bode magnitude diagram to be approximately constant over this frequency range: this means that all frequencies are equally represented in the recording made through the microphone as they are in the live performance.
To obtain a model of a microphone, it seems easiest to think in terms of energy. There are three sources of potential energy in a capacitor-based microphone:
• The potential energy in the diaphragm (i.e. the spring) is \(\frac {1}{2k}F^2\) where \(k\) is the spring constant and \(F\) is the force across the spring.
• The potential energy due to pressure in the cavity is \(\frac {C_c}{2}p^2\) where \(C_c\) is a constant (related mainly to the volume of the cavity) and \(p\) is the pressure in the cavity.
• The potential energy in the capacitor is \(\frac {C}{2}V^2\) where \(C\) is the capacitance and \(V\) is the voltage across the capacitor.
Additionally there are two sources of kinetic energy:
• The kinetic energy in the diaphragam (i.e. the mass) is \(\frac {m}{2}v^2\) where \(m\) is the mass and \(v\) is the velocity.
• The kinetic energy of the particles in the cavity is \(\frac {L_c}{2}q^2\) where \(L_c\) is a constant and \(q\) is the volumetric flow rate.
We have the following relation between the various variables
\[ F=Ap,\quad Av=q,\qquad F=\alpha V, \]
where \(A\) is the area of the diaphragm and \(\alpha \) is a constant. It follows that the total energy is \(1/2\) times
\[ \left (\frac {1}{k}+\frac {C_c}{A^2}+\frac {C}{\alpha ^2}\right )F^2 +(m+A^2L_c)v^2. \]
From the above it seems reasonable (and this can indeed be justified) that the overall system is of mass-spring-damper type where the “mass” is \(m+A^2L_c\) (i.e. there is a contribution of the mechanical mass and of the mass of the particles) and the “spring constant” is \(k+\frac {A^2}{C_c}+\frac {\alpha ^2}{C}\) (i.e. there are contributions from the mechanical spring, from the pressure in the cavity and from the capacitance of the capacitor). The “damping constant” equals \(d+A^2R_c\) where \(d\) is the damping constant for the diaphragm and \(R_c\) is due to energy loss by the air particles in the cavity. The input to the system is pressure caused by sound, which equivalently is a force and the output is the voltage across the capacitor which is proportional to the force across the spring which is proportional to the displacement of the diaphragm. Therefore we have a mass-spring-damper system with force input and displacement output. This we saw before is a second order (low pass) system and the transfer function is a constant times
\[ \frac {1}{(m+A^2L_c)s^2+(d+A^2R_c)s+\left (k+\frac {A^2}{C_c}+\frac {\alpha ^2}{C}\right )}. \]
As we see in Figure 6.3b, the frequency response of such a system is indeed (relatively) flat for low frequencies. To obtain a good microphone, this flat part should extend to 20,000 Hz.
For an inductor-based microphone the energy stored in the inductor instead is \(LI^2\) where \(L\) is the inductance and \(I\) is the current through the inductor. We have \(\alpha v=I\) so that the total energy now is
\[ \left (\frac {1}{k}+\frac {C_c}{A^2}\right )F^2 +\left (m+A^2L_c+\frac {L}{\alpha ^2}\right )v^2. \]
The output is the current through the inductor (or actually: the voltage across a resistor parallel to the inductor which by Ohm’s law is a constant times the current) which is proportional to the velocity. So now we effectively have a mass-spring-damper system with force input and velocity output. This we saw before is a second order (band pass) system and the transfer function is a constant times
\[ \frac {s}{\left (m+A^2L_c+\frac {L}{\alpha ^2}\right )s^2+(d+A^2R_c)s+\left (k+\frac {A^2}{C_c}\right )}. \]
As we see in Figure 6.4b, the frequency response of such a system is indeed (relatively) flat for an intermediate range of frequencies. To obtain a good microphone, this flat part should extend from 20 Hz to 20,000 Hz.
In Section 6.2 we saw that we want to study (second order) systems which have a frequency response which is “flat” for certain frequencies. The general name for such systems is “filters”.
The second order (low-pass) system with \(\zeta =\frac {1}{\sqrt {2}}\) and therefore transfer function
\[ G(s)=\frac {K\omega _0^2}{s^2+\sqrt {2}\omega _0s+\omega _0^2}, \]
is such that the magnitude of the frequency response is \(\frac {1}{\sqrt {2}}\) times the maximum magnitude (reached at frequency zero) at \(\omega _0\). In this context \(\omega _0\) is then called the cut-off frequency. The idea is that below that frequency the magnitude of the frequency response is relatively flat and that above it this is no longer the case.
In the microphone application (for the capacitor-based case since that results in a second order (low-pass) system), this cut-off frequency would be chosen around the upper limit of the hearing range, i.e. 20,000 Hz.
When we want to pick out frequencies in a given range \([\omega _L,\omega _H]\), then we can use a band-pass filter
\[ G(s)=\frac {(\omega _H-\omega _L)s}{s^2+(\omega _H-\omega _L)s+\omega _L\omega _H}. \]
The quantity \(\omega _H-\omega _L\) is called the bandwidth. The frequencies \(\omega _L\) and \(\omega _H\) are such that the magnitude of the frequency response is \(\frac {1}{\sqrt {2}}\) times the maximum magnitude at these frequencies. We note that this transfer function is of the form
\[ \frac {2\zeta \omega _0s}{s^2+2\zeta \omega _0s+\omega _0^2}, \]
where
\[ \omega _0=\sqrt {\omega _L\omega _H},\qquad \zeta =\frac {\omega _H-\omega _L}{2\sqrt {\omega _L\omega _H}}. \]
See Figure 6.6 for the Bode diagrams for various bandwidths.
In the microphone application (for the inductor-based case since that results in a second order (band-pass) system), we would choose \(\omega _L\) around 20 Hz and \(\omega _H\) around 20,000 Hz so that the pass band \([\omega _L,\omega _H]\) corresponds to the human hearing range. This results in \(\omega _0\approx 632\) Hz and \(\zeta \approx 15.8\) (i.e. very overdamped).
Remark 6.10. The formula \(\omega _0=\sqrt {\omega _L\omega _H}\) says that \(\omega _0\) equals the geometric mean of \(\omega _L\) and \(\omega _H\). Taking logarithms we have
\[ \log (\omega _0)=\frac {\log (\omega _L)+\log (\omega _H)}{2}, \]
so that on a logarithmic scale we have the (usual) arithmetic mean.
Different colors of light are sinusoids of different frequencies (see Figure 6.7). Each object acts as a light filter: it filters out all frequencies in the incident light except for the frequency corresponding to the color we perceive the object as having. Normally the input is white light which contains all frequencies. Therefore illumination with white light allows us to see objects of all different colors. Lasers produce light of a particular color (i.e. of a particular frequency). If we therefore use a laser to illuminate an object (i.e. our input is \(\sin (\omega t)\) for a specific frequency \(\omega \) determined by the laser color) then we can only see objects of the corresponding color (objects of a different color appear black).
In Figure 6.8 we plot the frequency response from the disturbance \(v_e\) to each of the state variables \(v_1\), \(v_2\) and \(T\). Note that we already calculated the corresponding transfer functions in Section 5.2. We also indicated the approximate location of the peak of each of the frequency responses, namely
\(\seteqnumber{0}{6.}{1}\)\begin{equation} \label {eq:tapedrivefreq} \sqrt {\frac {k}{M_1}+\frac {k}{M_2}}, \end{equation}
which is the natural frequency of an undamped mass-spring system with spring constant \(k\) and mass
\[ \frac {1}{\frac {1}{M_1}+\frac {1}{M_2}}, \]
(you might recognize this way of “adding” the masses as the same as adding parallel resistances).
From the Bode magnitude diagram we can see at which frequencies a periodic disturbance of the form \(v_e(t)=\sin (\omega t)\) is most detrimental, namely for frequencies around the peak, so close to the frequency (6.2).
We consider the fixed structure suspension system. In Figure 6.9 we give the Bode magnitude diagram from the road velocity to each of the three performance outputs. Here we have chosen values for the various physical parameters as in the introduction. On the horizontal axis the frequency is given in terms of Hertz rather than radians per second (see Remark 6.4). An input of \(\sin (\omega t)\) corresponds to a sinusoidal road velocity, and therefore a sinusoidal road profile, namely \(\frac {-1}{\omega }\cos (\omega t)\) as a function of time \(t\). Noting that \(t=x/V\) where \(V\) is the horizontal velocity of the car (i.e. the speed we are travelling at), the corresponding road profile as a function of position \(x\) is \(\frac {-1}{\omega }\cos \left (\frac {\omega }{V}x\right )\). This can be thought of as representing an infinite sequence of equally spaced speedbumps. If we write \(\omega _0=\frac {\omega }{V}\), then the road profile is \(\frac {-1}{\omega _0V}\cos (\omega _0 x)\) and therefore \(\omega _0\) is related to the distancing of the speed bumps and \(\omega =\omega _0 V\) depends on both this spacing and on the speed of the car. So to find the relevant value of \(\omega \) in Figure 6.9 we have to consider both the distancing of the speed bumps and the speed of the car.
In Figure 6.9 we see that the Bode diagram has two “peaks” which correspond roughly to complex conjugate pairs of eigenvalues of the matrix \(A\) in the input-state-output system (or equivalently: poles of the transfer function). There is a clear peak around 2 Hz which in the automotive industry is called vehicle heave. This is because it is essentially due to the sprung mass and spring: we note that \(\frac {1}{2\pi }\sqrt {\frac {k_s}{m_s}}\approx 2.5\) Hz, which is the natural frequency of the subsystem consisting of only the sprung mass and spring. There is also a notable peak around 10 Hz which in the automotive industry is called wheel hop (or tyre hop). We note that \(\frac {1}{2\pi }\sqrt {\frac {k_{us}+k_s}{m_{us}}}\approx 12\) Hz, which is the natural frequency of the subsystem consisting of only the unsprung mass and both springs.
Assume a given spacing of speedbumps and consider what happens if we increase the speed \(V\) of the car (so that \(\omega =\omega _0 V\) increases). As we increase the speed up to where vehicle heave occurs, the amplification factor \(|F(\omega )|\) increases. This will make driving over these speedbumps less comfortable. Once we pass the point where vehicle heave occurs, the amplification factor \(|F(\omega )|\) decreases again. However, for handling, it starts to increase again around the frequency where wheel hop occurs (and after that it decreases again).
From the transfer functions computed in Section 5.3, we see that all three frequency responses are zero in zero, which we indeed see in Figure 6.9. The handling transfer function behaves like \(s^{-1}\) near infinity, the suspension stroke transfer function like \(s^{-3}\) and the comfort transfer function like \(s^{-2}\). In Figure 6.9 (since we have a loglog scale), we can see these powers \(-1\), \(-3\) and \(-2\) respectively in the slope for large frequencies.
Consider the (scaled) frequency response
\[ F(\omega )=\frac {1}{-\omega ^2+2\zeta i\omega +1}, \]
of a second order (band-pass) system. Show that when \(\zeta \in [0,\frac {1}{\sqrt {2}})\phantom {]}\), \(|F(\omega )|\) has a maximum at \(\omega _{\rm max}:=\sqrt {1-2\zeta ^2}\) and that when \(\zeta \geq \frac {1}{\sqrt {2}}\), \(|F(\omega )|\) is monotonically decreasing for \(\omega >0\).
Solution. We have
\[ |F(\omega )|^2=\frac {1}{(1-\omega ^2)^2+4\zeta ^2\omega ^2}. \]
We consider the denominator \(g(\omega ):=(1-\omega ^2)^2+4\zeta ^2\omega ^2\). We have
\[ g'(\omega )=-4\omega (1-\omega ^2)+8\zeta ^2\omega . \]
Setting \(g'\) equal to zero gives \(\omega =0\) or \(1-\omega ^2=2\zeta ^2\), i.e. \(\omega =\pm \sqrt {1-2\zeta ^2}\). For \(\pm \sqrt {1-2\zeta ^2}\) to be real and nonzero we require \(\zeta \in [0,\frac {1}{\sqrt {2}}\!\phantom {]})\). We see that at \(\omega _{\max }:=\sqrt {1-2\zeta ^2}\) the function \(g'\) changes sign from negative to positive, so \(g\) has a minimum at \(\omega _{\max }\). It follows that \(|F|^2\) and therefore \(|F|\) has a maximum at \(\omega _{\max }\). If \(\zeta \geq \frac {1}{\sqrt {2}}\), then \(g'>0\) on \((0,\infty )\) so that \(g\) is strictly increasing and therefore \(|F|^2\) and \(|F|\) are strictly decreasing. □