Additive synthesis

{{Listen

| filename = Additive_synthesis_bell.ogg

| title = Additive synthesis example

| description = A bell-like sound generated by additive synthesis of 21 inharmonic partials

| pos = right

}}

Additive synthesis is a sound synthesis technique that creates timbre by adding sine waves together.<ref name="JOS_Additive">

{{cite web

| author = Julius O. Smith III

| title = Additive Synthesis (Early Sinusoidal Modeling)

| url = https://ccrma.stanford.edu/~jos/sasp/Additive_Synthesis_Early_Sinusoidal.html

| quote = The term "additive synthesis" refers to sound being formed by adding together many sinusoidal components

| access-date = 14 January 2012

}}</ref><ref>

{{cite journal

| author = Gordon Reid

| title = Synth Secrets, Part 14: An Introduction To Additive Synthesis

| url = http://www.soundonsound.com/sos/jun00/articles/synthsec.htm

| journal = Sound on Sound

| issue = January 2000

| access-date = 14 January 2012

}}</ref>

The timbre of musical instruments can be considered in the light of Fourier theory to consist of multiple harmonic or inharmonic partials or overtones. Each partial is a sine wave of different frequency and amplitude that swells and decays over time due to modulation from an ADSR envelope or low-frequency oscillator.

Additive synthesis most directly generates sound by adding the output of multiple sine wave generators. Alternative implementations may use pre-computed wavetables or the inverse fast Fourier transform.

Explanation

The sounds that are heard in everyday life are not characterized by a single frequency. Instead, they consist of a sum of pure sine frequencies, each one at a different amplitude. When humans hear these frequencies simultaneously, we can recognize the sound. This is true for both "non-musical" sounds (e.g. water splashing, leaves rustling, etc.) and for "musical sounds" (e.g. a piano note, a bird's tweet, etc.). This set of parameters (frequencies, their relative amplitudes, and how the relative amplitudes change over time) are encapsulated by the timbre of the sound. Fourier analysis is the technique that is used to determine these exact timbre parameters from an overall sound signal; conversely, the resulting set of frequencies and amplitudes is called the Fourier series of the original sound signal.

In the case of a musical note, the lowest frequency of its timbre is designated as the sound's fundamental frequency. For simplicity, we often say that the note is playing at that fundamental frequency (e.g. "middle C is 261.6 Hz"), even though the sound of that note consists of many other frequencies as well. The set of the remaining frequencies is called the overtones (or the harmonics, if their frequencies are integer multiples of the fundamental frequency) of the sound. In other words, the fundamental frequency alone is responsible for the pitch of the note, while the overtones define the timbre of the sound. The overtones of a piano playing middle C will be quite different from the overtones of a violin playing the same note; that's what allows us to differentiate the sounds of the two instruments. There are even subtle differences in timbre between different versions of the same instrument (for example, an upright piano vs. a grand piano).

Additive synthesis aims to exploit this property of sound in order to construct timbre from the ground up. By adding together pure frequencies (sine waves) of varying frequencies and amplitudes, we can precisely define the timbre of the sound that we want to create.

Definitions

250px|thumb|right|Schematic diagram of additive synthesis. The inputs to the oscillators are frequencies and amplitudes .

Harmonic additive synthesis is closely related to the concept of a Fourier series which is a way of expressing a periodic function as the sum of sinusoidal functions with frequencies equal to integer multiples of a common fundamental frequency. These sinusoids are called harmonics, overtones, or generally, partials. In general, a Fourier series contains an infinite number of sinusoidal components, with no upper limit to the frequency of the sinusoidal functions and includes a DC component (one with frequency of 0 Hz). Frequencies outside of the human audible range can be omitted in additive synthesis. As a result, only a finite number of sinusoidal terms with frequencies that lie within the audible range are modeled in additive synthesis.

A waveform or function is said to be periodic if

for all and for some period .

The Fourier series of a periodic function is mathematically expressed as:

<math> \begin{align}

y(t) &= \frac{a_0}{2} + \sum_{k=1}^{\infty} \left[ a_k \cos(2 \pi k f_0 t ) - b_k \sin(2 \pi k f_0 t ) \right] \\

&= \frac{a_0}{2} + \sum_{k=1}^{\infty} r_k \cos\left(2 \pi k f_0 t + \phi_k \right) \\

\end{align} </math>

where

is the fundamental frequency of the waveform and is equal to the reciprocal of the period,
is the amplitude of the th harmonic,
is the phase offset of the th harmonic. atan2 is the four-quadrant arctangent function,

Being inaudible, the DC component, , and all components with frequencies higher than some finite limit, , are omitted in the following expressions of additive synthesis.

Harmonic form

The simplest harmonic additive synthesis can be mathematically expressed as:

where is the synthesis output, , , and are the amplitude, frequency, and the phase offset, respectively, of the th harmonic partial of a total of harmonic partials, and is the fundamental frequency of the waveform and the frequency of the musical note.

Time-dependent amplitudes

{|class=wikitable align=right width=420px

| 280px

| Example of harmonic additive synthesis in which each harmonic has a time-dependent amplitude. The fundamental frequency is 440 Hz.

noicon|150px

Problems listening to this file? See Media help

More generally, the amplitude of each harmonic can be prescribed as a function of time, , in which case the synthesis output is

Each envelope should vary slowly relative to the frequency spacing between adjacent sinusoids. The bandwidth of should be significantly less than .

Inharmonic form

Additive synthesis can also produce inharmonic sounds (which are aperiodic waveforms) in which the individual overtones need not have frequencies that are integer multiples of some common fundamental frequency.<ref name=smith05>

{{Cite book

| last1 = Smith III | first1 = Julius O.

| last2 = Serra | first2 = Xavier

| year = 2005

| chapter = Additive Synthesis

| chapter-url = https://ccrma.stanford.edu/~jos/parshl/Additive_Synthesis.html

| title = PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation

| url = https://ccrma.stanford.edu/~jos/parshl/

| series = Proceedings of the International Computer Music Conference (ICMC-87, Tokyo), Computer Music Association, 1987

| publisher = CCRMA, Department of Music, Stanford University

| access-date = 11 January 2015

}} (online reprint)</ref><ref name=smith11>

{{Cite book

| last = Smith III | first = Julius O.

| year = 2011

| chapter = Additive Synthesis (Early Sinusoidal Modeling)

| chapter-url = https://ccrma.stanford.edu/~jos/sasp/Additive_Synthesis_Early_Sinusoidal.html

| title = Spectral Audio Signal Processing

| url = https://ccrma.stanford.edu/~jos/sasp/

| publisher = CCRMA, Department of Music, Stanford University

| isbn = 978-0-9745607-3-1

| access-date = 9 January 2012

}}</ref> While many conventional musical instruments have harmonic partials (e.g. an oboe), some have inharmonic partials (e.g. bells). Inharmonic additive synthesis can be described as

where is the constant frequency of th partial.

{|class=wikitable align=right width=420px

| 280px

| Example of inharmonic additive synthesis in which both the amplitude and frequency of each partial are time-dependent.

noicon|150px

Problems listening to this file? See Media help

Time-dependent frequencies

In the general case, the instantaneous frequency of a sinusoid is the derivative (with respect to time) of the argument of the sine or cosine function. If this frequency is represented in hertz, rather than in angular frequency form, then this derivative is divided by . This is the case whether the partial is harmonic or inharmonic and whether its frequency is constant or time-varying.

In the most general form, the frequency of each non-harmonic partial is a non-negative function of time, , yielding

Broader definitions

Additive synthesis more broadly may mean sound synthesis techniques that sum simple elements to create more complex timbres, even when the elements are not sine waves.<ref>

{{cite book

| last = Roads | first = Curtis

| author-link = Curtis Roads

| year = 1995

| title = The Computer Music Tutorial

| url=https://archive.org/details/computermusictut00road

| url-access=limited

| publisher = MIT Press

| isbn = 978-0-262-68082-0

| page = 134

}}</ref><ref name="MooreFoundationsCM">

{{cite book

| last = Moore | first = F. Richard

| year = 1995

| title = Foundations of Computer Music

| publisher = Prentice Hall

| isbn = 978-0-262-68082-0

| page = 16

}}

</ref> For example, F. Richard Moore listed additive synthesis as one of the "four basic categories" of sound synthesis alongside subtractive synthesis, nonlinear synthesis, and physical modeling. In this broad sense, pipe organs, which also have pipes producing non-sinusoidal waveforms, can be considered as a variant form of additive synthesizers. Summation of principal components and Walsh functions have also been classified as additive synthesis.<ref>

{{cite book

| last = Roads | first = Curtis

| author-link = Curtis Roads

| year = 1995

| title = The Computer Music Tutorial

| url=https://archive.org/details/computermusictut00road

| url-access=limited

| publisher = MIT Press

| isbn = 978-0-262-68082-0

| pages = 150–153

}}</ref>

Implementation methods

Modern-day implementations of additive synthesis are mainly digital. (See section Discrete-time equations for the underlying discrete-time theory)

Oscillator bank synthesis

Additive synthesis can be implemented using a bank of sinusoidal oscillators, one for each partial.<!--

{{cite web

| title = Additive Synthesis (Early Sinusoidal Modeling)

| author = Julius O. Smith III

| url = https://ccrma.stanford.edu/~jos/sasp/Additive_Synthesis_Early_Sinusoidal.html

| access-date = 14 January 2012

}}</ref> -->

Wavetable synthesis

In the case of harmonic, quasi-periodic musical tones, wavetable synthesis can be as general as time-varying additive synthesis, but requires less computation during synthesis.<ref name="Wavetable Synthesis 101">

{{cite web

|author = Robert Bristow-Johnson

|date = November 1996

|title = Wavetable Synthesis 101, A Fundamental Perspective

|url = http://www.musicdsp.org/files/Wavetable-101.pdf

|access-date = 21 May 2005

|archive-url = https://web.archive.org/web/20130615202748/http://musicdsp.org/files/Wavetable-101.pdf

|archive-date = 15 June 2013

|df = dmy-all

}}

</ref><ref name="Wavetable Matching Synthesis of Dynamic Instruments with Genetic Algorithms">

{{cite journal

| author = Andrew Horner

| date = November 1995

| title = Wavetable Matching Synthesis of Dynamic Instruments with Genetic Algorithms

| journal = Journal of the Audio Engineering Society

| volume = 43 | issue = 11 | pages = 916–931

| url = http://www.aes.org/e-lib/browse.cfm?elib=7923

}}</ref> As a result, an efficient implementation of time-varying additive synthesis of harmonic tones can be accomplished by use of wavetable synthesis.

Group additive synthesis

Group additive synthesis<ref>

{{cite web

| author = Julius O. Smith III

| title = Group Additive Synthesis

| url = https://ccrma.stanford.edu/~jos/sasp/Group_Additive_Synthesis.html

| publisher = CCRMA, Stanford University

| access-date = 12 May 2011

| archive-url = https://web.archive.org/web/20110606200135/https://ccrma.stanford.edu/~jos/sasp/Group_Additive_Synthesis.html

| archive-date = 6 June 2011 | url-status= live}}</ref><ref>

{{cite journal

| author = P. Kleczkowski

| title = Group additive synthesis

| journal = Computer Music Journal

| volume = 13 | issue = 1 | pages = 12–20

| year = 1989

| doi=10.2307/3679851

| jstor = 3679851

}}</ref><ref>

{{cite book

| author = B. Eaglestone and S. Oates

| chapter = Analytical tools for group additive synthesis

| title = Proceedings of the 1990 International Computer Music Conference, Glasgow

| publisher = Computer Music Association

| year = 1990

| chapter-url = http://quod.lib.umich.edu/cgi/p/pod/dod-idx?c=icmc;idno=bbp2372.1990.015

}}</ref> is a method to group partials into harmonic groups (having different fundamental frequencies) and synthesize each group separately with wavetable synthesis before mixing the results.

Inverse FFT synthesis

An inverse fast Fourier transform can be used to efficiently synthesize frequencies that evenly divide the transform period or "frame". By careful consideration of the DFT frequency-domain representation it is also possible to efficiently synthesize sinusoids of arbitrary frequencies using a series of overlapping frames and the inverse fast Fourier transform.<ref name="RodetDepalle_FFTm1">

{{cite journal

| last1 = Rodet | first1 = X.

| last2 = Depalle | first2 = P.

| year = 1992

| title = Spectral Envelopes and Inverse FFT Synthesis

| citeseerx = 10.1.1.43.4818

| journal = Proceedings of the 93rd Audio Engineering Society Convention

}}

</ref>

Additive analysis/resynthesis

[[Image:Sinusoidal Analysis & Synthesis (McAulay-Quatieri 1988).svg|thumb|350px|Sinusoidal analysis/synthesis system for Sinusoidal Modeling (based on )<ref name=MQ1988>{{cite journal

|last1 = McAulay

|first1 = R. J.

|last2 = Quatieri

|first2 = T. F.

|author-link2 = Thomas F. Quatieri

|year = 1988

|title = Speech Processing Based on a Sinusoidal Model

|url = http://www.ll.mit.edu/publications/journal/pdf/vol01_no2/1.2.3.speechprocessing.pdf

|journal = The Lincoln Laboratory Journal

|volume = 1

|issue = 2

|pages = 153–167

|access-date = 9 December 2013

|archive-url = https://web.archive.org/web/20120521071601/http://www.ll.mit.edu/publications/journal/pdf/vol01_no2/1.2.3.speechprocessing.pdf

|archive-date = 21 May 2012

|df = dmy-all

}}</ref>]]

It is possible to analyze the frequency components of a recorded sound giving a "sum of sinusoids" representation. This representation can be re-synthesized using additive synthesis. One method of decomposing a sound into time varying sinusoidal partials is short-time Fourier transform (STFT)-based McAulay-Quatieri Analysis.<ref name=MQ1986>

{{cite journal

| last1 = McAulay | first1 = R. J.

| last2 = Quatieri| first2 = T. F.

| date = Aug 1986

| title = Speech analysis/synthesis based on a sinusoidal representation

| journal = IEEE Transactions on Acoustics, Speech, and Signal Processing

| volume = 34 | issue = 4 | pages = 744–754

| doi = 10.1109/TASSP.1986.1164910 }}</ref><ref>

{{cite web

| title = McAulay-Quatieri Method

| url = http://www.clear.rice.edu/elec301/Projects02/lorisFor/mqmethod2.html

}}</ref>

By modifying the sum of sinusoids representation, timbral alterations can be made prior to resynthesis. For example, a harmonic sound could be restructured to sound inharmonic, and vice versa. Sound hybridisation or "morphing" has been implemented by additive resynthesis.<ref name="XSerraPhD">

{{cite thesis

| degree = PhD

| last = Serra | first = Xavier

| date = 1989

| title = A System for Sound Analysis/Transformation/Synthesis based on a Deterministic plus Stochastic Decomposition

| url = http://mtg.upf.edu/node/304

| publisher = Stanford University

| access-date = 13 January 2012

}}</ref>

Additive analysis/resynthesis has been employed in a number of techniques including Sinusoidal Modelling,<ref>

{{cite web

| last1 = Smith III | first1 = Julius O.

| last2 = Serra | first2 = Xavier

| title = PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation

| url = https://ccrma.stanford.edu/~jos/parshl/Additive_Synthesis.html

| access-date = 9 January 2012

}}</ref> Spectral Modelling Synthesis (SMS), and the Reassigned Bandwidth-Enhanced Additive Sound Model.<ref>

{{cite thesis

| degree = PhD

| last = Fitz | first = Kelly

| date = 1999

| title = The Reassigned Bandwidth-Enhanced Method of Additive Synthesis

| publisher = Dept. of Electrical and Computer Engineering, University of Illinois Urbana-Champaign

| citeseerx = 10.1.1.10.1130

}}</ref> Software that implements additive analysis/resynthesis includes: SPEAR,SPEAR Sinusoidal Partial Editing Analysis and Resynthesis for Mac OS X, MacOS 9 and Windows LEMUR, LORIS, SMSTools,SMSTools application for Windows ARSS.ARSS: The Analysis & Resynthesis Sound Spectrograph

Products

{{multiple image |direction=vertical |align=right |width=165

| header = Additive re-synthesis using timbre-frame concatenation:

| image1 = Wavesequence.svg

| caption1 = Concatenation with crossfades (on Synclavier)

| image2 = Vocaloid's phonemes crossfading - en.jpg

| caption2 = Concatenation with spectral envelope interpolation (on Vocaloid)

}}

New England Digital Synclavier had a resynthesis feature where samples could be analyzed and converted into "timbre frames" which were part of its additive synthesis engine. Technos Acxel, launched in 1987, utilized the additive analysis/resynthesis model, in an FFT implementation.

Also a vocal synthesizer, Vocaloid have been implemented on the basis of additive analysis/resynthesis: its spectral voice model called Excitation plus Resonances (EpR) model<ref name=BonadaICMC01>

{{cite journal

| last1 = Bonada | first1 = J.

| last2 = Celma | first2 = O.

| last3 = Loscos | first3 = A.

| year = 2001

| title = Singing voice synthesis combining Excitation plus Resonance and Sinusoidal plus Residual Models

| periodical = Proc. Of ICMC

| citeseerx = 10.1.1.18.6258

}} (PDF)</ref><ref>

{{cite thesis

| degree = PhD

| last = Loscos | first = A.

| year = 2007

| title = Spectral processing of the singing voice

| location = Barcelona, Spain

| publisher = Pompeu Fabra University

| hdl= 10803/7542

}} (PDF).

See "Excitation plus resonances voice model" (p. 51)

</ref> is extended based on Spectral Modeling Synthesis (SMS),

and its diphone concatenative synthesis is processed using

spectral peak processing (SPP), "Spectral peak processing" technique similar to modified phase-locked vocoder, "Phase locked vocoder" (an improved phase vocoder for formant processing).<ref name=BonadaSMAC03>

{{cite journal

| last1 = Bonada | first1 = Jordi

| last2 = Loscos | first2 = Alex

| year = 2003

| title = Sample-based singing voice synthesizer by spectral concatenation: 6. Concatenating Samples

| url = http://mtg.upf.edu/node/322

| periodical = Proc. of SMAC 03

| pages = 439–442

}}</ref> Using these techniques, spectral components (formants) consisting of purely harmonic partials can be appropriately transformed into desired form for sound modeling, and sequence of short samples (diphones or phonemes) constituting desired phrase, can be smoothly connected by interpolating matched partials and formant peaks, respectively, in the inserted transition region between different samples. (See also Dynamic timbres)

Applications

Musical instruments

Additive synthesis is used in electronic musical instruments. It is the principal sound generation technique used by Eminent organs.

Speech synthesis

In linguistics research, harmonic additive synthesis was used in the 1950s to play back modified and synthetic speech spectrograms.

Later, in the early 1980s, listening tests were carried out on synthetic speech stripped of acoustic cues to assess their significance. Time-varying formant frequencies and amplitudes derived by linear predictive coding were synthesized additively as pure tone whistles. This method is called sinewave synthesis.<ref name=remez81>

{{cite journal

| last = Remez | first = R.E.

|author2= Rubin, P.E.|author3= Pisoni, D.B.|author4= Carrell, T.D.

| s2cid = 13039853 | year = 1981

| title = Speech perception without traditional speech cues

| journal = Science

| bibcode = 1981Sci...212..947R }}</ref> Also the composite sinusoidal modeling (CSM)<ref name=sagayama79a>

{{citation

| last1 = Sagayama | first1 = S. | author-link1 = :ja:嵯峨山茂樹

| last2 = Itakura | first2 = F. | author-link2 = Fumitada Itakura

| year = 1979

|script-title=ja:複合正弦波による音声合成

| trans-title = Speech Synthesis by Composite Sinusoidal Wave

| periodical = Speech Committee of Acoustical Society of Japan

| id = S79-39 | publication-date = October 1979

}}</ref><ref name=sagayama79b>

{{cite conference

| last1 = Sagayama | first1 = S.

| last2 = Itakura | first2 = F.

| date = October 1979

|script-chapter=ja:複合正弦波による簡易な音声合成法

|trans-chapter=Simple Speech Synthesis method by Composite Sinusoidal Wave

|title=Proceedings of Acoustical Society of Japan, Autumn Meeting

| volume = 3-2-3 | pages = 557–558

}}

</ref> used on a singing speech synthesis feature on the Yamaha CX5M (1984), is known to use a similar approach which was independently developed during 1966–1979.<ref name=sagayama86>

{{cite book

| last1 = Sagayama | first1 = S.

| title = ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing

| last2 = Itakura | first2 = F.

| year = 1986

| author-link2 = Fumitada Itakura

| chapter = Duality theory of composite sinusoidal modeling and linear prediction

| publisher = Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.

| volume = 11 | pages = 1261–1264 | publication-date = April 1986

| doi = 10.1109/ICASSP.1986.1168815

| s2cid = 122814777

}}</ref><ref name=itakura04>{{cite journal

|last = Itakura

|first = F.

|author-link = Fumitada Itakura

|year = 2004

|title = Linear Statistical Modeling of Speech and its Applications -- Over 36-year history of LPC --

|url = http://www.icacommission.org/Proceedings/ICA2004Kyoto/pdf/We3.D.pdf

|periodical = Proceedings of the 18th International Congress on Acoustics (ICA 2004), We3.D, Kyoto, Japan, Apr. 2004.

|volume = 3

|pages = III–2077–2082

|publication-date = April 2004

|quote = 6. Composite Sinusoidal Modeling(CSM) In 1975, Itakura proposed the line spectrum representation (LSR) concept and its algorithm to obtain a set of parameters for new speech spectrum representation. Independently from this, Sagayama developed a composite sinusoidal modeling (CSM) concept which is equivalent to LSR but give a quite different formulation, solving algorithm and synthesis scheme. Sagayama clarified the duality of LPC and CSM and provided the unified view covering LPC, PARCOR, LSR, LSP and CSM, CSM is not only a new concept of speech spectrum analysis but also a key idea to understand the linear prediction from a unified point of view. ...

|archive-date = 24 May 2022

|access-date = 24 October 2014

|archive-url = https://web.archive.org/web/20220524112852/https://www.icacommission.org/Proceedings/ICA2004Kyoto/pdf/We3.D.pdf

|url-status = dead

}}</ref> These methods are characterized by extraction and recomposition of a set of significant spectral peaks corresponding to the several resonance modes occurring in the oral cavity and nasal cavity, in a viewpoint of acoustics. This principle was also utilized on a physical modeling synthesis method, called modal synthesis.<ref name=adrien1991>

{{cite book |last = Adrien

|first = Jean-Marie

|chapter = The missing link: modal synthesis

|chapter-url = http://dl.acm.org/citation.cfm?id=131158

|url = https://archive.org/details/representationso0000unse_k7n4/page/269

|publisher = MIT Press

|location = Cambridge, MA

|date = 1991

|isbn = 978-0-262-04113-3

|pages = 269–298

|url-access = registration

}}

</ref><ref name=morrison&adrien1993>

{{cite journal

| last1 = Morrison | first1 = Joseph Derek (IRCAM)

| last2 = Adrien | first2 = Jean-Marie

| title = MOSAIC: A Framework for Modal Synthesis

| journal = Computer Music Journal

| volume = 17 | issue = 1 | publication-date = 1993

| pages = 45–56 | doi=10.2307/3680569

| jstor = 3680569

| year = 1993

}}

</ref><ref name=bilbao2009>

{{citation

| last = Bilbao | first = Stefan

| chapter = Modal Synthesis

| chapter-url= https://ccrma.stanford.edu/~bilbao/booktop/node14.html

| title = Numerical Sound Synthesis: Finite Difference Schemes and Simulation in Musical Acoustics

| publisher = John Wiley and Sons | location = Chichester, UK

| date = October 2009

| isbn = 978-0-470-51046-9

| quote = A different approach, with a long history of use in physical modeling sound synthesis, is based on a frequency-domain, or modal description of vibration of objects of potentially complex geometry. Modal synthesis [1,148], as it is called, is appealing, in that the complex dynamic behaviour of a vibrating object may be decomposed into contributions from a set of modes (the spatial forms of which are eigenfunctions of the particular problem at hand, and are dependent on boundary conditions), each of which oscillates at a single complex frequency. ...

}} (See also companion page)

</ref><ref name=doel&pai2003>

{{cite journal

| last1 = Doel | first1 = Kees van den

| last2 = Pai | first2 = Dinesh K.

| title = Modal Synthesis For Vibrating Object

| url = http://www.cs.ubc.ca/~kvdoel/publications/modalpaper.pdf

| editor-last = Greenebaum | editor-first = K.

| journal = Audio Anecdotes

| publisher = AK Peter | location = Natick, MA

| date = 2003

| quote = When a solid object is struck, scraped, or engages in other external interactions, the forces at the contact point causes deformations to propagate through the body, causing its outer surfaces to vibrate and emit sound waves. ... A good physically motivated synthesis model for objects like this is modal synthesis ... where a vibrating object is modeled by a bank of damped harmonic oscillators which are excited by an external stimulus.

}}

</ref>

History

{{multiple image |direction=horizontal

|header = Lord Kelvin's Tide-predicting machine

|caption1 = Harmonic synthesizer

|image1 = DSCN1739-thomson-tide-machine.jpg |width1=109

|caption2 = Harmonic analyzer

|image2 = Harmonic analyser.jpg |width2=224

}}

Harmonic analysis was discovered by Joseph Fourier, who published an extensive treatise of his research in the context of heat transfer in 1822. The theory found an early application in prediction of tides. Around 1876, William Thomson (later ennobled as Lord Kelvin) constructed a mechanical tide predictor. It consisted of a harmonic analyzer and a harmonic synthesizer, as they were called already in the 19th century. The analysis of tide measurements was done using James Thomson's integrating machine. The resulting Fourier coefficients were input into the synthesizer, which then used a system of cords and pulleys to generate and sum harmonic sinusoidal partials for prediction of future tides. In 1910, a similar machine was built for the analysis of periodic waveforms of sound. The synthesizer drew a graph of the combination waveform, which was used chiefly for visual validation of the analysis.

{{multiple image |direction=horizontal

|caption1 = Helmholtz resonator

|image1 = Helmholtz_Resonator.png |width1=100

|caption2 = Tone-generator utilizing it

|image2 = Helmholtz resonator 2.jpg |width2=148

}}

Georg Ohm applied Fourier's theory to sound in 1843. The line of work was greatly advanced by Hermann von Helmholtz, who published his eight years worth of research in 1863. Helmholtz believed that the psychological perception of tone color is subject to learning, while hearing in the sensory sense is purely physiological. He supported the idea that perception of sound derives from signals from nerve cells of the basilar membrane and that the elastic appendages of these cells are sympathetically vibrated by pure sinusoidal tones of appropriate frequencies. Helmholtz agreed with the finding of Ernst Chladni from 1787 that certain sound sources have inharmonic vibration modes.

{{multiple image |direction=horizontal

|header = Rudolph Koenig's sound analyzer and synthesizer

|caption1 = sound synthesizer

|image1 = Synthesizer after Helmholtz by Koenig 1865.jpg |width1=231

|caption2 = sound analyzer

|image2 = Koenig - klankanalysator purchased in 1996.jpg |width2=102

}}

In Helmholtz's time, electronic amplification was unavailable. For synthesis of tones with harmonic partials, Helmholtz built an electrically excited array of tuning forks and acoustic resonance chambers that allowed adjustment of the amplitudes of the partials. Built at least as early as in 1862, these were in turn refined by Rudolph Koenig, who demonstrated his own setup in 1872. For harmonic synthesis, Koenig also built a large apparatus based on his wave siren. It was pneumatic and utilized cut-out tonewheels, and was criticized for low purity of its partial tones. Also tibia pipes of pipe organs have nearly sinusoidal waveforms and can be combined in the manner of additive synthesis.

In 1938, with significant new supporting evidence, it was reported on the pages of Popular Science Monthly that the human vocal cords function like a fire siren to produce a harmonic-rich tone, which is then filtered by the vocal tract to produce different vowel tones. By the time, the additive Hammond organ was already on market. Most early electronic organ makers thought it too expensive to manufacture the plurality of oscillators required by additive organs, and began instead to build subtractive ones. In a 1940 Institute of Radio Engineers meeting, the head field engineer of Hammond elaborated on the company's new Novachord as having a "subtractive system" in contrast to the original Hammond organ in which "the final tones were built up by combining sound waves". Alan Douglas used the qualifiers additive and subtractive to describe different types of electronic organs in a 1948 paper presented to the Royal Musical Association. The contemporary wording additive synthesis and subtractive synthesis can be found in his 1957 book The electrical production of music, in which he categorically lists three methods of forming of musical tone-colours, in sections titled Additive synthesis, Subtractive synthesis, and Other forms of combinations.

A typical modern additive synthesizer produces its output as an electrical, analog signal, or as digital audio, such as in the case of software synthesizers, which became popular around year 2000.

Timeline

The following is a timeline of historically and technologically notable analog and digital synthesizers and devices implementing additive synthesis.

{| class="wikitable" width="100%"

! width="50" | Research implementation or publication

! width="50" | Commercially available

! width="100" class="unsortable" | Company or institution

! width="50" class="unsortable" | Synthesizer or synthesis device

! class="unsortable" | Description

! width="85" class="unsortable" | Audio samples

| 1900

| 1906

| New England Electric Music Company

| Telharmonium

| The first polyphonic, touch-sensitive music synthesizer. Implemented sinuosoidal additive synthesis using tonewheels and alternators. Invented by Thaddeus Cahill.

| no known recordings

| 1933

| 1935

| Hammond Organ Company

| Hammond Organ

| An electronic additive synthesizer that was commercially more successful than Telharmonium. Implemented sinusoidal additive synthesis using tonewheels and magnetic pickups. Invented by Laurens Hammond.

| 1950 or earlier

| Haskins Laboratories

| Pattern Playback

| A speech synthesis system that controlled amplitudes of harmonic partials by a spectrogram that was either hand-drawn or an analysis result. The partials were generated by a multi-track optical tonewheel.

| samples

| 1958

| ANS

| An additive synthesizer that played microtonal spectrogram-like scores using multiple multi-track optical tonewheels. Invented by Evgeny Murzin. A similar instrument that utilized electronic oscillators, the Oscillator Bank, and its input device Spectrogram were realized by Hugh Le Caine in 1959.

| 1963

| MIT

| An off-line system for digital spectral analysis and resynthesis of the attack and steady-state portions of musical instrument timbres by David Luce.

| 1964

| University of Illinois

| Harmonic Tone Generator

| An electronic, harmonic additive synthesis system invented by James Beauchamp.

| samples (info)

| 1974 or earlier

| 1974<ref name="reid2001">

</ref>

| RMI

| Harmonic Synthesizer

| The first synthesizer product that implemented additive<ref name="flint2008">

</ref> synthesis using digital oscillators. The synthesizer also had a time-varying analog filter. RMI was a subsidiary of Allen Organ Company, which had released the first commercial digital church organ, the Allen Computer Organ, in 1971, using digital technology developed by North American Rockwell.

| 1 2 3 4

| 1974

| EMS (London)

| Digital Oscillator Bank

| A bank of digital oscillators with arbitrary waveforms, individual frequency and amplitude controls, intended for use in analysis-resynthesis with the digital Analysing Filter Bank (AFB) also constructed at EMS. Also known as: DOB.

| in The New Sound of Music Includes a demonstration of DOB and AFB.

| 1976<ref name="leete1999">

</ref>

| 1976

| Fairlight

| Qasar M8

| An all-digital synthesizer that used the fast Fourier transform to create samples from interactively drawn amplitude envelopes of harmonics.

| samples

| 1977

| Bell Labs

| Digital Synthesizer

| A real-time, digital additive synthesizer that has been called the first true digital synthesizer. Also known as: Alles Machine, Alice.

| sample (info )

| 1979

| New England Digital

| Synclavier II

| A commercial digital synthesizer that enabled development of timbre over time by smooth cross-fades between waveforms generated by additive synthesis.

|1996

|Kawai

|K5000

|A commercial digital synthesizer workstation capable of polyphonic, digital additive synthesis of up to 128 sinusodial waves, as well as combing PCM waves.

Discrete-time equations

In digital implementations of additive synthesis, discrete-time equations are used in place of the continuous-time synthesis equations. A notational convention for discrete-time signals uses brackets i.e. and the argument can only be integer values. If the continuous-time synthesis output is expected to be sufficiently bandlimited; below half the sampling rate or , it suffices to directly sample the continuous-time expression to get the discrete synthesis equation. The continuous synthesis output can later be reconstructed from the samples using a digital-to-analog converter. The sampling period is .

Beginning with (),

and sampling at discrete times results in

<math> \begin{align}

y[n] & = y(nT) = \sum_{k=1}^{K} r_k(nT) \cos\left(2 \pi \int_0^{nT} f_k(u)\ du + \phi_k \right) \\

& = \sum_{k=1}^{K} r_k(nT) \cos\left(2 \pi \sum_{i=1}^{n} \int_{(i-1)T}^{iT} f_k(u)\ du + \phi_k \right) \\

& = \sum_{k=1}^{K} r_k(nT) \cos\left(2 \pi \sum_{i=1}^{n} (T f_k[i]) + \phi_k \right) \\

& = \sum_{k=1}^{K} r_k[n] \cos\left(\frac{2 \pi}{f_\mathrm{s}} \sum_{i=1}^{n} f_k[i] + \phi_k \right) \\

\end{align} </math>

where

is the discrete-time varying amplitude envelope

is the discrete-time backward difference instantaneous frequency.

This is equivalent to

where

<math> \begin{align}

\theta_k[n] &= \frac{2 \pi}{f_\mathrm{s}} \sum_{i=1}^{n} f_k[i] + \phi_k \\

&= \theta_k[n-1] + \frac{2 \pi}{f_\mathrm{s}} f_k[n] \\

\end{align} </math> for all

and

References

External links

Digital Keyboards Synergy