Intone

Intone

Formant synthesis voice based on the IRCAM CHANT project (Rodet, Potard, Barriere, 1984).

Intone generates vocal-character sound using 5 parallel formant cells, each producing overlapping FOF (Formant Wave Function) grains: damped sinusoids at formant frequencies. A vowel morph slider smoothly interpolates through /a/, /e/, /i/, /o/, /u/ presets.

16 HP. Monophonic audio output.


How formant synthesis works

The human voice’s distinctive character comes from resonances (formants) in the vocal tract. Different vowel sounds correspond to different formant frequency patterns. Intone models this by summing 5 parallel resonant cells, each generating sound at a formant frequency.

Each formant cell generates FOF grains: short bursts of a damped sinusoid at the formant’s center frequency, with a cosine-windowed attack and exponential decay. Multiple FOFs overlap as they’re re-triggered at the fundamental frequency rate, creating a continuous tone with formant resonance character.


Three excitation modes

Default (nothing patched): Internal F0 oscillator drives FOF generation. V/Oct controls pitch. Works as a vocal-character VCO.

Audio mode (audio patched, switch up): Input audio passes through a parallel resonant bandpass filter bank at the 5 formant frequencies. The audio is formant-shaped: drums gain vocal coloring, oscillators acquire vowel character. V/Oct transposes the formant pattern.

Trigger mode (clock patched, switch down): External rising edges fire fresh FOF grains. Each trigger produces a vowel-character burst. V/Oct transposes the formant frequencies.


Controls

Per-Formant (x5)

ControlRangeDefaultFunction
Freq offset±1 octave0Shifts formant frequency relative to vowel preset
Bandwidth30-500 Hz80 HzWidth of the formant resonance
Amplitude0-1varies per formantLevel of this formant

Global

ControlRangeDefaultFunction
Vowel Morph (slider)0-10 (/a/)Sweeps through /a/ → /e/ → /i/ → /o/ → /u/
Skirt0-10.5FOF attack rate / spectral slope
Mode SwitchAudio/TriggerAudioSelects excitation mode when EXC is patched

Inputs

InputRangeFunction
V/Oct1V/octavePitch (default) or formant transposition (audio/trigger)
EXC±5VExcitation source
F1-F5 Freq CV±5VPer-formant frequency modulation
F1-F5 BW CV±5VPer-formant bandwidth modulation
Vowel CV0-10VVowel morph position
Skirt CV±5VSkirt width modulation

Outputs

OutputFunction
OutMonophonic audio output

Vowel preset frequencies (Hz)

VowelF1F2F3F4F5
/a/7301090244034004400
/e/5301840248034004400
/i/2702290301034004400
/o/570840241034004400
/u/300870224034004400

Spectrum display

Shows 5 formant peaks as colored bell curves on a logarithmic frequency axis (50 Hz - 5 kHz), with a bright composite envelope overlay. Updates in real time as parameters change.


Patch ideas

Vowel Pad: Patch a slow LFO to Vowel CV. The oscillator morphs between vowel sounds continuously.

Vocal Filter: Patch audio to EXC (Audio mode). Drum loops, oscillators, or samples get formant-shaped through the 5 vowel resonances.

Triggered Vowels: Patch a clock to EXC (Trigger mode). Each pulse produces a short vocal burst. Modulate Vowel CV for changing articulation.

Formant Sweep: Patch an LFO to one of the F1-F5 Freq CV inputs. The formant peak sweeps independently of the vowel preset.


Historical context

Based on the FOF technique from the IRCAM CHANT project (1979-1984), originally developed for singing voice synthesis. The same technique was used by composers including Kaija Saariaho, Gerard Grisey, and Marco Stroppa for spectral music composition.