FluidSpectralShape | SuperCollider 3.13.0 Help

Description

Seven of the spectral shape descriptors, computed on a linear scale for both amplitude and frequency.

The descriptors are:

the four first statistical moments (https://en.wikipedia.org/wiki/Moment_(mathematics)), more commonly known as:
- the spectral centroid (1) in Hertz. This is the point that splits the spectrum in 2 halves of equal energy. It is the weighted average of the magnitude spectrum.
- the spectral spread (2) in Hertz. This is the standard deviation of the spectrum envelope, or the average of the distance to the centroid.
- the normalised skewness (3) as ratio. This indicates how tilted is the spectral curve in relation to the middle of the spectral frame, i.e. half of the Nyquist frequency. If it is below the frequency of the magnitude spectrum, it is positive.
- the normalised kurtosis (4) as ratio. This indicates how focused the spectral curve is. If it is peaky, the value is high.
the rolloff (5) in Hertz. This indicates the frequency under which 95% of the energy is included.
the flatness (6) in dB. This is the ratio of geometric mean to the magnitude, over the arithmetic mean of the magnitudes. It yields a very approximate measure on how noisy a signal is.
the crest (7) in dB. This is the ratio of the loudest magnitude over the RMS of the whole frame. A high number is an indication of a loud peak poking out from the overall spectral curve.

The drawings in Peeters 2003 ( http://recherche.ircam.fr/anasyn/peeters/ARTICLES/Peeters_2003_cuidadoaudiofeatures.pdf ) are useful, as are the commented examples below. For the mathematically-inclined reader, the tutorials and code offered here ( https://www.audiocontentanalysis.org/ ) are interesting to further the understanding. For examples of the impact of computing the moments in power magnitudes, and/or in exponential frequency scale, please refer to the helpfile.

The process will return a multichannel control stream with the seven values, which will be repeated if no change happens within the algorithm, i.e. when the hopSize is larger than the signal vector size.

Read more about FluidSpectralShape on the learn platform.

Class Methods

FluidSpectralShape.kr(in: 0, select, minFreq: 0, maxFreq: -1, rolloffPercent: 95, unit: 0, power: 0, windowSize: 1024, hopSize: -1, fftSize: -1, maxFFTSize: -1)

Arguments:

in	Audio-rate signal to analyze
select	An array of `symbols` indicating which analyses to return. The options are `centroid`, `spread`, `skewness`, `kurtosis`, `rolloff`, `flatness`, and `crest`. If nothing is specified, the object will return all the analyses. The analyses will always appear in their normal order, this argument just allows for a selection of them to be returned. Reordering the options in this argument will not reorder how the analyses are returned.
minFreq	The minimum frequency that the algorithm will consider for computing the spectral shape. Frequencies below will be ignored. The default of 0 goes down to DC when possible. Constraints Minimum: `0`
maxFreq	The maximum frequency that the algorithm will consider for computing the spectral shape. Frequencies above will be ignored. The default of -1 goes up to Nyquist. Constraints Minimum: `-1`
rolloffPercent	This sets the percentage of the frame's energy that will be reported as the rolloff frequency. The default is 95%. Constraints Minimum: `0` Maximum: `100`
unit	The frequency unit for the spectral shapes to be computed upon, and outputted at. The default (0) is in Hertz and computes the moments on a linear spectrum. The alternative is in MIDI note numbers(1), which compute the moments on an exponential spectrum.
power	This flag sets the scaling of the magnitudes in the moment calculation. It uses either its amplitude (0, by default) or its power (1).
windowSize	The window size. As sinusoidal estimation relies on spectral frames, we need to decide what precision we give it spectrally and temporally. For more information visit https://learn.flucoma.org/learn/fourier-transform/
hopSize	The window hop size. As sinusoidal estimation relies on spectral frames, we need to move the window forward. It can be any size, but low overlap will create audible artefacts. The -1 default value will default to half of windowSize (overlap of 2).
fftSize	The inner FFT/IFFT size. It should be at least 4 samples long, at least the size of the window, and a power of 2. Making it larger allows an oversampling of the spectral precision. The -1 default value will default to windowSize.
maxFFTSize	Set an explicit upper bound on the FFT size at object instantiation. The default of `nil` or -1 sets this to whatever the initial FFT size is

Returns:

A 7-channel KR signal with the seven spectral shape descriptors. The latency is windowSize.

Inherited class methods

Undocumented class methods

FluidSpectralShape.features

FluidSpectralShape.prProcessSelect(a)

FluidSpectralShape.prWarnUnrecognised(sym)

Instance Methods

Inherited instance methods

Examples

logarithmic scaleThe computation of the spectral centroid can also be done considering a logarithmic pitch scale and the power of the magnitudes. This yields values that are generally considered to be more in line with perception, for instance where the shape is often drawn and described in logarithmic terms, i.e., dB per octave.

Compare the values of the centroid and the spread in both scales. The lower the frequency, the more the linear spectral bias shows. The same applies to the spread. The logarithmic unit is in semitones. To convert, etiher divide by 12 to get the octave of one standard deviation, or divide by 6 to get the width of the filter in octaves. One clear observation is that the width is now in a range that scales with what we hear, growing fourfold as the filter goes from resonanting to more broadband.( { var freq = MouseX.kr(20,20000,1); var q = MouseY.kr(1,100); var noise = BPF.ar(PinkNoise.ar,freq,q.reciprocal); var lin = FluidStats.kr(FluidSpectralShape.kr(noise,[\centroid,\spread]),40)[0]; var log = FluidStats.kr(FluidSpectralShape.kr(noise,[\centroid,\spread],unit:1,power:1),40)[0]; freq.poll(label:"filter frequency "); q.poll(label:"filter q "); lin[0].poll(label:"lin centroid "); lin[1].poll(label:"lin spread "); log[0].midicps.poll(label:"log centroid "); log[1].poll(label:"log spread "); 0.poll(label:"-------------------------"); noise.dup; }.play; )

helpfile source: /usr/share/SuperCollider/Extensions/FluidCorpusManipulation/HelpSource/Classes/FluidSpectralShape.schelp
link::Classes/FluidSpectralShape::

FluidSpectralShape : FluidRTMultiOutUGen : MultiOutUGen : UGen : AbstractFunction : Object Extension