FluidKDTree | SuperCollider 3.13.0 Help

Description

A k-dimensional tree for efficient neighbourhood searches of multi-dimensional data.

FluidKDTree facilitates efficient nearest neighbour searches of multi-dimensional data stored in a FluidDataSet.

k-d trees are most useful for repeated querying of a dataset, because there is a cost associated with building them. If you just need to do a single lookup then using the kNearest message of FluidDataSet will probably be quicker

Whilst k-d trees can offer very good performance relative to naïve search algorithms, they suffer from something called “the curse of dimensionality” (like many algorithms for multi-dimensional data). In practice, this means that as the number of dimensions of your data goes up, the relative performance gains of a k-d tree go down.

Read more about FluidKDTree on the learn platform.

Class Methods

FluidKDTree.new(server, numNeighbours: 1, radius: 0)

Arguments:

server

The Server on which to construct this object

numNeighbours

The number of neighbours to return.

radius

The maximum distance (in high dimensional space) that a returned point can be. Any points beyond radius will not be returned (even if they're within the nearest numNeighbours points). When radius is 0, it is no longer a constraint and the distance of a nearest neighbour is not taken into account.

Constraints

Minimum: 0

Inherited class methods

Instance Methods

.numNeighbours

.numNeighbours = k

Property for numNeighbours. See new

.radius

.radius = r

Property for radius. See new

.fit(dataSet, action)

Build the tree by scanning the points of a FluidDataSet

Arguments:

dataSet	The FluidDataSet of interest. This can either be a data set object itself, or the name of one.
action	A function to execute when the server has completed running fit

.kNearest(buffer, k, action)

Returns the identifiers of the k points nearest to the one passed.

Arguments:

buffer	A Buffer containing a data point to match against. The number of frames in the buffer must match the dimensionality of the FluidDataSet the tree was fitted to.
k	(optional) The number of nearest neighbours to return. The identifiers will be sorted, beginning with the nearest.
action	A function to execute when the server has completed running kNearest

.kNearestDist(buffer, k, action)

Get the distances of the K nearest neighbours to a point.

Arguments:

buffer	A Buffer containing a data point to match against. The number of frames in the buffer must match the dimensionality of the FluidDataSet the tree was fitted to.
k	(optional) The number of nearest neighbours to return. The identifiers will be sorted, beginning with the nearest.
action	A function to execute when the server has completed running kNearestDist

.cols(action)

From superclass: FluidDataObject

The number of columns (dimensions) in this model or dataset / labeset

Arguments:

action

A function to execute when the server has completed running cols

.clear

From superclass: Object

Resets the internal state of the model

Arguments:

(action)

A function to execute when the server has completed running clear

.size(action)

From superclass: FluidDataObject

The number of data points (entries / observations) in this model or dataset / labeset

Arguments:

action

A function to execute when the server has completed running size

.load(dict, action)

From superclass: FluidDataObject

Replace the internal state of the object from a Dictionary.

Arguments:

dict
action	A function to execute when the server has completed running load

.dump(action)

From superclass: FluidDataObject

Dump the state of this object as a Dictionary, which will be passed to the action function provided. This object must first be fit``ted before ``dump can be called.

Arguments:

action

A function to execute when the server has completed running dump

.write(filename, action)

From superclass: FluidDataObject

Save the internal state of the object to a JSON file on disk. This object must first be fit before write can be called.

Arguments:

filename	Path of the file to load from
action	A function to execute when the server has completed running write

.read(filename, action)

From superclass: FluidDataObject

Replace the internal state of the object from a JSON file on disk.

Arguments:

filename	Path of the file to load from
action	A function to execute when the server has completed running read

Inherited instance methods

Undocumented instance methods

.fitMsg(dataSet)

.kNearestDistMsg(buffer, k)

.kNearestMsg(buffer, k)

.kr(trig, inputBuffer, outputBuffer, numNeighbours, radius, lookupDataSet)

.prGetParams

Examples

Big Example// using a KDTree to lookup the point on the plot that is nearest to the mouse ( var ds = FluidDataSet(s).read(FluidFilesPath("../Data/flucoma_corpus.json")); var tree = FluidKDTree(s).fit(ds); ds.dump({ arg dict; var xybuf = Buffer.alloc(s,2); var slicePoints = Buffer.read(s,FluidFilesPath("../Data/flucoma_corpus_slices.wav")); var loader = FluidLoadFolder(FluidFilesPath()); loader.play(s); defer{ FluidPlotter(dict:dict,mouseMoveAction:{ arg view, x, y; xybuf.setn(0,[x,y]); tree.kNearest(xybuf,1,action:{ arg id; var index = id.asInteger; defer{ view.highlight_(id); }; { var start = Index.kr(slicePoints,index); var end = Index.kr(slicePoints,index+1); var sig = PlayBuf.ar(2,loader.buffer,BufRateScale.ir(loader.buffer),1,start)[0]; var dur_sec = min((end-start) / SampleRate.ir,1); var env = EnvGen.kr(Env([0,1,1,0],[0.03,dur_sec-0.06,0.03]),doneAction:2); sig.dup * env; }.play; }); }); }; }); )

radius and num neighbours// Make a plot; once it's up and you're clicking around, change the numbers of // queried neighbours and the permitted radius to see the different behaviours ( var ds = FluidDataSet(s).load( Dictionary.newFrom([ "cols",2, "data",Dictionary.newFrom( 100.collect{ arg i; [i,{rrand(0.0,1.0)}!2] }.flatten ) ]) ); ~tree = FluidKDTree(s); ~tree.numNeighbours = 3; ~tree.radius_(0.04); ~tree.fit(ds); ds.dump({ arg dict; var nn, nnd; var xybuf = Buffer.alloc(s,2); defer{ w = Window(\KDTree,Rect(0,0,705,500)).front; StaticText(w,Rect(500,5,200,20)).string_("numNeighbours:"); TextField(w,Rect(500,25,200,20)).string_(~tree.numNeighbours.asString).action_{|x|~tree.numNeighbours = x.value.asInteger}; StaticText(w,Rect(500,45,200,20)).string_("radius:"); TextField(w,Rect(500,65,200,20)).string_(~tree.radius.asString).action_{|x|~tree.radius = x.value.asFloat}; StaticText(w,Rect(500,85,200,20)).string_("neighbours:"); nn = TextView(w, Rect(500,105,200,40)).string_("").editable_(false); StaticText(w,Rect(500,145,200,20)).string_("distances:"); nnd = TextView(w, Rect(500,165,200,200)).string_("").editable_(false); FluidPlotter(w, Rect(5,5,490,490), dict:dict,mouseMoveAction:{ arg view, x, y; xybuf.setn(0,[x,y]); ~tree.kNearest(xybuf,action:{ arg id; defer{ view.highlight_(id); nn.string = id.asString; }; }); ~tree.kNearestDist(xybuf,action:{ arg id; defer{ nnd.string = id.asString; }; }); }); }; }); )

Queries in a Synth

Input and output is done via buffers, which will need to be preallocated to the correct sizes:

Your input buffer should be sized to the input data dimension (2, in this example)
Your output buffer should be the maximum number of neighbours you are interested in * output dimensionality

We can't simply return labels (i.e. strings) from a UGen, so the query returns the actual data points from a DataSet instead. By default, this is the FluidDataSet against which the tree was fitted. However, by passing a different dataset to kr's lookupDataSet argument instead, you can return different points, so long as the labels in the two datasets match. In this way, the FluidKDTree can be used to perform nearest neighbour mappings in a synth.

For instance, whilst fitting the tree against some n-dimensional descriptor data, our lookup dataset could use the same labels to map descriptor entries back to buffers, or locations in buffers, so that queries can be used to trigger audio.~src = Buffer.read(s,FluidFilesPath("Constanzo-PreparedSnare-M.wav")); // create two datasets: // one of mfcc analyses for each slice and one of the playback information for each slice ( var indices = Buffer(s); var mfccs = Buffer(s); var stats = Buffer(s); var flat = Buffer(s); var playback_info_dict = Dictionary.newFrom([ "cols",2, "data",Dictionary.new; ]); ~ds_mfccs = FluidDataSet(s); FluidBufOnsetSlice.processBlocking(s,~src,indices:indices,metric:9,threshold:0.7); indices.loadToFloatArray(action:{ arg fa; // go through each slice (from one slice point to the next) fa.doAdjacentPairs{ arg start, end, i; var num = end - start; var id = "slice-%".format(i); // add playback info for this slice to this dict playback_info_dict["data"][id] = [start,num]; FluidBufMFCC.processBlocking(s,~src,start,num,startCoeff:1,features:mfccs); FluidBufStats.processBlocking(s,mfccs,stats:stats,select:[\mean]); FluidBufFlatten.processBlocking(s,stats,destination:flat); // add analysis info for this slice to this data set ~ds_mfccs.addPoint(id,flat); }; ~ds_playback = FluidDataSet(s).load(playback_info_dict); ~ds_mfccs.print; ~ds_playback.print; }); ) // we'll use this kdtree to find the nearest neighbour in 13 dimensions (mfccs) ~tree = FluidKDTree(s); ~tree.fit(~ds_mfccs); // of course it will often find "itself" because the source and lookup sounds are the same. // try with different source and lookup sounds ( { var src = PlayBuf.ar(1,~src,BufRateScale.ir(~src),loop:1); var mfccs = FluidMFCC.kr(src,startCoeff:1); var mfccbuf = LocalBuf(mfccs.numChannels); var playbackinfo = LocalBuf(2); var trig = Impulse.kr(10); // could change how often the lookup happens... var start, num, sig_looked_up; FluidKrToBuf.kr(mfccs,mfccbuf); // kdtree finding the nearest neighbour in 13 dimensions ~tree.kr(trig,mfccbuf,playbackinfo,1,lookupDataSet: ~ds_playback); # start, num = FluidBufToKr.kr(playbackinfo); start.poll(label:"start frame"); num.poll(label:"num frames"); // not using num frames for playback here, but one certainly could! sig_looked_up = PlayBuf.ar(1,~src,BufRateScale.ir(~src),trig,start); [src,sig_looked_up * -7.dbamp]; }.play; ) // note that the UGEN kr out outputs the number of valid items found. It is a useful // information when querying with radius size and large k (or k=0) ( { var src = PlayBuf.ar(1,~src,BufRateScale.ir(~src),loop:1); var mfccs = FluidMFCC.kr(src,startCoeff:1); var mfccbuf = LocalBuf(mfccs.numChannels); var playbackinfo = LocalBuf(4); //up to 2 points of 2 dimensions var trig = Impulse.kr(5); var num; FluidKrToBuf.kr(mfccs,mfccbuf); // kdtree finding all nearest neighbour in 13 dimensions within a radius of 20 num = ~tree.kr(trig, inputBuffer: mfccbuf, outputBuffer: playbackinfo, numNeighbours: 0, radius: 20, lookupDataSet: ~ds_playback).poll(trig,"number of valid k: "); FluidBufToKr.kr(playbackinfo).poll(trig * num, "content of buffer: "); src.dup }.play; )

helpfile source: /usr/share/SuperCollider/Extensions/FluidCorpusManipulation/HelpSource/Classes/FluidKDTree.schelp
link::Classes/FluidKDTree::

FluidKDTree : FluidModelObject : FluidDataObject : FluidServerObject : Object Extension

FluidKDTree.new(server, numNeighbours: 1, radius: 0)

Arguments:

.numNeighbours

.numNeighbours = k

.radius

.radius = r

.fit(dataSet, action)

Arguments:

.kNearest(buffer, k, action)

Arguments:

.kNearestDist(buffer, k, action)

Arguments:

.cols(action)

Arguments:

.clear

Arguments:

.size(action)

Arguments:

.load(dict, action)

Arguments:

.dump(action)

Arguments:

.write(filename, action)

Arguments:

.read(filename, action)

Arguments:

.fitMsg(dataSet)

.kNearestDistMsg(buffer, k)

.kNearestMsg(buffer, k)

.kr(trig, inputBuffer, outputBuffer, numNeighbours, radius, lookupDataSet)

.prGetParams

FluidKDTree : FluidModelObject : FluidDataObject : FluidServerObject : Object
Extension