Dan’s installation

Dan St. Clair is talking about his awesome installation, which involves speakers hanging from trees doing a bird-like rendition of ‘like a virgin’, which is utterly tweaking out the local mockingbirds.

when he was an undergrad he did a nifty project with songs stuck in people’s heads. It was conceptual and not musical.

when he lived in Chicago he did a map of muzak in stores on State Street, including genre and delivery method. He made a tourist brochure with muzak maps and put them in visitor centers.

he’s interested in popular music in environmental settings.

Max Neuhaus did an unmarked, invisible sound installation in Times Square. Dan dug the sort of invisible, discovery aspect.

his bird emulator is solar powered. Needs no cables. Has an 8bit microcontroller. They’re cheap as hell.

he’s loaded frequency envelopes in memory. Fixed control rate. Uses a single wavetable oscillator: http://www.myplace.nu/avr/minidds/index.htm
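
a rough sketch of how a frequency envelope can drive a single wavetable oscillator at a fixed control rate. This is python, not Dan's microcontroller code, and the table size, sample rate and control period are my assumptions, not his numbers.

```python
import math

TABLE_SIZE = 256  # assumed table length, not from the talk
# One cycle of a sine wave, stored once and read back at varying speeds.
SINE_TABLE = [math.sin(2 * math.pi * i / TABLE_SIZE) for i in range(TABLE_SIZE)]


def render(freq_envelope, sample_rate=8000, control_period=64):
    """Render samples from one wavetable oscillator.

    freq_envelope holds one frequency in Hz per control tick. The
    frequency only changes once every control_period samples, which
    is what a fixed control rate means here.
    """
    phase = 0.0
    out = []
    for freq in freq_envelope:
        # How many table steps to advance per output sample.
        increment = freq * TABLE_SIZE / sample_rate
        for _ in range(control_period):
            out.append(SINE_TABLE[int(phase) % TABLE_SIZE])
            phase = (phase + increment) % TABLE_SIZE
    return out
```

calling `render([440, 880])` gives 128 samples: 64 at 440 Hz, then 64 at 880 Hz. One oscillator per partial would get you closer to the extracted bird partials.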

he made recordings of birds and extracted the partials.

he throws this up into trees. However, neighbors got annoyed and called the cops or destroyed the speakers.

he’s working on a new version which is in close proximity to houses. He’s adding a calendar to shut it down sometimes, and amplitude controls.

he has an IFF class to deal with sdif and midi files. SDIFFrames class works with these files.

there’s some cool classes for fft, like FFTPeaks

he’s written some cool guis for finding partials.

his method of morphing between bird calls and pop songs is pretty brilliant.

dan is awesome

live video

Sam Pluta wrote some live video software. It’s inspired by glitchbot and MEAPSoft.

glitchbot records sequences and loops and stutters them. Records a 16 bar phrase and loops and tweaks it. I think I have seen this. It can add beats and do subloops, etc.

the sample does indeed sound glitchy

probability control can be clumsy in live performance. Live control of beats is hard.

MEAPSoft does reordering.

his piece from the last symposium used a sample bank which he can interpret, record his interpretation, and then do stuff with that. So there are two layers of improvisation. It has a small initial parameter space and uses a little source to make a lot of stuff.

I remember his piece from last time.

what he learned from that was that it was good, especially for noisy music. And he controlled it by hitting a lot of keys which was awesome

he wrote an acoustic piece using sound block. Live instruments can do looping differently, you can make the same note longer.

so he read Michel Chion’s book on film and was influenced. He started finding sound moments in films. And decided to use them for source material.

sci-fi films have the best sound, he says.

playing a lot of video clips in fast succession is hard, because you need a format that renders single frames quickly. Pixlet format is good for that.

audio video synch is hard with quicktime, so he loaded audio into sc and did a bridge to video with quartz composer.

qc is efficient at rendering

he wanted to make noisy loops, like to change them. You can’t buffer video loops in the same way, so he needed to create metaloops of playback information. So looped data.

a loop contains pointers to movie clips, but starts from where he last stopped. Which sounds right.
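
here is how I picture the metaloop idea, as a python sketch. The class names `ClipPointer` and `MetaLoop` are made up by me, not taken from Sam's software. The point is that the loop stores playback instructions rather than frames, and each pointer remembers where it last stopped.

```python
from dataclasses import dataclass, field


@dataclass
class ClipPointer:
    """Reference to a movie clip plus the frame where playback resumes."""
    clip_name: str
    n_frames: int
    position: int = 0  # frame where this clip was last stopped

    def play(self, n):
        """Return the next n frame indices, wrapping at the end of the clip."""
        frames = [(self.position + i) % self.n_frames for i in range(n)]
        self.position = (self.position + n) % self.n_frames
        return frames


@dataclass
class MetaLoop:
    """A loop of playback instructions, not of video data."""
    steps: list = field(default_factory=list)  # (ClipPointer, frames_to_play)

    def run_once(self):
        """Play every step once, each clip resuming from its own position."""
        return [(ptr.clip_name, ptr.play(n)) for ptr, n in self.steps]
```

so every pass through the same loop shows slightly different material, because each clip picks up from its last position instead of rewinding.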

he organized the loops by category: kissing, car chases, drones, etc.

this is an interesting way of organizing and might help my floundering blake piece.

he varies loop duration based on the section of the piece.

live blog : beast mulch

Scott is talking about beast mulch, which is still unreleased.
there are classes for controllers, like hardware. There’s a plugin framework to easily extend stuff. BMPluginSpec(‘name’, {|this| etc. . . .

multichannel stuff and swarm granulation, etc.

kd tree class finds closest speaker neighbor
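
for my own reference, a tiny k-d tree in python that finds the closest speaker to a point. This is a generic textbook version and says nothing about how the beast mulch class is actually written. The speaker coordinates in the usage note are invented.

```python
import math


def build(points, depth=0):
    """Build a k-d tree as nested tuples: (point, left, right)."""
    if not points:
        return None
    axis = depth % len(points[0])  # cycle through x, y, z
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2
    return (points[mid],
            build(points[:mid], depth + 1),
            build(points[mid + 1:], depth + 1))


def nearest(node, target, depth=0, best=None):
    """Return the stored point closest to target."""
    if node is None:
        return best
    point, left, right = node
    if best is None or math.dist(point, target) < math.dist(best, target):
        best = point
    axis = depth % len(target)
    diff = target[axis] - point[axis]
    near, far = (left, right) if diff < 0 else (right, left)
    best = nearest(near, target, depth + 1, best)
    # Only search the far side if the splitting plane is closer than the best hit.
    if abs(diff) < math.dist(best, target):
        best = nearest(far, target, depth + 1, best)
    return best
```

usage would be something like `tree = build([(0, 0, 0), (1, 0, 0), (0, 1, 0)])` and then `nearest(tree, (0.9, 0.1, 0.0))` to pick the speaker at `(1, 0, 0)`.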

if you want beastmulch, get it from scott’s website

there’s speaker classes, BMSpeaker

BMInOutArray does associations.

beast mulch is a big library for everyone. Everything must be named. There are time references, like a soundfile player.

trying to be adaptable. 100 channels or 8, make it work on both. Supports jit and stems.

a usage example: can be used live. Routing table, control matrices. Pre and post processing use plugins.

I NEED to download this & use it.

http://scottwilson.ca

http://www.beast.bham.ac.uk/research/mulch.shtml

. . .

timbral analysis

Dan Stowell is talking about beatboxing and machine listening

live blogging the supercollider symposium

analyze one signal and use it to control another. Pitch and amplitude are done. So let’s do timbre remapping.

extract features from sound, decorrelate and reduce dimensions, map it to a space. What features to use? Mfccs, spectral crest factors. That’s looking for peaks vs flatness.
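
to remind myself what a crest factor is, a minimal python sketch. It takes a list of magnitude bins from one fft frame. Real implementations often work on power and on separate sub-bands, so treat this as the bare idea and not Dan's feature extractor.

```python
def spectral_crest(magnitudes):
    """Ratio of the largest bin to the mean bin magnitude.

    Close to 1 for a flat, noisy spectrum and large for a peaky one.
    """
    mean = sum(magnitudes) / len(magnitudes)
    if mean == 0:
        return 0.0  # silent frame, nothing to compare
    return max(magnitudes) / mean
```

a flat frame like `[1, 1, 1, 1]` gives 1.0, and a frame with one lone peak like `[0, 0, 4, 0]` gives 4.0.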

his experiments use simulated degradation to make sure it works in performance.

voice works well with mfccs, but they are not noise robust. Spectral crests are complementary and are noise robust. The two give you a lot of info.

a lot of different analyses give you useful information about perceptual differences.

now he’s talking about an 8bit chip and controlling it. Was this on boing boing or something recently?

spectral centroid. The 95th percentile of energy from the left shows the rolloff frequency.
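
both measures are easy to write down. A python sketch, assuming the input is a list of magnitude bins that run evenly from 0 Hz up to half the sample rate. The function names and the bin layout are my assumptions.

```python
def spectral_centroid(magnitudes, sample_rate):
    """Magnitude-weighted mean frequency of one spectrum frame."""
    total = sum(magnitudes)
    if total == 0:
        return 0.0
    bin_width = (sample_rate / 2) / (len(magnitudes) - 1)
    return sum(i * bin_width * m for i, m in enumerate(magnitudes)) / total


def spectral_rolloff(magnitudes, sample_rate, fraction=0.95):
    """Frequency below which `fraction` of the spectral energy lies."""
    energies = [m * m for m in magnitudes]
    threshold = fraction * sum(energies)
    bin_width = (sample_rate / 2) / (len(magnitudes) - 1)
    running = 0.0
    # Accumulate energy from the lowest bin upward ("from the left").
    for i, e in enumerate(energies):
        running += e
        if running >= threshold:
            return i * bin_width
    return (len(magnitudes) - 1) * bin_width
```

with five bins at a sample rate of 8000, each bin is 1000 Hz wide, so a single peak in the middle bin puts both the centroid and the rolloff at 2000 Hz.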

he’s showing a video of the inside of his throat

timbral analysis

Nick Collins is talking about timbral analysis and phase vocoders, which is supercollider-ese for ffts.

I missed the first couple of minutes of this because there is an installation outside of solar-powered speakers in trees, doing bird song like sounds, which played Madonna’s ‘like a virgin’ when I walked by and I had to fall over laughing. Hahahah

ok, back to the present. AtsSynth does some cool stuff with pitch shifting.

Scott Wilson’s ugens do Loris stuff. Which is noise modulated sine tones. Sinusoidal peak detection.

TPV ugen does pure sinusoidal stuff. Sines and phases. Takes an fft chain input and creates sine outputs with resynthesis. Finds n peaks and uses that number of sinusoids. This is cool. And is part of sc 3.3
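
the "find n peaks" step is the part I can sketch. This python version just picks the n strongest local maxima from one frame of magnitudes. It is my own simplification and not how the TPV ugen does it internally. Tracking peaks across frames and resynthesizing the sines is left out.

```python
def pick_peaks(magnitudes, n):
    """Return bin indices of the n strongest local maxima, strongest first."""
    peaks = [i for i in range(1, len(magnitudes) - 1)
             if magnitudes[i - 1] < magnitudes[i] >= magnitudes[i + 1]]
    peaks.sort(key=lambda i: magnitudes[i], reverse=True)
    return peaks[:n]
```

each returned bin would then get its own sinusoid, with the bin's magnitude as amplitude.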

SMS is spectral modelling synthesis. Sines plus noise. This is slightly expensive. But it preserves formants in repitching. So it sounds right with shifting speech.

good stuff!

theory continued

time point synthesis. Babbitt wanted to serialize parameters in addition to pitch. He used durational sets, which becomes dull. And doesn’t transform well.

instead use integers to map to a table of durations. Your grid has 12 durations just cuz. Andrew Mead did some work on this.

there is a class TimePoints. Which is an array.
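
my reading of the integers-to-durations idea, as a python sketch. The duration table is invented for illustration and the function names are mine, not the TimePoints class. The nice part is that the usual mod 12 row operations work on the integers before you look anything up.

```python
# Hypothetical table: 12 durations in beats, indexed by the integers 0-11.
DURATION_TABLE = [0.25 * (i + 1) for i in range(12)]


def transpose(row, n):
    """Transpose a row of integers by n, mod 12."""
    return [(x + n) % 12 for x in row]


def invert(row):
    """Invert a row of integers around 0, mod 12."""
    return [(-x) % 12 for x in row]


def to_durations(row, table=DURATION_TABLE):
    """Look up each integer in the duration table."""
    return [table[x] for x in row]
```

so `to_durations(transpose([0, 11, 5], 2))` transforms the row first and only then turns it into durations.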

this is a rhythm lib. I should look into this.

we’re listening to ‘homily’ by babbitt, which uses these kinds of transformations.

and the code isn’t on the internets.

and now virtual gamelan graz

this is an attempt to model everything about gamelan.

tuning: well, don’t model everything, just the metallophones. The tuning should be an ideal. This requires fieldwork and interviewing builders. Or you could just measure existing instruments.

pick one instrument. Measure root pitches. You’re good.

or do more recording like Sethares. Measure more ensembles. Which partial is the root?

these guys sampled the local gamelan and went with that.

the tuning . . . Are we sure of the root pitches? Is it the instruments relative to each other, one in reference to itself, the partials in a single note?

there is an image on a grid, which is hard to see as a slide.

you can do a lot of retuning.

Sumarsam is raising a point on pelog tuning. The musicologist in the group is absent so the presenters have to defer.

how to synthesize: samples or synthesis? They use sines and formlet filters.

performance modelling. Model human actors or do contextual knowledge.

they did not go with individuals.

They have an event model. Each note is an event, which holds what you need to know.

audio demo. It does tempo changes right. They use ListeningClocks to do time right. I need to look at this class. They follow each other. You can set empathy and confidence, to how much they deviate.

listening to theory

panel: listening to theory

Sound in film makes film real and anchors it to the real world. People infer sources of sound with visual cues.

causation – synchresis is synchronization and synthesis. Does sound exist in a vacuum? This is a philosophical question. A related question is where does sound come from?

is an echo one sound or two? Depending on what you think, your perception changes.

what about form and matter? Is it just a medium, or is it the very stuff of sound?

now we are watching a film of car traffic which looks like it might have been filmed in germany. It’s got sounds of cars and wind and birds.

but all the sounds were made in supercollider!

so what was before intentions or agency is now about algorithms and effects.

now Renate Wieser will speak. She did an installation called the phaedrus machine. This is related to a Socratic dialog, which she is describing. Good people are reincarnated as philosophers, bad people as George Bush. (These are my words, not hers.)

to practice good life and avoid a bad reincarnation, she has a video game you can play to practice looking for truth. There are sound cues if you reach truth or if you fall from it. The game is audio only and uses a vertical speaker arrangement. You do get feedback in the form of a spreadsheet at the end which describes your reincarnation level.

she has another installation called ‘survival of the cutest.’ It’s a play with voices coming out of different speakers. Sc sends them to whatever channel, semi-randomly.

the Excel thing with the spreadsheet works because sc writes to a tab delimited file and Excel looks at it from time to time.

Tom Hall will speak now. He’s talking about 20th century stuff. Legacy of musical modernism. What is a musical object? Instruments vs sounds.

20th century had more math stuff in music than any time since the renaissance. Schoenberg came up with twelve tone almost a hundred years ago. Stravinsky took it up after Schoenberg died.

Stravinsky said when he composed with intervals, he was aware of them as objects. Babbitt took up the 12tone. He was into the maximum diversity of permutations.

set class theory is an american thing. There’s some set class stuff in supercollider, though.

a set can be represented by an array. Tones are integers in equal temperament, much like midi.
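
a quick python sketch of sets as arrays of integers, with the standard operations. These are textbook pitch class set functions written by me, not the set class code that ships with supercollider.

```python
def pc_set(notes):
    """Reduce midi note numbers to a sorted list of pitch classes 0-11."""
    return sorted({n % 12 for n in notes})


def transpose_set(pcs, n):
    """Transpose every pitch class by n semitones, mod 12."""
    return sorted((p + n) % 12 for p in pcs)


def invert_set(pcs):
    """Invert every pitch class around 0, mod 12."""
    return sorted((-p) % 12 for p in pcs)


def interval_vector(pcs):
    """Count interval classes 1-6 between all pairs (pcs must be distinct)."""
    vec = [0] * 6
    for i, a in enumerate(pcs):
        for b in pcs[i + 1:]:
            d = (b - a) % 12
            vec[min(d, 12 - d) - 1] += 1
    return vec
```

a C major chord with the root doubled, `pc_set([60, 64, 67, 72])`, comes out as `[0, 4, 7]`, and its interval vector is `[0, 0, 1, 1, 1, 0]`.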

he has a pitchcircle class to visualize sets.

a powerset is the set of all subsets of a set’s elements. The powerset of an n size set will have size 2**n.
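
to check the 2**n claim, a short python sketch using the standard library. The function name is mine.

```python
from itertools import combinations


def powerset(elements):
    """All subsets of elements, from the empty set up to the full set."""
    items = list(elements)
    return [list(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]
```

a three note set like `[0, 4, 7]` gives 8 subsets, counting the empty set and the full chord.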

Tom Johnson wrote a piece called ‘chord catalog’ which sounds cool. http://www.editions75 . . .

break for 5

machine learning panel

Panel discussion on neural stuff. Jan is speaking about self organizing maps, which is a talk he gave at brum last term.
He’s making snapshots for presets. It can be used to find similar presets. That are like ones he likes.
It creates a meta controller, which is more high level.
He can use it to make sound objects.
And it’s between top down and bottom up approaches.
He’s got a graph on how he uses it. He plays with it to make snapshots. The snapshots are fed into the som which generates similar material, which he can use for a meta controller. He can make a map of material and then make a path to traverse the map and then control where he is on the path with a slider.
Soms can be used to control anything, including each other.
A snapshot can be an array or an event. His examples use ron’s preset library.
it is an unsupervised neural network.
SynthDescLib lets you make a gui with presets. Or maybe this is Jan’s code lib. There is a button to generate a som from presets in the gui. And a matrix comes up. Some of them are green, which are the ones he picked. The others are related. As you click on them it saves your path. There is a slider at the top that moves through the path. You can save your state.
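
for anyone who hasn't met soms, a bare bones one in python. This is a generic textbook som, not Jan's code. The grid size, learning rate and radius schedule are all arbitrary choices of mine. Each preset snapshot is just a list of numbers.

```python
import math
import random


def best_match(grid, vec):
    """Return the grid cell whose weights are closest to vec."""
    return min(grid, key=lambda cell: math.dist(grid[cell], vec))


def train_som(data, rows=4, cols=4, epochs=50, seed=0):
    """Train a rows x cols map on a list of equal-length vectors."""
    rng = random.Random(seed)
    dim = len(data[0])
    grid = {(r, c): [rng.random() for _ in range(dim)]
            for r in range(rows) for c in range(cols)}
    for epoch in range(epochs):
        frac = 1 - epoch / epochs            # shrinks from 1 toward 0
        rate = 0.5 * frac                    # learning rate decays over time
        radius = 1 + (max(rows, cols) / 2) * frac
        for vec in data:
            winner = best_match(grid, vec)
            for cell, weights in grid.items():
                d = math.dist(cell, winner)  # distance on the grid, not in data
                if d <= radius:
                    pull = rate * math.exp(-d * d / (2 * radius * radius))
                    for i in range(dim):
                        weights[i] += pull * (vec[i] - weights[i])
    return grid
```

after training, cells near the cell that wins for a favourite preset hold related settings, which is presumably what the non-green squares in the matrix are.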

now Dan Stowell is recapping and he has made soms as a ugen. He is showing the thing he did at the london sc meetup. It runs on the server and gets trained in advance by analyzing samples.

it imposes the eq of one sample onto another sample. Which works and is impressive. The som has a visualizer. Pretty. It is not for download. I find his gui is set up kind of in reverse of how I’d think about it.

too much coffee for me. Pee break now. Ok back.

David has a flickr feed live from here

now Nick Collins is showing his work on the topic. He’s got an som implementation too. With a helpfile. He analyzes midi files. He breaks them up into little bits. He will release his files shortly.

now he’s talking of reinforcement learning, which is a way of considering an agent acting in the world. (See David’s photo of the slide.) A state leads to an action, which in turn affects the world, which changes the state. Reinforcement learning looks at how effective actions are. So the program must have an idea of the world. It must also have a way of grading the reward of how good the world is. So you need to decide if something sounds good.

he has sc code to deal with this. LGDsarsa is on his website.
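
the core of sarsa is one small update rule, sketched here in python. This is the plain tabular version from the textbooks and I'm not claiming it matches LGDsarsa. The state and action names in the usage note are invented.

```python
from collections import defaultdict


def new_q_table():
    """Value table mapping (state, action) pairs to estimates, default 0."""
    return defaultdict(float)


def sarsa_update(q, state, action, reward, next_state, next_action,
                 alpha=0.1, gamma=0.9):
    """One on-policy step: nudge Q(s, a) toward r + gamma * Q(s', a')."""
    target = reward + gamma * q[(next_state, next_action)]
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]
```

so after the agent does `"add_beat"` in state `"quiet"`, hears the result, and picks its next action, you call `sarsa_update(q, "quiet", "add_beat", reward, "busy", "rest")`. The hard musical part is still deciding what the reward is.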

because machine learning is computationally expensive, it’s often farmed out to an external batch process. Or you can run in non rt mode. Dan has a nice ugen for this called Logger. There are code examples on the mailing list. It creates data files which can then be used for machine learning.

he’s got a self similarities table for a pixies track.

ok, on to the more panelly part

how do you get the reward state in sarsa? Physiological monitoring is one way. Or you can ask the audience, which has a delay, but you can propagate backwards. Or you can do it in a model.

Jan is doing a project with Thom which is similar but will generate full pieces. Nick recommends Tom Mitchell’s book on machine learning.

why is a reward better than a rule? Why is it more interesting to train a net vs creating rules? Answer is that they can be used for different applications. Ron notes that rules are implicitly present in selection of training material and assumptions. Nick is talking about flexibility and creative machines. Dan says that Ron is correct, but the number of possibilities in even a small data set is huge. Ron says constraints are cool. The panel says that supercollider is cool.

James, our leader, is talking about intent. What if we inverted rewards to make the audience unhappy? Nick points out it’s still hard to gauge cultural preferences.

there’s a question about specificity vs building an overly large tool. Jan agrees this is a trap. Nick says that specificity is more musically effective. He talks about hard coding. There’s too much variation sometimes.

performances with live evolution. Using a human as a fitness function is slow. Nick talks about a computer as an improvisor. His phd does this, which you can download. He’s switched to midi because feature extraction is hard. Jan is talking about having few sliders.

neural network and machine learning

I showed up late for the talk on neural networks, which sucks, but I needed my coffee.
The speaker is demonstrating using a neural network to process gestural input from a wiimote. It makes 64 vectors describing the motion of the wiimotes. He can train the neural net by making the same gesture over and over.
The auditorium speakers are making a high pitched squeal.
Now he’s talking about continuous time recurrent neural networks. These are used in robotics. They evolve instead of being trained. (Trained ones are called feed forward)
Ollie Bown did some code for this. He suggests that the smoothing function be replaced with hyperbolic tans, which become excitation functions, and it does not reach equilibrium. You can use this for interactive evolution.
Squeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeazle.
The 60 hz hum has just caused a problem with the demo. The power to the av thing has cut out. Ron is cursing. People in the audience are whistling difference tones to go with the squeal. Somebody is making multiphonics. Now somebody is playing a sine tone on their laptop. Somebody is sampling and granularizing the feedback. The talk has paused. I wish I’d been here for the start because it’s awesome, but without coffee, I’d still have missed it.
A grad student has just come sprinting in with a cable. And the squeal has ceased! Applause!
And the source of the squeal was an uninterruptible power supply. Which is why the av input died. Ohhhhhh! We are nearly back online. I wish this disaster had been at the start so I could have seen the whole thing.
The presenter, by the way is Chris Kiefer. Who is now resuming.
He is using a ctrnn to control a synth. And it can be reinitialized and mutated. This sounds cool, but I don’t understand how it differs from random numbers. Oh, you pick ones you like and evolve from there.
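
to convince myself it's not just random numbers, here is a ctrnn update step in python. It's the usual textbook equation with a simple Euler step. The textbook version uses a logistic sigmoid, and I've swapped in tanh because of the suggestion mentioned above. None of this is Chris's or Ollie's actual code, and the mutation function is the simplest thing I could think of.

```python
import math
import random


def ctrnn_step(y, weights, biases, taus, inputs, dt=0.01):
    """Advance neuron states y by one Euler step.

    tau_i * dy_i/dt = -y_i + sum_j weights[i][j] * tanh(y_j + biases[j]) + inputs[i]
    """
    out = [math.tanh(yj + bj) for yj, bj in zip(y, biases)]
    new_y = []
    for i in range(len(y)):
        drive = sum(w * o for w, o in zip(weights[i], out))
        dy = (-y[i] + drive + inputs[i]) / taus[i]
        new_y.append(y[i] + dt * dy)
    return new_y


def mutate(weights, amount=0.1, rng=random):
    """Return a copy of the weight matrix with small Gaussian changes."""
    return [[w + rng.gauss(0, amount) for w in row] for row in weights]
```

the states feed back into each other over time, so the output has a shape that depends on the weights. You keep the weight sets you like, mutate them, listen again, and that's the evolution.
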
http://bit.ly/SC-NNs
http://bit.ly/SC-CTRNNS
http://www.olliebrown.com/files/papers/ . . .