The Geometry of Music

The Geometry of Musical Chords
Eric Thul
Dr. Godfried Toussaint, Professor
Final Project for Pattern Recognition COMP644
December 4, 2006
1 Introduction
Hi! Welcome (that's me on the right).
Today we are going to talk about a little
music, a little math, and a little
geometry. But first, consider the
following.
Imagine if you will: You are a world
renowned music composer. You've
written hundreds of scores, and the
newswire has named you the hottest
composer since Johann Sebastian Bach.
How does that feel? Must feel quite
good. Quite good indeed...
So, now that I have a candid interview with the best composer since Bach, there is one question
weighing on everyone's mind. And that is, "How do you do it?" Computer Scientists, Mathematicians,
Psychologists, Musicologists, and even my next door neighbour want to know. How is it that you create
your compositions?
Perhaps this isn't a fair question. In fact, you probably cannot give a scientific answer. But that's okay.
Let's explore this question together. Let's try to find out what might be going in that mind of yours
when you sit down to compose a magnificent piece of music. However, I suppose that I should mention
there is one catch. And that is, we are going to have fun! Yes, indeed taking a journey to your musical
mind will undoubtably be hard work. But it will pay off. Maybe we will even get to compose a little
music together.
Are you in?
• Yes, I am totally in!
• No... and by no I mean yes
Good, I am glad we are together on this. Before we begin, let us consider one caveat: Scientists
currently (December 4, 2006) do not know how people compose music. This is a problem still waiting
to be solved. However, there are a few strong theories about what might be going on inside a composer's
head. How he or she may visualize the relationship within the music and see where they want to go
within a musical piece.
We will be exploring one such theory. This theory was proposed in a article called, "The Geometry of
Musical Chords" [1] by Dmitri Tymoczko of the Department of Music at Princeton University.
2 Music
Let us begin our jounery with a little music music theory that is. Now, I know that you are the world's
top composer; however, we should start from the very beginning. At the beginning there are two main
concepts in music theory which are important to us.
1. Harmony: Can be thought of in the simple case as two people each singing a note, where the
notes may be the same or different [11, p.55].
Figure 2.1: Example harmony with two people singing at the same time.
2. Counterpoint: Can be thought of in the simple case as a duet between two people singing,
literally meaning note against note [11, p.A14].
Figure 2.2: Example counterpoint between two people singing.
These definitions of harmony and counterpoint may not be very detailed, but consider the subsections
below which elaborate on these two important concepts.
2.1 Harmony
The definition of harmony that we've already seen captures the main idea. Sounding two notes together
pertains to harmony. However, we do not have to limit ourselves to just two notes. For example, we can
sound three, four, five, etc. Typically, Western music practices using harmony using two, three, or four
notes.
I realize that your skill in harmony must be of the highest level, but if you want to get a feel for
harmony, run this simple java applet (and make sure your speakers are set to "on"!). All you have to do
is click on the notes you want to hear together. You have to hold down Ctrl to select more than one note.
Once you press the button, the notes will all sound together.
If you tested out the applet, then you probably found combinations of notes that sound "good" and ones
which sounded "bad". Let us call the ones which sounded good (or more familiar) consonant, and those
which sounded bad (or less familiar) dissonant. These are musical terms which classify groups of notes.
For example, try the notes C,E,G. I bet it sounds pretty good, right? Well, that is what is known as
consonance. If you were to try C,C#/Db,F, then it might not sound as familiar to your as the previous
example. This is a dissonant (very dissonant) chord.
Another important aspect of harmony is the idea of chords. We've been discussing "groups of notes",
but really, the musical term for this is a chord. There are a few types of chords we are concerned with
[11, p.910, 55].
Chord Type Description Example
A dyad is the example of two people singing at the same time. It consists
Dyad
of just two notes.
Triad A triad can be thought of as three people singing at the same time. This
(trichord) consists of three notes.
We simple just add another voice. So in a tetrachord, there are four notes
Tetrachord
playing at the same time.
Table 2.1.1: Chord types
For each type listed above there are many properties which can be described about them. However, for
our purposes we are only concerned with the distance between each note in the chord. This means give
the chord C,E,G, we want to know how far C is from G. In order to find this out, we just have to count.
Consider the twelve notes in order as on a piano.
Figure 2.1.1: Notes on the piano
To find out the distance between C and G we can starting counting the semitones from C to G. This
gives us 7 semitones. But what happens if you want the distance from A to D. Well, once we hit B we
just wrap around to C since if you were to look at this sequence in the middle of the piano, you'll notice
it goes back to C again after B. This will give us 5 semitones. This cyclic sequence can be realized on a
circle. We can see that counting clockwise around the circle gives the distance between notes. Keep the
following figure in mind because this will become very important.
Figure 2.1.2: Pitch circle
So my composer friend, I hope that this discussion on the basics of harmony has been helpful.We now
are going to move to counterpoint which incorporates ideas from harmony.
2.2 Counterpoint
Recall from the previous section about harmony that we were concerned with a single chord and the
distance between the notes. This can be thought of as a vertical distance. In counterpoint, we are
concerned with having more than one chord. For example, having two chords played sequentially. Thus,
in counterpoint we are concerned with the horizontal distance between chords. Let's see how this
works.
Figure 2.2.1: Previous counterpoint example
In the simple example above, we see there are four chords that each have two notes. Each chord has a
high note and a low now. Just for the record, in the example we see two different clefs at the beginning.
The one on the top roughly looks like a G whereas the one on the botton roughly looks like an F (I
know it's a stretch, but that is how they get their nicknames: Gclef (treble) and Fclef (bass)). All we
need to know is that in the example, the notes are lower in the bass cleff than in the treble cleff.
Let's give names to these notes. The notes in the bass clef are called the bass voice. The notes in the
treble cleff are called the soprano voice. To measure horizontal distance between these two chords we
find the distance between each bass voice and each soprano voice. The following diagrams shows this
process for the first transition. On the left we see the transition from G to E in the soprano voice. Note
that since we are going from a higher pitch to a lower pitch, we go counterclockwise around the circle.
On the left we see the transition from C to G in the bass voice. Note that since we are going from a
lower pitch to a higher pitch, we go clockwise around the circle.
Figure 2.2.2: Horizontal distance in the Figure 2.2.3: Horizontal distance in the bass
soprano voice counting around the circle voice counting around the circle clockwise
counterclockwise from G to E is 3 semitones. from C to G is 7 semitones.
From this example, you can see that we can then calculate the distance between each note on each voice
transition.
Figure 2.2.4: Annotated counter point example where the distance between the notes are labeled in
semitones above for the soprano voice and below for the bass voice.
If you want to hear some counter point in action, check out this simple java applet which randomly
creates a melody from a given bass line.
Well, I know this must be a lot to absorb at once. But hey, you are the composer, so I hope so far so
good. If the counterpoint between dyads (chords of two notes) made sense, we can easily expand the
idea to triads or even tetra chords. This process adds more voices to the score. And also adds
complexity since more considerations come into play. For example, we now have to look at four (in the
tetrachord case) notes to harmonize and also the flow of four notes between chords. Consider the
following as an example of what voice leading with tetrachords might look like. And by the way, I am
not claiming the following is an awardwinning score. I just put it together for an example.
Figure 2.2.5: Counterpoint example with four voices.
From bottom to top we can name the voices: bass, tenor, alto, soprano [11, p.85]. Considering the four
voices, we must take care at the choices of each transition. This was also present in the example of two
voices, but with four voices, the idea stands out. This presents an important topic which is the basis for
our discussion here today: voice leading. We can define voice leading to be the interaction of the notes
in each voice and between each voice [11, p.85]. In other words, when we go from chord to chord, the
notes in the bass voice must be carefully chosen and the same goes for the other voices. But not only are
the transitions considered, but also the harmonies that each chords forms are considered. In general for
Western music, the bass voice along with the tenor and alto represent the underlying harmony, whereas
the soprano voice has more melodic freedom [11, p.85].
This idea of voice leading is what drives the curiosity of how a composer's mind works. Since a
composor must factor in all of these variables, a new idea presented by Dmitri Tymoczko [1]
incorporates mathematics and geometry to visualize what might be going on. However, to describe
what is going on, a bit of mathematics must first be introduced. But don't be afriad, I mean, you are a
great composor, so don't you have a flair for math? [10] Okay Okay, I won't make any assumptions.
Let's ease into the subject.
3 Mathematics
Let's start the discussion very basic. But I want you to keep in mind the previous section about music,
since that is how we are going to learn the necessary math.
Through this discussion, the basics will be introduced, then theory about pitch classes, and finally voice
leading will be discussed. These sections are very important because they form a basis for the
geometric construction that is to be presented. So, please check them out because they will greatly help
you in your future reading.
3.1 Basics
Recall from the previous section our circle of notes (as seen in the last section). Here we have twelve
notes. We made the relationship to keys on a piano by saying that if you travel clockwise around the
circle (to the right on a piano) then you will eventually hit your starting point again. As you play each
note the tone (frequency) that you hear gets higher. Once you hit the starting note the tone will be twice
as high. Thus the frequency will be two times the first note played.
Let's take an example of this. Pick note C. If you start at C on our circle of notes and go all the way
clockwise around, we hit C again. This is the same note, but just an octave higher. The octave is a
musical term to describe this relationship. We first played C then played C again but an octave higher.
Consider the following picture.
Figure 3.1.1: Frequency of a note at octave 1 Figure 3.1.2: Frequency of a note at octave 2
We can see that there are twice as many periods on the right than there are on the left. The result of this
is a note which sounds an octave higher. The reason we are reviewing this is because all the notes on a
piano have some frequency. And we can represent these in a more simple manner by chaning the scale.
Consider the following equation [1, p.72], where f is the frequency some note on the piano and p is the
pitch number. Just to mention, this is the standard used by MIDI to represent pitch values.
p = 69 + 12log2(f/440)
Using the change of base rule [15], we can convert this into a function that you can plug into any
modern calculator. Just in case you wanted to test this out on your own.
p = 69 + 12ln(f/440)/ln(2)
In the above equations, we are scaling the pitch frequencies to a new logarithmic range. The idea for the
equation is based on the frequency of 440, which is the note A on the piano above middle C, where
middle C is in the fourth octave. As a matter of fact, the frequency 440 is ISO 16 [16] at the
International Organization for Standardization If you're interested, you can hear the note being played.
The following table shows in the fourth octave (starting at C4, the values for all of the pitch frequencies
in this new representation. Note that rounding took place to get an integer value because the frequencies
are not exact. They are from here.
Pitch Frequency (Hz) Integer Value p
C 261.8 60
C#/Db 277.246 61
D 293.74 62
D#/Eb 311.28 63
E 329.868 64
F 349.241 65
F#/Gb 370.238 66
G 392.176 67
G#/Ab 415.477 68
A 440.348 69
A#/Bb 466.475 70
B 494.017 71
From the table, we see that we get values in the range [60,71] at increments of 1 per semitone. This is
very convenient because it yields sets of 12 values per octave. Note that we start at octave zero, note C
which has integer value 0. So, we only need this set, {0,1,2,3,4,5,6,7,8,9,10,11} to specify our pitches for
octave zero.
With this idea in mind, we can go one step further and represent all the pitches of all the octaves using
the set of twelve integers above. We can do this by taking any number in the set and adding 12 * octave.
It is important to mention that this logarithmic scale actually goes lower than the notes on the piano,
since we can have octave = 0. So, if we wanted to get C in the fourth octave (on the piano) we have to
multiply by octave = 5. The idea is just that the notes on the piano are a subset of this huge integer
range.
3.2 Pitch Classes
Now that we have established that we can generate any note using {0,1,2,3,4,5,6,7,8,9,10,11} plus 12 *
octave, we can say something a little more formal.
Consider again the the set of pitches, Pitches = {0,1,2,3,4,5,6,7,8,9,10,11}. How can we define a set to
contain all of the pitches of any octave. For example, we want a set containing 0,12,24,36,etc... which
represents the pitch C in any octave. Well, we can use the following set definition where we can pick p
to be any pitch.
PitchClass = {p + 12k such that k is an integer >= 0}
The above is a pitch class. This makes it convenient generally refer to pitches and even caculate the
distance between pitches. If you're wondering how this distance is calculated then consider the equation
below [2, p.1].
Pitch Distance = |qp|, where p,q are pitches
The above equation calculated the integer absolute value of the difference between the two pitches p
and q. For example, take pitch p=9, note D and take pitch q=2, note A. We can apply the equation: |29|
= |7| = 7. This is the distance between the two pitches. Now, consider the following equation to
calculate the distance between two given pitch classes [2, p.1].
Pitch Class Distance = ||ab||12Z , where a,b are pitch classes
The notation here might seem a bit strange, but let me explain. First we have to recall that our pitches
can be seen on a circle, such that going clockwise around the circle once will land on the same note, but
at a higher octave. And going around counterclockwise once goes back down an octave. Let's just
remind ourselves with a picture.
Figure 3.2.1: The mapping between the integer notes and our pitches.
Looking at this picture, we want to find the smaller distance either going clockwise or counter
clockwise to get from one pitch class to the other. For example, if we look at the pitch class a=2 and
b=10, we see that the distance to travel around the circle is 3 going counter clockwise and 9 going
clockwise. So, 3 would be our pitch class distance.
3.3 Voice Leading
Now that we are comfortable with calculating the distance between one pitch and one pitch class, we
can in fact calculate the distance between multiple pitches or multiple pitch classes. When we consider
multiple pitches/pitch classes, we are essentially considering chords.
Recall the idea of a chord, or just a set of pitches. For example, P={C,E,G} is a set of notes. Also,
consider another set of pitches, say, Q={D,E,A}. We can define a voice leading to be the following. Fix
some ordering of σp=(E,G,C) and fix some ordering of σq=(D,E,A). A voice leading is a transition
between these two orderings. So we have (E,G,C) → (D,E,A). Notice that we can have many different
voice leadings between these chords. But for now, we just picked one.
In order to determine the size of this voice leading, we can use our distance equations listed above.
Now, notice that I just gave a set of pitches without any indication of octave. This is important. What I
should have said, to be precise is that I am referring to pitch classes, which we talked about before. So,
the actual octave doesn't play a role. We can safely ignore the octave since we are concerned with
efficient voice leadings, meaning minimal distance from chord to chord. This implies that we pick the
best octave to achieve this goal of minimality [3, p.7] I mean, would you pick octave 8 to go to a note
in octave 2? ..I didn't think so. That would just unnecessarily give huge distance.
Okay. So now that we have that out of the way, let's again consider our voice leading of pitch classes.
(E,G,C) → (D,E,A). We know that since these are pitch classes we have to use the second equation for
distance. So, let's apply the equation and generate the following table to summarize results.
Chord (E,G,C) Chord (D,E,A) Pitch Class Distance

E D 2
G E 3
C A 3
We can also view this on the circle. Consider the following representation which shows the same chords
and distances as in the table above. Note that each pitch is color coded by voice. So a red circle on the
left is the same voice as the red circle on the right.
Figure 3.3.2: The chord DEA
Figure 3.3.1: The chord EGC
As a result from either interpretation, we have what is called a displacement multiset [2, p.2]. The
displacement multiset is just a set with duplicate entries allowed which contains the distances from the
voice leading. So, from the above we have the set {2,3,3}. When we are comparing different possible
voice leadings between sets of pitch classes, we use the displacement multiset. This gives us the
intervals between the pitch classes, independent from transposing (rotating all the pitches in the chord
by the same amount in the same direction) or inverting (flipping the circle about the centered y axis,
but leaving the labels in place) a voice in the chords. So, by using the displacement multiset, we have a
method for comparing different possible voice leadings to find the best one.
In order to find the most efficient voice leading, we must take into consider three main ideas. The first
pertains to voice crossing (to be explained). The second pertains to whether each pitch class set has the
same cardinality or not. And the third is with regard to element uniqueness in the pitch class sets.
Let us quickly take care of the pitch class set uniqueness. Let us defining each pitch class set to be a
multiset. So, we are allowed to have duplicate pitch classes in our multisets [2, p.2]. Now consider the
following two subsections for our other considerations.
3.3.1 Consideration A: Voice Crossings
We can think of voice crossing simply as when two voices cross. So, recall that we have four standard
voices: bass, tenor, alto, and soprano. Typically the bass sings the low range, the tenor sings in the low
mid range, the alto sings in the midhigh range, and the soprano sings in the high range.
When there is no voice crossing, the voices never stray from this model. So, the bass will never sing
higher than the soprano, and the soprano will never sing lower than the bass. This might be a rediculous
example, but it gets the point across. More often voice crossing may occur between the tenor and alto.
But with no voice crossing, the tenor will never sing above the alto and the alto will never sing below
tenor. Consider the following example of two chords: one without voice crossing (left) and one with
voice crossing (right). The colored notes show each voice, blue=bass, green=tenor, orange=alto,
red=soprano.
Figure 3.3.1.1: Voice leading with no Figure 3.3.1.2: Voice leading with the orange and
crossings green voices crossing
Now that we know what voice crossing is, when finding the most efficient voice leading all voice
leadings with can be ignored. Voice leadings without voice crossings always produce a smaller voice
leading size [2, p.4]. Before the theorem and proof idea is shown, we must first review an important
constraint when comparing voice leadings.
The displacement multiset we've been working with satisfies the distribution constraint [1, p.1]. So, we
know that in the displacement multiset, there are properties which it assumes. Let's take the example
we had before of {2,3,3}. Consider the following properties [17].
1. Reflexivity, by picking any element from the set, the element is less than or equal to itself.
2. Transitivity, by picking any three elements x,y,z from the set, if x ≤ y and y ≤ z then x ≤ z.
3. Totality, by picking any two elements x and y, we see that x ≤ y or y ≤ x.
With these properties, the distribution contraint has two requirements [2, p.2].
1. That given the sum, k of all the elements in a displacement multiset D, then the following holds:
{k,0,0,...,0n} ≥ {d1,...,dn} ≥ {d1/n,...,dn/n}. This means that a more even multiset with the sum
k is potentially smaller than an uneven multiset with sum k
2. If any element in the multiset is increased, then the total number of elements in the multiset is
not decreased.
If a displacement multiset satisfies both requirements then it strictly satisfies the distribution contraint.
Here is the theorem and proof sketch for this claim [2, p.4].
THEOREM 1 (Tymoczko 2006). Let P and Q be any two nnote multisets of pitches, and
let our method of comparin voice leadings be a total preorder satisfying the distribution
constraint. Then there will exist a minimal bijective voice leading from P to Q that is
crossing free. If the total preorder strictly satisfies the distribution constraint, then every
minimal bijective voice leading from P to Q will be crossing free.
PROOF IDEA. There are two directions which are shown. The first direction is that if the
method chosen to compare voice leadings obeys the distribution contraint, then there is a
bijective voice leading between between any two chords that is crossing free. The second
direction is that if the chosen method for comparing voice leadings violates the distribution
contraint then some crossed voice leading will be smaller than the uncrossed version of the
voice leading [2, p.45]. The full proof is a bit long, so it wasn't included here. However, the
main idea exploits the properties of the distribution contraint. This shows that given a voice
leading from one chord to another, swaping the voices in one chord does not make the voice
leading size any smaller.
This is an important result since it rids us of having to deal with voice crossing possibilities in the
search for an efficient voice leading. Given this, we can now discuss the possibility of having multisets
of pitches with different cardinalities.
3.3.2 Consideration B: Bijective and NonBijective Pitch Class
Sets
To clarify notation, a bijection between pitch class sets means that we have two multisets with an equal
number of elements. For example there is a bijection between, {C,E,G} and {D,F,A}. We can create a
onetoone mapping between the elements. A nonbijection would be if the multsets did not have the
same number of elements. So the multiset with the smaller number of elements would map to two or
more elements of the larger multiset. Consider {C,G} and D,F,A, there is not a onetoone mapping
between them; someone has to share!
3.3.2.1 Function on Bijective Chords
Now, before we can draw any conclusions from this, let us explore what we know about the simple
case: a bijection of a pitch class set with itself. For example, given the pitch class set {C,E,G}, what are
the possible voice leadings to {C,E,G}.
Let us define two functions: transposition and inversion. Consider each one separately below.
• Transposition is denoted Tx where x is the number of semitones to transpose. For example,
when we transpose an ordered set of pitch classes (a chord), we add x to each element of the
chord. Consider the chord A=(C,E,G). We can transpose A by say, four semitones. We have the
following.
T4(A)=(T4(C),T4(E),T4(G)) =(0+4,4+4,7+4)=(4,8,11)=(E,G#,B)
We can say that these chords are related via transposition. And transposition is exactly the same
as a rotation of the notes around the circle. We can see this below.
Figure 3.3.2.1.2: After rotation by four units
Figure 3.3.2.1.1: Chord (C,E,G) clockwise, (E,G#,B)
• Inversion is denoted Ix where the function returns the values which send the input to zero; it
returns the inverse. Since we are dealing with a cyclic group remember that any negative
number is moving counterclockwise on the circle. For example, 1=11, 2=10, etc.... So, for the
inverse, let us consider the example of A=(C,E,G)
Ix(A)=(Ix(C),Ix(E),Ix(G)) =(0,4,7)=(0,8,5)=(C,G#,F)
We can say that these chords are inverses of each other. This also presents an interesting
geometric perspective where we have the mirror image. Consider the images below.
Figure 3.3.2.1.3: Chord (C,E,G) Figure 3.3.2.1.4: After inversion, (C,G#,F)
There is an observation which can be drawn given a chord A and some function F over that chord. If
there is a permutation of F(A) which yields a chord similar to A, then there is an efficient voice leading
between those chords [2, p.8].
Consider the following example where A=(C,E,G). If we take the inverse, we get A'=(C,G#,F). Now we
can permute the elements to arrive at σ(A')=(C,F,G#). This yields a displacement multiset of {0,1,1}.
Where in [2, p.811] there is not a clear definition of resemblence, this voice leading appears quite
efficient. So (C,E,G) → (C,F,G#) is perhaps an efficient voice leading.
We can draw two conclusions from this. The first is that if the chord has pitch classes which are all very
close to one another, like (C,C#,B) then there will only be an efficient voice leading when the applied
function returns a chord which is close to trivial. The second conclusion is that if the pitch classes
divide the octave evenly, which means we try to maximize the minimum distance between each pitch
class. For example, there are 4 semitones between each pitch class for {C,E,G#}. Consider the
following image.
Figure 3.3.2.1.5: Maximally Even Chord (C,E,G#)
From this, we can arrive at an efficient voice leading by transposition where we transpose by the size of
the chord. In this case, by 4. If we do this, then we will arrive at (E,G#,C), which is the same by
permutation.
3.3.2.2 Minimal Bijective Voice Leading
From the case discussed above, we can prove that the lower bound on the a voice leading between
transposition is from the chords which evenly divide the octave. For example, the {C,E,G#} multiset
described previously. This follows from the statement, "Thus the perfectly even chord... has the smallest
possible minimal bijective voice leading to all of its transpositions" [2,p.11]. As the chords become less
even then the voice leading among the transpositions tends to increase. Consider the following theorem
which presents this idea more formally.
THEOREM 2 (Tymoczko 2006). Let A be any multiset of cardinality n, and let our
method of comparing voice leadings be a total preorder satidfying the distribution contraint.
For all x, the minimal bijective voice leading between A and Tx(A) can be no smaller than
the minimal bijective voice leading between E and Tx(E), where E divides pitchclass space
into n equal parts.
PROOF IDEA. The main idea behind this proof uses the fact from the previous section
that with a maximally even chord that divdes the octave, we will have a voice leading size
of zero. Then by taking advantage of the distribution constraint, we can find that the sum of
the displacement multiset will be at most a constant which can be picked to be as small as
possible for the even chord. Then, considering a chord not maximally even, the lower bound
on the smallest sum of the displacement multiset works out to be at least that constant
picked.
From the theorem, we see that the chords which evenly divide the octave present the most effiecient
voice leadings among the transpositionally related chords.
4 Geometry
Ah, glad to see that you're still with me. You might be wondering: isn't the title of this project "The
Geometry of Musical Chords"? You are correct. However, we weren't ready to jump into the
geometry... until now. Here we consider the geometric aspects of what we've been talking about all
along, and hope to shed some light on how we can visualize what might be going on in that composer
mind of yours.
In a sense, we will use geometric spaces to navigate paths through music! And this is what Tymoczko
[1] describes to be how composers may write music. That they see this path through a musical
geometric space and navigate it. Along the way they discover interesting harmonies and choose paths
which yield efficient voice leading. So this is where all of what we've been talking about fits together.
From the very basics of music theory we mentioned, to the idea of what efficient voice leading is and
how to measure it, the geometry is what ties everything together. In this musicgeometric space, we
have chords which sit in this space and connect to one another depending on their boice leading. Also,
chords that evenly divide the octave will be in the center of this space.
Now, this might sound complex, but we are going to start simple and use a lot of pictures! So, let us
begin with the one dimensional chord geometry. This can be seen as a straight line, or if you prefer, we
can use a circle. Consider the following discussion on the one dimentional chord geometry.
4.1 One Dimension
You may have noticed that even if it wasn't explicitly state, we've been looking at a few geometric
aspects of music. Recall the figure of our pitchclass circle. This is what is known to be a pitchclass
space [1, p.72]. However, when we are just dealing with pitches, then we can view them as points on a
line that extends in both directions [9]. These are the one dimensional cases of our musical space.
Just to refresh our memories, let us take a look at the pitch space and pitchclass space in the figures
below.
Figure 4.1.1: Example of the pitch space on a line, the number in the point is the octave.
Figure 4.1.2: Example of the pitchclass space on a circle.
Now, you might be wondering how this space is navigated. Well, the basic form of motion is by step
around the circle, or on the line. And just for ease of discussion, let's consider only the pitchclass
space. This is because the circle is easier to visually represent motion through the space. This motion is
linear, so it is just going from one point around the circle to the other. If we are just talking about one
voice, then that voice can move around from pitchclass to pitchclass generating a melodic line.
Consider the following animated figure which shows the first few measures in the first verse of Bob
Dylan's Blowin' In the Wind.
Figure 4.1.3: The opening verse of Bob Dylan's Blowin' In the Wind, note that this is from
Sheetmusicplus.com
We can also view more than one voice moving in this space. For example, Dmitri Tymoczko created a
movie which shows a chord progression of Chopin on the circle. Consider the following movie below.
Note that I've included it locally for speed, but the orginal files can be found here

Figure 4.1.4: This shows Chopin's music on the circle. This was created by Dmitri Tymoczko.
Just to explore this further, consider the following harmonic plan of Beethovan's Violin Sonata Op. 24
("Spring") second movement, measures 3754. The following image was generated from an example by
Julian Hook [7, p.49].
Figure 4.1.5: The harmonic plan of Beethovan's Violin Sonata, Op.24. This was made from an example
by Julian Hook [7, p49].
From all of these images and movies, we can conclude that the overal movement is fairly simple. There
are no huge leaps around the circle whether we are looking at a single melodic line, or ever chords that
are rotating around the circle. This is an important observation because since many people consider
these musical examples to be "good", perhaps we can say that good music is related to the small size of
the movement. And this is important because it is exactly what Dmitri Tymoczko is trying to find out
[1]. He is curious about efficient (small) voice leading between chords.
Even though the one dimensional case is fascinating on its own, we can also look at a two dimensional
space to gain a little more insight to the voice leading between chords. Consider the discussion below.
4.2 Two Dimensions
Instead of a circle, we need some geometric structure to represent two dimensions. Now, I know what
you might be thinking, can we use an xy coordinate plane? The simple answer is we cannot. And this
is because our geometric space needs to have have a special property which we saw in the case of the
circular pitchclass space.
This property can be thought of in the following way. Imagine yourself standing on a piece of paper (or
wood if you're afraid to fall through the paper). Now, consider you have only one item, which is a
compass. But the compass only has eight directions on it: N,S,E,W,NE,NW,SE,SW. Now, let's consider
that you are in the very center of this space. What do you see?
Well, you see a music chord. Since this space is filled with music chords of size two. Now, since you
are in the middle, you will be standing on the chord which evenly divides this space. Since we have
twelve pitch classes, what two classes divide twelve evenly? The answer can be any two pitch classes,
just as long as they are 6 semitones apart, like C and F#/Gb. However, let's consider C and F#/Gb the
center because C is typically the starting point. Any other even chord is a transposition of (C,F#/Gb)
.
Recalling this notion of transposition, let's say that when you walk East, the chord you are standing on
is tranposed up by one semitone and when you walk West, the chord is transposed down by one
semitone. So, if you are standing on (C,F#/Gb) and walk East on unit, you will be standing on
(C#/Db,G). If you were to walk West by one unit, then you would be standing on (B,F)Let's invision
what we have so far.
Note in the figure that we are just considering a set of pitch classes, so the order doesn't matter. For
example, (C,F#/Gb)=(F#/Gb,C) Also, the figure uses the numeric notation of the notes, as we've seen
before. The images that follow are from Dmitri Tymoczko's program used to explore these geometric
spaces [4].
Figure 4.2.1: A slice of the full space to show what happens walking East or West.
Now, in the picture, you might notice something odd. The use of the [9,3]. This means that when you
step on this spot you are carried to the other 9 3 on far left of the picture. So, in a sense you can think of
this as circular behavior. You repeat to the other side once you hit a chord in square brackets.
And as you can see since order doesnt matter we have six unique ways to evenly split the octave with
two notes with (6,0) in the center.
Okay, so we have this space where we more or less have a circular representation. However, we take this
one step further when considering the other compass direction we can travel. Let's dive right into the
full space visualization and then go for the explanation. Consider the following space of chords size
two.
Figure 4.2.2: The full pitchclass space of Dmitri Tymoczko's construction [4].
We see the familiar bracketed hotspot which carries you to the corresponding chord. Also, notice that
walking up to the top or down to the bottom you are cut off and cannot go any further. Since this is a
discrete space, we have our limitations. But going left, right, or along the center diagonal brings you
around to the corresponding point (those chords have a bracketed counterpart).
You might be wondering what kind of geometric space has this strange property. Well, the space can
actually be mapped to a Mobius strip. This is a surface which only has one side. So, this means if you
keep following the surface, like walking East or West in our diagram, you will end up back where you
started. Maybe a picture will help. Consider the following Mobius strip.
Figure 4.2.3: Example of a mobius strip by Escher [18].
As you can see, we have these ants following around the path which keep walking forever. And they
cannot walk to the edge since they would fall off. But let us try to map Figure 4.2.2 to this mobius strip
to see if we can see what is going on.
Figure 4.2.4: Image of mapping some of the pitch class chords to the mobius strip [19].
Hopefully the above picture isn't too confusing. The big red band that is vertically on the back of the
image is where the chords are glued to their counter parts in brackets. However, to do this we needed to
give the picture in Figure 4.2.2 a halftwist so that they line up properly. This gives us our two
dimensional pitchclass space for chords. We can view the navigation of this space similarly as in the
one dimensional case, except now we can also look at aspects of voice leading.
We didn't encounter this in the voice leading section above, but there are different types of motion in
voice leading. Consider the following which describes each type. Note that we have two notes in each
chord.
• Contrary can be described as having each voice travel in the opposite direction. For example
when the soprano travels to a higher note in the next chord and when the bass travels down to a
lower note in the next chord we have contrary motion.
• Oblique can be described as one voice going up or down, while the other stays on the same note.
For example, when the soprano travels to a higher note and the bass sings the same note in the
next chord, we have oblique motion.
• Parallel can be described as when the voices either both travel up or both travel down together.
This yields parallel motion.
In the figures of our two dimensional chord space, we didnt have any lines connecting the points. Well,
we can add lines which show the different types of motion between the chords. Consider the following
three images which show these types of motion in our space.
Figure 4.2.5: this shows the lines which connect chords that move in contrary motion.
Figure 4.2.6: this shows the paths which connect chords which move in oblique motion.
Figure 4.2.7: this shows the paths which connect the chords that move in parallel motion.
We can also have some fun and see how songs travel through this space. Consider the following two
movies by Dmitri Tymoczko [4]. The first is Chopin moving in the 2D space and the second is a song
by Deep Purple in this space.

Figure 4.2.8: This shows Chopin's music on the mobius strip. This was created by Dmitri Tymoczko.

Figure 4.2.9: This shows Deep Purple's Smoke on the Water on the mobius strip. This was created by
Dmitri Tymoczko.
Notice that in the examples, the chords chosen stayed around the center of the mobius strip. And I mean
center when the mobius strip is cut and flattend out. Also interesting to note is that the rock song by
Deep Purple stayed very close to the middle of the space. These are the chords that divide the octave
most evenly remember. The Chopin musical piece also stayed close to the more even chords, but
ventured further from them.
The next stop is now the three dimensional case, or in other words having chords of length three.
Consider the next section's discussion.
4.3 Three Dimensions
We've seen that the mobius strip is a sufficient space to represent the properties of chords in two
dimension. But how are we going to create the notion of three voices (chords of size three) in a
geometric framework? The answer is very similar to what we already discussed. We don't use a mobius
strip, but we still need the property of a circular space.
Consider again that we are working with the chords formed by our pitchclasses. However, they are of
size three. This means we have a lot of possible chords. Here we can consider the permutations of each
set of pitchclasses to be different. So, this gives us a 123 or 1728 possible chords. Our space is divided
by these chords.
Recall that in the two dimentional case all we needed was a mobius strip. This mobius strip gave us a
space with a line which was used to glue points together to form the Mobius strip. Recall that red strip
in the figure above. This is important to note because we are working in two dimensions so we used to
endpoints to make this line. And this line is considered the base of the space.
In the three dimension case we have three points which compose the base. This implies that a triangle is
our new base. When we slice along the base and unwrap the space, we will have a three dimensional
prism with two faces which are triangles [2, p.19]. Consider the following diagram below which shows
a simple model of the space.
Figure 4.3.1: model of the three dimensional chord space [2, p.19]
We can imagine where the chords lie in this space. And recall that the two triangular faces are really
glued together. So this space wraps around. However, since the chords on the faces have to line up when
we glue, we must twist the space before putting the faces together. Instead of doing a full halftwist as
in the Mobius strip, we twist by 120 degrees. The chords in this space have similar positioning to the
two dimensional case. Meaning that the chords which evenly divide the octave are in the middle (that
grey line) and the chords which do not evenly divide the octave are on the edges. Consider the
following example.
Figure 4.3.2: model of the three dimensional chord space [2, p.19]
As you can see in the picture, as you move up or down along that dashed grey line, you get all the
chords which evenly divide the octave by transposition. And the chords along the edges are those which
don't divide the octave at all, namely CCC. This is the same basic ideas which was presented with the
Mobius strip. We have a center of this space, and then when you reach a face, you go around the circle
(since the faces are really glued together).
Let's look at the whole three dimensional space with all of the chords. This might look a bit confusing,
but it shows how many choices are possible when consider music composition. Note that these images
were generated from a program that Dmitri Tymoczko wrote to explore the space [4]. Note that in the
figure I've highlighted one of the triangular faces in red and also drew the dashed center line.
Figure 4.3.3: model of the three dimensional chord space [4]
The above figure might be a little rotated, but as you can see it is the same as figure 4.3.1 but with all
the chords inside. We also can have edges to show two of the different types of voice leading that we
described before: contrary and parallel. We also can see the line which connect single step voice
leading. This means going up or down by a single semitone. Consider the following images depitcting
these types. Note that these images show the types of voice leading between all three voices.
Figure 4.3.4: model showing contrary voice leading between three voices [4]
Here we have higlighted two chords 3,4,9 = D#/Eb,E,A and 4,5,8 = E,F,G#/Ab. We can see that the first
voice goes up: D#/Eb → E, the second voice goes up: E → F, but the third voice goes down: A →
G#/Ab. So we have contrary motion. And since there are only two directions (ignoring oblique motion),
have two voices are going to have to travel together while the other goes the opposite way.
Figure 4.3.5: model showinng parallel voice leading between three voices [4]
Again consider the highlighted chords. We have 3,4,9 = D#/Eb,E,A and 4,5,10 = E,F,A#/Bb. Here we
have all the voices going up: D#/Eb → E, E → F, and A → A#/Gb.
Figure 4.3.6: model showing single step voice leading between three voices [4]
In the single step we have two chords connected: 3,4,9 = D#/Eb,E,A and 2,4,9 = D,E,A. Here we can
see that only one voice moves by a single semitone or single step.
From each of the example, we have all the lines possible to allow the freedom of true composition. As
you can see, it gets complicated very quickly. So my friend the composer, what do you think of this
construction? Do you navigate these geometric spaces with ease as you compose your greatest works?
We don't know the true answers yet, but this notion of chord geometry provides a theory on what might
be going on backstage, so to speak.
The following section presents some of the mathematics behind the geometric constructions. Continue
reading if you like. Or you can jump ahead to the applet which let's you explore these spaces and
compose music of your own.
4.4 Higher Dimensions
Behind all of this geometry we've been talking about is a mathematical concept which describes such
spaces as the Mobius strip and the three dimensional prism with triangles for faces. The term
mathematicians use to describe these spaces is orbifold.
To define an orbifold, think of what we've been doing when we construct these chord spaces. Think of
the two dimensional case. We saw that we have a surface containing all the two chords of two pitch
classes. And we also saw that there are points on this surface that allow you to circle back to the other
side. To construct our space, we folded, twisted, and glued those points together which allowed you to
circle back to the otherside. This created our chord space and is in fact the way an orbifold is created.
Given a symmetric surface (our chords) and then by taking all the points which look alike we fold and
twist until the identical points have been brough together (glued) [6].
The origin of the term orbifold (orbitmanifold) is due to William Thurston [5] and is a special kind of
manifold. We will not get into manifolds here, but it is worth mentioning that the orifold is a kind of
manifold.
The notion used for this is as follows.
Symbol Meaning
This refers to a torus in n dimensions. You can think of a torus as a donut, or to be more
Tn
specific, a torus is the product of two circles [20].
This refers to the symmetric group in n dimensions, where we have the set of permutations
Sn
on the objects in S
Rn The set of all real numbers, in n dimensions.
12Zn The set of all positive and negative multiples of 12 in n dimensions.
Here we have a set of sets. Each element in Rn/12Zn is a set of integers. To see this, consider
Rn/12Zn the case when n=1. Here we have R12Z = { {0,12,24,36,...}, {1,13,25,37,...}, ...,
{11,23,35,47,...} }. You can think of these as our pitch classes.
Using the notation above, we can define an ntorus Tn to be Rn/12Zn. And we can then define our
orbifold to be Tn/Sn. This works out to [Rn/12Zn]/Sn, which is really Rn/[12Zn⋊Sn]. Now, we need a
special way to take the product between the groups 12Zn and Sn. We can define the product to be the
semidirect product. Without getting too much into group theory, this means that there is a group which
can be formed by taking the semidirect product between 12Zn and Sn [2, p.7]. This group retains
certain properties [22].
The group Rn/[12Zn⋊Sn] is our orbifold. These orbifolds can be constructed by describing what is
called a fundamental domain of 12Zn⋊Sn in Rn. What this means is that a smaller pattern can be used to
describe a larger pattern [21].
This is useful since it can be used to describe the the mobius strip in the two dimensional case and the
prism with triangular faces in the three dimensional case, and also describe geometric spaces up to
dimension n. Consider of all the points in Rn, only the ones where the coordinates are sorted in
nondecreasing order. Let them be of the form (x1,...xn). Now we further restrict these points so that xn ≤
x1 + 12 and that the sum of all the xi is greater than or equal to zero and less than or equal to twelve. By
gathering all these points we will form an n dimensional prism with the base being a figure in n1
dimensions, in this case a line. Consider the prism in two dimensions below with a line for the base
(highlighted in red).
Figure 4.4.1: 2D prism from the fundamental domain construction.
From the above we see something interesting. Recall that since we are working in a group modded by
12Z any negative number x is really 12x. This is exactly like going counter clockwise on our pitch
class circle. We just start at zero on the circle and step counterclockwise x times. So, in our space we
see that the point (6,6) is really the same as (6,6). Starting to seem familiar? It should remind you of
the following figure.
Figure 4.4.2: 2D chord space
If you tilt your head a little then this is what we saw earlier in the two dimensional case of chord
geometries. So, we see that algebraically we can describe this space. And Dmitri Tymoczko gives a
function for the twisting of the surface to match up points with thier counterparts [2, p.7].
O(x1,...,xn) = (x212/n, ...,xn12/n, x1+1212/n)
By using this function on all the points in our prism, we can find all the chords related by transposition
and with the pitch classes summing to the same value. In the orbifold, this function is a rotation when
the prism has odd dimension and a rotation and reflection otherwise [2, p.7].
One last note about the orbifolds. Those chords which lie at the center of the orbifold (that divide the
octave evenly) are known as Tsymmetrical chords. This is because by transposition, they are all related.
Also we can define the chords which lie along the boundary, furthest from dividing the octave evenly.
These are called Psymmetrical since up to permutation they are the same. For example CCC is a P
symmetrical chords. Lastly, we can define those chords which are inversionally symmetrical to be I
symmetrical chords [1, p.74].
5 Future Work
Some of the other application that this geometric representation can be used for is when dealing with
rhythm. The author states that since rhythms are cyclic patterns they can also be modeled as an
orbifold. Perhaps these spaces could be used to study African and other nonWestern rhythms [1, p.74].
6 Applet
This section is divded into two parts. The first is for those who just can't wait to use the Geometry of
Chords applet. The second part is for those who want to read a little about what the applet is about. In
either case, enjoy!
6.1 Running the Applet
I made the applet load off of a separate page because it is rather big. The dimensions are 900x800.
Sorry to those who have a smaller screen resolution, but I needed the extra pixels.
The following link will load the applet, and just be sure to give it a minute to load up. You should see
12 orange circles arranged in a circle along with some controls. If you dont see the 12 circles just give it
a second. Also, make sure your speakers are set to on!
Click To Load Applet
As a note, for better performance, if you copy the link location above and pass it to the java command
appletviewer like appletviewer copiedurl then it may run a little better.
6.2 About the Applet
The main idea behind this applet is to let you navigate the chord spaces which are proposed by Dmitri
Tymoczko and which have been explain in this tutorial. The applet provides two dimensions: the first
dimension as a pitch class circle and the second dimension as a pitch class grid.
The default screen that loads is the one dimensional pitch class circle. You can choose which dimension
you want to compose in with the dropdown menu on the topright of the applet. Also, there are options
to pick your rhythm and instrument.
Another important option is the voice leading lines option. This is only available in the second
dimension since the first dimension does not have different types of voice leading. For this you have
three options: parallel, contrary, and oblique. You can chose any of them or none at all to display.
You can change the options at any time, but to add notes to My Song you have to click on the circles in
the pitch class space. When you click on a pitch, it will be added to the list on the right which
represents your song. You can also remove any of the pitches by clicking on the pitch in the list and
then clicking the remove button.
At any time you can play the song that you're working on by clicking the play button at the bottom of
the applet. The play loop will continuously play your song until you hit the stop button.
As a special remark, if you pick some notes in dimension one and then switch to compose in dimension
two, you can have a song with both pitches and chords. Neat!
That is the main idea of the applet, below are some screen shots which show the applet in action.
Figure 6.2.1: Showing the initial loading of the applet in pitch class dimension one.
Figure 6.2.2: Showing an example in the second dimension.
Figure 6.2.3: Close up view of the options.
7 References
[1] Dmitri Tymoczko. The Geometry of Musical Chords. SCIENCE 313:7274, 2006.
[2] Dmitri Tymoczko. Supporting Online Material for The Geometry of Musical Chords. SCIENCE
313:7274, 2006.
[3] Dmitri Tymoczko. Scale Theory, Serial Theory, and Voice Leading. Under Review at Music
Analysis. http://www.music.princeton.edu/~dmitri/scalesarrays.pdf, 2006.
[4] Dmitri Tymoczko. Chord Geometries [Online].
http://www.music.princeton.edu/~dmitri/ChordGeometries.html.
[5] William P. Thurston. The Geometry and Topology of ThreeManifolds [Chapter 13].
"http://www.msri.org/publications/books/gt3m/, 2002.
[6] Heidi Burgiel. An Introduction to Orbifolds [Online]. The Geometry Center.
http://www.geom.uiuc.edu/education/math5337/Orbifolds/introduction.html, 1996.
[7] Julian Hook. Exploring Musical Space. SCIENCE 313:4950, 2006.
[8] Richard Webb. Calculated Tones. NATURE 442:149, 2006.
[9] E.K. Seeing the Music. Humanities, Then and Now 39:1, 2006.
[10] John Baez. This Week's Finds in Mathematical Physics (Week 234).
http://math.ucr.edu/home/baez/week234.html, 2006.
[11] Robert Gauldin. Harmonic Practice in Tonal Music, Second Edition. W. W. Norton & Company:
New York, 2004.
[12] Criag Stuart Sapp. Pitch to Frequency Mappings.
http://peabody.sapp.org/class/350.868/lab/notehz/, Accessed: 2006 Nov.
[13] Wikipedia. Chordal Space. http://en.wikipedia.org/wiki/Chordal_space, Accessed: 2006 Nov.
[14] Wikipedia. Pitch (Music). http://en.wikipedia.org/wiki/Pitch_%28music%29, Accessed: 2006 Nov.
[15] Wikipedia. Logarithm, Change of Base. http://en.wikipedia.org/wiki/Logarithm#Change_of_base,
Accessed: 2006 Nov.
[16] Wikipedia. A440. http://en.wikipedia.org/wiki/A440, Accessed: 2006 Nov.
[17] Wikipedia. Total Preorder. http://en.wikipedia.org/wiki/Total_preorder, Accessed: 2006 Nov.
[18] Mobius Strip Image. http://www.math.utah.edu/~lars/mathart/artpix/eschermobius_strip_II.jpg,
Accessed: 2006 Nov.
[19] Mobius Strip Image. http://www.atsweb.neu.edu/math/cp/blog/mobius.gif, Accessed: 2006 Nov.
[20] Wikipedia. Torus#Topology. http://en.wikipedia.org/wiki/Torus#Topology, Accessed: 2006 Nov.
[21] Wikipedia. Fundamental Domain. http://en.wikipedia.org/wiki/Fundamental_domain, Accessed:
2006 Nov.
[22] Wikipedia. Semidirect Product. http://en.wikipedia.org/wiki/Semidirect_product, Accessed: 2006
Nov.
[23] Music Note on index page. http://www.giculturalsociety.org/, Accessed: 2006 Dec.
[24] South Park Game. http://images.southparkstudios.com/games/create/sp_game.swf, Accessed: 2006
Dec.

The Geometry of Music

Încărcat de

Informații document

Descriere originală:

Drepturi de autor

Formate disponibile

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Drepturi de autor:

Formate disponibile

The Geometry of Music

Încărcat de

Drepturi de autor:

Formate disponibile

The Geometry of Musical Chords

Chord (E,G,C) Chord (D,E,A) Pitch Class Distance

S-ar putea să vă placă și