Sunteți pe pagina 1din 5

Sound and musical representation:

the Acousmographe software

Yann Geslin, Adrien Lefevre


INA - Groupe de Recherches Musicales, Paris
ygeslin@ina.fr - support@adlef.com

Abstract does not help much in characterizing the sonic differences


that are perceived by the listeners (Delalande 2001).
The “Acousmographe“ software has been developed by the
Over the years, experiments have revealed certain
GRM’s research team since 1991 to allow graphical
directions in the way one can transcribe sound and music
representations of sound. This software is specifically useful
recordings: first possibility is a coded representation of the
for annotating, analyzing and representing non-traditional
elements, and the second is a spectro-morphological
music, in particular music that doesn’t lend itself to
representation of sonic parts. The coded representation uses
traditional score representation. This software has been
a set of elementary and abstracted units related to the
widely used by musicologists, teachers and web designers to
sounds; it needs a reference sheet that explains the arbitrary
build interactive representations of electroacoustic, ethnic,
chosen relations between the drawing units and the sounds;
jazz or improvised music. A fully revised version will soon
some of the bests examples have been recently developed by
be released, that facilitates access for non-expert users
Lasse Thoresen (Thoresen, 2002, Couprie, 2003) (see
while including numerous new professional features. After a
figure 2). The spectro-morphological representation
general and historical presentation, the new data structure
attempts to draw and highlight some of the dynamic and
organization will be described and discussed.
spectral figures of the sounds (Smalley, 1986, Bayle, 1993).
It usually needs an FFT spectral background representation
1 Sound transcription that improves our perception and comprehension of the
Occidental music is generally represented by a score that spectral nature of the sounds. But this spectral approach
indicates the operations necessary to perform a piece. This misses lots of perceived information such as transients.
score is also used by musicologists to analyze the musical
content in terms of structural properties. In the case of non-
written music (ethnic, electroacoustic or improvised), the
description and analysis of the musical content is not easy.
A graphical medium is essential and has to be realized: this
representation is not a score, but a sound transcription
(Roads, 1995). This is in the sense that the result is a
representation of what one hears (nobody will try to play the
music from reading its representation), and that the visual Figure 2. Lasse Thoresen coded transcription of Ake
aspect of the representation is fully dependent on the Parmerud “les Objets Obscurs“
transcriber’s choices and points of view (Besson 1991).
Another main issue addresses the questions of the time
to sound relation: does the sound transcription have to be a
dynamic or static representation of the time? A static
representation is easier to realize, but of course misses some
obviousness of sound to spectral lack of accuracy. On the
opposite, the experience of adding a time cursor to some
achieved representation (i.e. the Ligetti’s Artikulation
transcription by Rainer Webinger or the Stockhausen’s
Figure 1. Bernard Parmegiani: “Enfer“ transcription Studie II genuine score) shows that one can accept or reject
some representation because of the lack or confusing excess
Furthermore, if one wants to analyze and compare of synchronized representation.
several interpretations of a classical piece of music (i.e. a
Bach prelude by famous pianists), the score is available but

Proceedings ICMC 2004


2 The Acousmographe history This release was designed by Didier Bultiauw, Mathieu
Reynaud and Jean-Baptiste Thiebaut (under the
Since the fifties, the GRM’s composers and researchers responsibility of Hugues Vinet and Emmanuel Favreau) on
have made a frequent use of sound representation, either for a Microsoft cross-platform environment; it is still in use, in
the study and description of sounds properties (in case of the its PC or Macintosh version and it is freely distributed.
need of a music representation at concerts) or even as a However, the software is heavy to control and use, and it
necessity to deposit their music at the Musical Rights does not easily meet some demands like automatic export of
Society. The first sound representations were made Flash™ and Quicktime™ sequences or the combination of
manually or with the help of early sonograms capacities. In musical score and graphic representation (in the case of
1972, an experience of an animated and graphical mixed music). Some major uses of the software can be seen
representation was realized by François Delalande and on the GRM interactive CD-Rom “Les Musiques
Jacques Vidal (Delalande, 1972), showing the interest of a Électroacoustiques“ (Groupe de Recherches Musicales,
time-synchronized representation. At the end of the eighties, 2001) and the analysis part of the GRM’s web site:
the need of a specific tool for sound and music “Portraits Polychromes“ directed by Evelyne Gayou and
representation was obvious, and the help of the early realized by Dominique Saint-Martin (Groupe de Recherches
computer drawing software was demonstrated, in particular Musicales, 2001-2004).
by Denis Dufour on his MSX™ personal microcomputer for
the analysis and teaching of the electroacoustic repertoire.
The first realization of the so-called Acousmographe
software was made by Olivier Koechlin around 1990 on a
request of the GRM’s director and composer François
Bayle. It was a prototype for the Supercard™ Macintosh
graphical environment that could superpose an FFT
representation to vectorial white and black figures on a 5 to
10 minutes sound file. Simultaneously to the graphical
representation, the sound file could be listened on a
Dyaxis™ card and later on the first Digidesign™ Sound
Designer audio card (Koechlin and Vinet 1991). The
software was intensively used despite its instability and
limited experimental capacities; it was also the base of an
interactive CD on sound representation and animation Figure 4. the Acousmographe version 2 (1995-2003)
(Besson, Koechlin and al 1993).
3 The need of new features
In 2003, the GRM was funded by the French
Educational Ministry to provide a new version of the
software that would be used in all schools for musical and
non-musical purposes. On the other hand, the experiment of
the two former releases of the software by very different and
professional users gave rise to a huge amount of expectation
and requests for a new version. Some of those requests
seemed not compatible, as for example the need for more
power and professional features and on the other hand for
more simplicity. A fully new development of the
Acousmographe was decided and then started in December
2003 by Adrien Lefevre for the main programming aspects,
under the direction of Yann Geslin and Emmanuel Favreau.
The new release is expected for December 2004. The team
decided to split the features in two frames: one to offer a
Figure 3. The Acousmographe version 1 (SuperCard) 1991 better access to inexperienced users, and a second to better
connect the software to common professional tools like
A second realization of the Acousmographe was decided score writing, drawing and multimedia software (Vinet,
a few years later, in particular with the purpose of allowing 1999). The core of the software will still be independent and
young pupils to draw and build their own representation of integrate the most useful and basic tools needed to make or
any kind of sound and music at school. A multiplatform show a full sound representation. Then a plugin structure
capacity was requested (Window PC and Macintosh
platform), as well as libraries and multi-layer specifications.

Proceedings ICMC 2004


will allow to import/export the most necessary parts of the sound representation can be obtained without too much time
representation from the Acousmographe to other software. or knowledge.
However, some very specific aspects remain, that have For the expert uses, it was observed that a significant
to be solved in the core. The first is the use of very long part of the users employed the Acousmographe as a starting
sound files and FFT files, with multichannel capacities and point, doing long FFT and automatic or manual
very good time-synchronization. A second is the ability to segmentation, then manually exporting the information
realize and lock a sound representation to an identified under their favorite graphical and multimedia software (see
commercial audio CD; this is for author’s rights protection figure 5). Besides this, the decision was made that it was
reasons as well as the necessity of reducing the required size impossible to a very small team to offer lots of the
of the files to download. The third is to allow very protected professional tools that experts want to freely use. As far as
uses, for example by children and non-expert users, and to possible the connection will be kept between such
offer a slide-show capacity that can be used by the teacher heterogeneous software, by allowing the exportation of the
during lectures. And the last one is to offer a better FFT representation layer, timeline data, segmentation
integrated and automatic sound segmentation, so that a draft detection and/or time and position lists of any drawing
layer.

Figure 5. Acousmographe + GraphicConverter transcription of Parmegiani “L’Oeil Écoute“ by Philippe Mion (2004)

Secondly, the Acousmographe files (transcriptions,


4 Software architecture and libraries) can contain a considerable number of graphic
symbols, and the sound files (and thus spectrograms) can
implementation have durations of one or two hours. The size of the data to
be treated is thus substantial (about 100MBytes in some
4.1 Specific problems and objectives of GBytes); the traditional solution consisting of loading
development integrally the file in the memory when opening a document
is not suitable, and specific solutions must thus be
The development of the new Acousmographe (version
considered.
III) is directed around several major stakes.
Lastly, the software must run under Windows XP, but
First, the Acousmographe is fully a software of graphic
also if possible under Mac OS X. The user interface of the
symbols edition; the diversity of the symbols for the
Acousmographe must be standard, reactive, intuitive and
representation of a timbre or a sound is potentially infinite.
simple to use. In particular, powerful undo/redo commands
These symbols can be of several natures (bitmaps, vectorial
are implemented, and a history feature of user actions
2D, even 3D), under very different data formats (Illustrator,
should be achieved.
Finale, GuidoLib, etc). This requires an open architecture,
allowing the addition and importation of new types of
symbols, without having to modify the data-processing code 4.2 Solutions
of the Acousmographe; it constitutes typically an The Acousmographe is divided into two distinct
architecture with plugins. software parts: the core and the graphic user interface. The
core contains all programs of pure processing data and the

Proceedings ICMC 2004


user interface is the software part allowing the control of the used to carry out calculations in a specific way: import of
core. Moreover, the core treats the operations in the form of graphic symbols under various formats, export of the
standardized "actions"; in return it is able to send messages transcriptions (scores) in the form of films, calculations of
to the user interface for some graphic refresh stage (see spectrogram FFT or wavelets, automatic positioning of
figure 6). temporal markers, etc. Even if the plugin API of the
Aiming a maximum portability and ensuring the Acousmographe remains exactly the same for one of these
recognition of the data-processing code by the greatest two categories, their operation is however very different.
number of compilers, the core is written in a C-ANSI The plugins managing graphical symbols constitute in
language. This guarantees good performances and allows fact classes with objects, each plugin/class inheriting from
the implementation on WindowsXP, MacOSX, and possibly another plugin/class. The mechanisms of the heritage
on other Unix. between plugins are not clarified here, but this part of the
Several solutions are considered for the realization of the software is extensively tested. The crucial point is that
GUI (Graphic User Interface) of the Acousmographe, by thanks to this system of classes with objects, families of
taking into account that it must be developed in parallel on plugins (layers, shapes, markers, etc) can be defined in a
MacOSX and WindowsXP. Moreover, the plugins also natural way, each corresponding family with a basic abstract
comprise modules of GUI (preferences panel, inspector, class, a standard interface and knowing of the core that
etc). The choice of an adapted cross-platform library of GUI allows an easy communication between core and plugins
(wxWindows, PowerPlant, FoxToolkit, etc.) is thus crucial (see figure 7).
for the effectiveness and the continuation of the software
and to preserve the simplicity of development of the Internal Classes Plugins
plugins.
Abstract Base Class

P
Layer

Actions Procs Back Layer

GUI Core P
Messages Signal Viewer

FFT Viewer

P
WaveletsView

Figure 6. Main modules of the Acousmographe


Shape

4.3 Plugins and objects


Polygon
The choice of a plugins architecture in the
Acousmographe is essential, because it is the only valid
Bezier Curve
mean that ensures in the future importation capacities of
new types of graphic symbols. It acts as an open software,
Image
comprising at its base only one central control device - a
container - whose content is managed by plugins. The
My Curve
plugins behave as many software elements that come and
connect themselves to this central control device; they can
3D blobs
be developed by other teams, each one bringing a new
symbolic notations to the software.
The manufacture of the plugins must absolutely remain
Figure 7. Inheritance diagram
simple, if not, the option to open the software would
become immediately obsolete. For this purpose, and parallel
to the development of the core, we carry out an API 4.4 The Acousmographe document: a tree
(Application Programmer Interface) that is simple, clear and data structure
documented. Whether in a library or a score, we find the same type of
The plugins of the Acousmographe line up in two hierarchy that acts as a tree data structure. In the case of the
principal categories: graphicals and calculators. The first library, we wish to gather and sort symbols, in folders and
allow the representation of the graphical symbols, and their sub-folders, as in a traditional tree structure of hard disk. In
management in the form of objects of class. The seconds are the case of the score, the most complete hierarchy can be

Proceedings ICMC 2004


described in the following way: channels contain layers, References
which on their turn contain symbols; an additional level of
Bayle, F. (1993). “Musique Acousmatique
hierarchy can be added, by gathering the layers in layer-
propositions…positions” pp. 224 - sheet XXV to XXXII. INA-
groups. In the graphical representation, the channels are Buchet/Chastel. Paris. ISBN 2-7020-1584-0
separated, and the layers are superimposed (see figure 8). Besson, D. (1991). “La transcription des musiques
The core manages this tree data structure in a generic électroacoustiques: que noter, comment et quoi?” In L’analyse
manner; it is able to assume standards operations such as: musicale, N°24, pp. 37–41. Paris.
cut, copy, paste, delete, add objects, etc. on the whole of the Besson, D., Koechlin, O., Bonfils, M. (1993). “Les
tree. Each one of these operations is coded, and saved in a Musicographies” a CD-rom produced by INA-GRM
file stack, thus allowing to preserve the history of the Couprie, P. (2003). “La Musique Electroacoustique:Analyse
actions and to ensure the operation of the undo/redo morphologique et representation analytique“ Université de
Paris IV Sorbonne - Ecole doctorale concepts et langages.
commands.
Delalande, F., and Vidal, J (1972). “Pierre Schaeffer: Partition
animée” (two graphical representations) - 16mm movie.
• Channel 01 Delalande, F. (2001). “Les outils matériels d’analyse du son” In Le
• Layer group 01 Son des musiques, entre technologie et esthétique, pp.
• Background layer (spectrogramm) 224–239. INA-Buchet/Chastel. Paris. ISBN 2-283-01850-1
• Layer group 02 Groupe de Recherches Musicales (2001). “Les Musiques
• Layer graphics 01 Électroacoustiques” interactive CD-Rom (in french), directed
• Symbol polygon by François Delalande
• Symbol polygon Groupe de Recherches Musicales: “Portraits Polychromes”,
• Symbol marker website (2001-2004), directed by Evelyne Gayou, realized by
• etc. Dominique Saint-Martin. http://www.ina.fr/grm/acousmaline
• Layer graphics 02 Koechlin, O., and H. Vinet. (1991). “The Acousmographe, a
• Layer graphics 03 Macintosh software for the graphical representation of sounds”
• etc. In Proceedings of the International Computer Music
• Layer group 03 Conference, pp. 586–588. Montreal: Faculty of Music, McGill
• etc. University.
Roads, C. (1995). “Notation from sound” In The Computer Music
• Channel 02 Tutorial, pp. 730–734. MIT Press. Cambridge. ISBN 0-252-
• Layer group 01 18158-4 - ISBN 0-262-68082-3
• Background layer (signal) Smalley, D (1986) “Spectro-Morphology and Structuring
Processes” in The Language of Electroacoustic Music, pp. 17-
40. ed. Simon Emmerson (New York: Harwood).
• Channel 03 Thoresen, L. (2002) “unpublished article” available at the library
• etc. of the Norwegian Academy of Music, Oslo, Norway. See:
http://www.ina.fr/grm/outils_dev/theorique/seminaire/semi_20
02/semi6/
Figure 8. A score hierarchy Vinet, H. (1999). “Concepts d’interfaces graphiques pour la
production musicale et sonore” In Interfaces homme-machine
et création musicale, sous la direction de Hugues Vinet et
François Delalande, pp. 97–121. Hermes. Paris. ISBN 2-7462-
5 Conclusion 0080-5
The new Acousmographe will be delivered at the end of
2004 and widely distributed to schools. A team will be
established to follow the use of the software, then collect
and show some of the best realizations. On the other hand,
the main part of the software should be offered as an Open
Source code to the computer music community. The GRM
hopes that further developments and contributions in the
professional domain will be possible, as well as exchanges
capacities with some of the best commercial software.

Proceedings ICMC 2004

S-ar putea să vă placă și