Sunteți pe pagina 1din 12

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/230855615

Advances in Intelligent Data Analysis IX, 9th


International Symposium, IDA 2010, Tucson, AZ, USA,
May 19-21, 2010. Proceedings

Conference Paper  in  Lecture Notes in Computer Science · January 2010


DOI: 10.1007/978-3-642-13062-5

CITATIONS READS
2 27

3 authors:

Paul R. Cohen Niall M. Adams


The University of Arizona Imperial College London
383 PUBLICATIONS   5,840 CITATIONS    132 PUBLICATIONS   1,910 CITATIONS   

SEE PROFILE SEE PROFILE

Michael R Berthold
Universität Konstanz
268 PUBLICATIONS   4,300 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

SAR on small molecules View project

Streaming Learning View project

All content following this page was uploaded by Paul R. Cohen on 24 February 2014.

The user has requested enhancement of the downloaded file.


Lecture Notes in Computer Science 6065
Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen

Editorial Board
David Hutchison
Lancaster University, UK
Takeo Kanade
Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler
University of Surrey, Guildford, UK
Jon M. Kleinberg
Cornell University, Ithaca, NY, USA
Alfred Kobsa
University of California, Irvine, CA, USA
Friedemann Mattern
ETH Zurich, Switzerland
John C. Mitchell
Stanford University, CA, USA
Moni Naor
Weizmann Institute of Science, Rehovot, Israel
Oscar Nierstrasz
University of Bern, Switzerland
C. Pandu Rangan
Indian Institute of Technology, Madras, India
Bernhard Steffen
TU Dortmund University, Germany
Madhu Sudan
Microsoft Research, Cambridge, MA, USA
Demetri Terzopoulos
University of California, Los Angeles, CA, USA
Doug Tygar
University of California, Berkeley, CA, USA
Gerhard Weikum
Max-Planck Institute of Computer Science, Saarbruecken, Germany
Paul R. Cohen Niall M. Adams
Michael R. Berthold (Eds.)

Advances
in Intelligent
Data Analysis IX

9th International Symposium, IDA 2010


Tucson, AZ, USA, May 19-21, 2010
Proceedings

13
Volume Editors

Paul R. Cohen
University of Arizona, Department of Computer Science
1040 East 4th Street, Tucson, AZ 85721, USA
E-mail: cohen@cs.arizona.edu

Niall M. Adams
Imperial College London, Department of Mathematics
South Kensington Campus, London SW7 2AZ, UK
E-mail: n.adams@imperial.ac.uk

Michael R. Berthold
University of Konstanz, Department of Computer and Information Science
Box 712, 78457 Konstanz, Germany
E-mail: michael.berthold@uni-konstanz.de

Library of Congress Control Number: 2010926371

CR Subject Classification (1998): H.3, H.4, I.2, F.1, H.2.8, J.3

LNCS Sublibrary: SL 3 – Information Systems and Application, incl. Internet/Web


and HCI

ISSN 0302-9743
ISBN-10 3-642-13061-5 Springer Berlin Heidelberg New York
ISBN-13 978-3-642-13061-8 Springer Berlin Heidelberg New York

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is
concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting,
reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication
or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965,
in its current version, and permission for use must always be obtained from Springer. Violations are liable
to prosecution under the German Copyright Law.
springer.com
© Springer-Verlag Berlin Heidelberg 2010
Printed in Germany
Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India
Printed on acid-free paper 06/3180
Preface

The background to IDA 2010, the 9th International Symposium on Intelligent


Data Analysis (IDA), is rather unusual. Previously, the symposia were held bien-
nially at European venues. Over this time, the IDA Symposium had established
an identity, a dedicated group of Program Committee members, and a regular
audience. However, this success had come at a cost to the original ambitions for
the symposium – concerned with interfacing AI, statistics and computer science
for important and difficult real-world data analysis problems – being compro-
mised in favor of more standard data mining content. IDA 2010 was organized
explicitly to re-align the IDA Symposia series with a set of objectives evolved
from the original ambitions. This should be construed not as a criticism of rou-
tine data mining research but rather as an admission that the IDA symposium
had taken the path of least resistance with respect to the call for papers and the
reviewing process.
This is the proceedings volume of IDA 2010, a special event held only a year
after the eighth symposium in an attempt to revitalize the area of IDA. There
were two major changes compared to previous symposia. First, the Call for Pa-
pers (CfP) was completely rewritten, placing great emphasis on algorithms and
systems that support modelling and analysis of complex real-world systems. Mo-
reover, the CfP explicitly discouraged submissions that might be characterized
as “incremental advances in data mining algorithms.” Second, the reviewing me-
chanism was extended to include a “senior Programme Committee,” in response
to perceived shortcomings in the existing reviewing process. In part, this was
an experiment in reviewing which is discussed in our contribution in the present
volume.
IDA 2010 took place at the wonderful Biosphere-2 in Arizona, USA, May 19-
21, 2010. The invited speakers were Lise Getoor (University of Maryland, USA)
and David Krakauer (Santa Fe Institute). The meeting received more than 40
submissions. While this may seem a low number, it should be interpreted in the
context of both a new CfP and a novel point in the annual conference calendar.
The Program Committee selected 21 submissions for publication. This inclu-
ded five papers which were focussed on important and challenging applications,
but were perhaps preliminary – the precise type of submission we were keen to
encourage.
It is a pleasure to express our gratitude to the many people involved in the
organization of the symposium and the reviewing of submissions. Some specific
thanks are in order. These proceedings would not exist without the efforts of
Richard Van Stadt. We are indebted to Lupe Jacobo for local organization.
VI Preface

Finally, we are very grateful for the generous support of a number of sponsors:
School of Information: Science, Technology and Arts, University of Arizona;
Sante Fe Institute; University of Konstanz, Germany, and the ALADDIN project.

May 2010 Paul Cohen


Niall Adams
Michael Berthold
Conference Organization

General Chair
Paul R. Cohen University of Arizona, USA

Program Chairs
Niall M. Adams Imperial College, UK
Michael R. Berthold University of Konstanz, Germany

Publicity Chairs
Elizabeth Bradley University of Colorado, USA
Jaakko Hollmén Helsinki University of Technology, Finland

Senior Program Committee Members


Niall M. Adams Imperial College, UK
Rob St. Amant North Carolina State University, USA
Tucker Balch Georgia Institute of Technology, USA
Michael Berthold University of Konstanz, Germany
Jean-Frannçois Bolicaut Université Lyon, France
Elizabeth Bradley University of Colorado, USA
Paul Cohen University of Arizona, USA
Werner Dubitzky University of Ulster, UK
João Gama University of Porto, Portugal
Lawrence Hall University of South Florida, USA
Howard Hamilton University of Regina, Canada
Jaakko Hollmén Helsinki University of Technology, Finland
Adele Howe Colorado State University, USA
Eammon Keogh University of California, Riverside, USA
Frank Klawonn University of Applied Sciences Braunschweig,
Germany
Joost Kok Leiden University, The Netherlands
Rudolf Kruse Otto von Guericke University, Magdeburg,
Germany
Xioahui Liu Brunel University West London, UK
Tim Oates University of Maryland Baltimore County, USA
Sajit Rao Massachusetts Institute of Technology, USA
Sunil J. Rao Case Western Reserve University, USA
VIII Organization

David Salmond DSTL, UK


Roberta Siciliano University of Naples, Italy
Michael Stumpf Imperial College, UK

Programme Committee Members


Christoforos Anagnostopoulos Imperial College, UK
Fabrizio Angiulli University of Calabria, Italy
Alexandre Aussem University of Lyon, France
Tony Bagnall University of East Anglia Norwich, UK
Bettina Berendt K.U. Leuven, Belgium
Daniel Berrar Systems Biology Institute, Tokyo, Japan
Klemens Boehm University of Karlsruhe, Germany
Christian Borgelt European Center for Soft Computing, Spain
Bruno Crémilleux University of Caen, France
Saso Dzeroski Jozef Stefan Institute, Slovenia
Fazel Famili IIT - National Research Council Canada,
Canada
Ad Feelders University of Utrecht, The Netherlands
Ingrid Fischer University of Konstanz, Germany
Adrian Flanagan Ofvigo, Finland
Elisa Fromont University of Saint-Etienne, France
Alex Gammerman University of London, UK
Gemma Garriga Université de Paris VI, France
Gerard Govaert UTC, France
Pilar Herrero Polytechnic University of Madrid, Spain
Eyke Huellermeier University of Marburg, Germany
Jiri Klema Czech Technical University, Czech Republic
Peter Kokol University of Maribor, Slovenia
Walter Kosters Leiden University, The Netherlands
Paul Krause University of Surrey, UK
Pedro Larranaga Technical University of Madrid, Spain
Nada Lavrac Jozef Stefan Institute, Slovenia
Hans-J. Lenz Free University of Berlin, Germany
Trevor Martin University of Bristol, UK
Dunja Mladenic Jozef Stefan Institute, Slovenia
Maria-Carolina Monard University of Sao Paulo, Brazil
Clayton Morrison University of Arizona, USA
Alberto Munoz Garcia Carlos III University, Spain
Mohamed Nadif Paris Descartes University, France
Detlef Nauck BT, UK
Andreas Nürnberger University of Magdeburg, Germany
Nicos Pavlidis Imperial College, UK
Mykola Pechenizkiy Eindhoven University of Technology,
The Netherlands
José-Maria Peña Technical University of Madrid, Spain
Organization IX

Ruggero Pensa University of Turin, Italy


Adriana Prado University of Antwerp, Belgium
Bhanu Prasad Florida A&M University, Tallahassee, Florida,
USA
Ronaldo Prati Universidade Federal do ABC, Brazil
Fabrizio Riguzzi University of Ferrara, Italy
Gordon Ross Imperial College, UK
Céline Rouveirol University of Paris-Nord, France
Stefan Rueping Fraunhofer IAIS, Germany
Antonio Salmeron University of Almeria, Spain
Maarten van Someren University of Amsterdam, The Netherlands
Myra Spiliopoulou Otto von Guericke University Magdeburg,
Germany
Martin Spott British Telecom, UK
Stephen Swift Brunel University, UK
Dimitris Tasoulis Imperial College, UK
Maguelonne Teisseire University of Montpellier, France
Hannu Toivonen University of Helsinki, Finland
Vincent S. Tseng National Cheng Kung University, Taiwan
Allan Tucker Brunel University, UK
Antti Ukkonen Universitat Pompeu Fabra / Yahoo! Research,
Spain
Antony Unwin University of Augsburg, Germany
Dirk Van den Poel Universiteit Ghent, Belgium
Zidong Wang Brunel University, UK

Additional Referees
Jorn Bakker Natalja Friesen
Albert Bifet Aneta Ivanovska
Peggy Cellier Axel Poigne
Marcos Cintra Georg Ruß
Ivica Dimitrovski Pancho Tolchinsky
Fabio Fassetti Katerina Taškova
Table of Contents

Changing the Focus of the IDA Symposium . . . . . . . . . . . . . . . . . . . . . . . . . 1


Niall M. Adams, Paul R. Cohen, and Michael R. Berthold

Invited Papers
Graph Identification (Extended Abstract) . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Lise Getoor

Intelligent Data Analysis of Intelligent Systems . . . . . . . . . . . . . . . . . . . . . . 8


David C. Krakauer, Jessica C. Flack, Simon Dedeo,
Doyne Farmer, and Daniel Rockmore

Selected Contributions
Measurement and Dynamical Analysis of Computer Performance
Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
Zachary Alexander, Todd Mytkowicz, Amer Diwan, and
Elizabeth Bradley

Recursive Sequence Mining to Discover Named Entity Relations . . . . . . . 30


Peggy Cellier, Thierry Charnois, Marc Plantevit, and
Bruno Crémilleux

Integration and Dissemination of Citizen Reported and Seismically


Derived Earthquake Information via Social Network Technologies . . . . . . 42
Michelle Guy, Paul Earle, Chris Ostrum, Kenny Gruchalla, and
Scott Horvath

Detecting Leukaemia (AML) Blood Cells Using Cellular Automata and


Heuristic Search . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
Waidah Ismail, Rosline Hassan, and Stephen Swift

Oracle Coached Decision Trees and Lists . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67


Ulf Johansson, Cecilia Sönströd, and Tuve Löfström

Statistical Modelling for Data from Experiments with Short Hairpin


RNAs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Frank Klawonn, Torsten Wüstefeld, and Lars Zender
XII Table of Contents

InfraWatch: Data Management of Large Systems for Monitoring


Infrastructural Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
Arno Knobbe, Hendrik Blockeel, Arne Koopman, Toon Calders,
Bas Obladen, Carlos Bosma, Hessel Galenkamp,
Eddy Koenders, and Joost Kok
Deterministic Finite Automata in the Detection of EEG Spikes and
Seizures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
Rory A. Lewis, Doron Shmueli, and Andrew M. White
Bipartite Graphs for Monitoring Clusters Transitions . . . . . . . . . . . . . . . . . 114
Márcia Oliveira and João Gama
Data Mining for Modeling Chiller Systems in Data Centers . . . . . . . . . . . . 125
Debprakash Patnaik, Manish Marwah, Ratnesh K. Sharma, and
Naren Ramakrishnan
The Applications of Artificial Neural Networks in the Identification of
Quantitative Structure-Activity Relationships for Chemotherapeutic
Drug Carcinogenicity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
Alexander C. Priest, Alexander J. Williamson, and
Hugh M. Cartwright
Image Approach towards Document Mining in Neuroscientific
Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
Jayaprakash Rajasekharan, Ulrike Scharfenberger,
Nicolau Gonçalves, and Ricardo Vigário
Similarity Kernels for Nearest Neighbor-Based Outlier Detection . . . . . . . 159
Ruben Ramirez-Padron, David Foregger, Julie Manuel,
Michael Georgiopoulos, and Boris Mederos
End-to-End Support for Dating Paleolandforms . . . . . . . . . . . . . . . . . . . . . . 171
Laura Rassbach, Ken Anderson, Liz Bradley, Chris Zweck, and
Marek Zreda
Spatial Variable Importance Assessment for Yield Prediction in
Precision Agriculture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184
Georg Ruß and Alexander Brenning
Selecting the Links in BisoNets Generated from Document
Collections . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196
Marc Segond and Christian Borgelt
Novelty Detection in Projected Spaces for Structural Health
Monitoring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208
Janne Toivola, Miguel A. Prada, and Jaakko Hollmén
A Framework for Path-Oriented Network Simplification . . . . . . . . . . . . . . . 220
Hannu Toivonen, Sébastien Mahler, and Fang Zhou
Table of Contents XIII

A Data-Driven Paradigm to Understand Multimodal Communication


in Human-Human and Human-Robot Interaction . . . . . . . . . . . . . . . . . . . . 232
Chen Yu, Thomas G. Smith, Shohei Hidaka, Matthias Scheutz, and
Linda B. Smith

Using CAPTCHAs to Index Cultural Artifacts . . . . . . . . . . . . . . . . . . . . . . . 245


Qiang Zhu and Eamonn Keogh

Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259

View publication stats

S-ar putea să vă placă și