Documente Academic
Documente Profesional
Documente Cultură
USMARC and UKMARC are based on the materials, all MARC formats have undergone a
MARC format developed by the Library of considerable development. Today not only
Congress in the 1960s. The purpose of that books but also continuing resources, computer
format was to exploit the new computer files, maps, music, sound recordings, visual
technology for the distribution of catalogue material, and mixed materials are handled by
data. A number of basic characteristics, MARC formats.
therefore, are common to both formats: the MARC21 appears more appropriate than
main field groups, according to their overall danMARC2 when applied to cartographic
functions, are frequently the same. The materials and document collections. On the
descriptive title and originator entries are, in other hand, danMARC2 seems more
USMARC and UKMARC, placed in fields appropriate for the registration of music and
210-24X, and notes are placed in fields sound recordings. Fields, which apparently
500-599. UNIMARC (IFLA, 1994), on the correspond in the two formats, are often used
other hand, has a different division into main for data elements defined for different purposes
groups, and title information is not placed in in the two formats.
field 245, but in field 200.
Although MARC21 is often described as an Cataloguing rules and formats
international standard, it is only used in a The Danish format ``danMARC'' and the
limited number of countries. UNIMARC, Danish cataloguing rules were revised at the
which was created within IFLA, is in fact closer same time, which meant that the many
to an international standard. UNIMARC (first
interdependent issues could be dealt with in the
published in 1977) was meant to be an
right context. For instance, all examples in the
international exchange format between national
cataloguing rules are presented, as they should
bibliographic institutions. According to the
be formatted in danMARC2. The revision
preface to the latest edition (IFLA, 1994), it
meant moving away from AACR2. The most
was designed as a model for the development of
substantial change was the abolition of main
new machine-readable formats. UNIMARC
and added entry elements, which were replaced
has been applied in several south and eastern
by access points and a voluntary location
European countries as well as some Third
element. This difference is not very important
World countries.
This section has explained why MARC is not when searching on controlled forms of data,
equal to MARC21. In the following section, a since these can be isolated from non-controlled
more detailed description of the differences forms in search indexes. Since the choice of
between MARC21 and danMARC2 will be main entries and location elements is based on
provided. different principles, however, presentation as
well as location will differ according to the set of
rules employed.
Differences between danMARC2 and
Format structure
MARC21
In MARC21 as well as danMARC2 each field
danMARC2 was developed in Denmark in the contains at least one and often more sub-fields.
1990s. This was done in order to create a Each field code is followed by two indicators,
common format for all sectors and to modernise which can contain various informations on
the registration of library materials, since online treatment or interpretation of data. MARC21
catalogues (electronic publishing of does not have indicators and sub-fields in
bibliographic records) have obviously replaced control fields 001-008. In MARC21, indicators
the card catalogues. danMARC2 is in many are widely used for different purposes, e.g. for
ways a more modern format than MARC21. generating text as ``former title'' and to indicate
The format is in various ways adapted to the how many characters should be ignored when
technological development as it appeared in the alphabetising in the first sub-field. In
middle of the 1990s. In order to be able to danMARC2 indicators are only used in a very
handle types of material other than printed limited number of fields: relations of periodica
41
After MARC ± what then? Library Hi Tech
Leif Andresen Volume 22 . Number 1 . 2004 . 40-51
in field 860-879, and fields defined for local secondary authors (sub-field *f). Colon before
purposes. sub-title and slash before authors has to be
In the descriptive fields in danMARC2, the keyed in:
division of fields into sub-fields follows ISBD's MARC21:
division into elements. This facilitates the 245 04 *a The listing attic ; The unstrung harp /*c
generation of separating characters in displays by Edward Gorey
and printouts in ISBD and other formats. 740 01 *a Listing attic
MARC21's descriptive fields are not specified 740 01 *a Unstrung harp
in as many sub-fields as in danMARC2, and the
sub-fields are not repeatable to the same extent danMARC2:
as in danMARC2. MARC21 also requires 245 00 *a The ¨listing attic *a The ¨unstrung
typographical separating characters to be keyed harp *e by Edward Gorey
in, as in those cases where separating character Repetition of sub-field a (*a) and alphabetising
and a sub-field code coincide. symbol ``¨'' facilitates direct indexing. The
In a number of fields, MARC21 uses an smaller number of sub-fields in MARC21
alphabetising indicator to state how many means that design of indexes is less flexible and
characters should be ignored when the possibilities for variation of presentation
alphabetising in the first sub-field. In formats are more limited than in danMARC2.
danMARC2, the alphabetising indicator is To a certain extent, MARC21 compensates for
replaced by an alphabetising symbol ``¨'', which this by facilitating larger redundancy of data in
is placed where the alphabetising in a given the records.
sub-field is supposed to start. This is a more In MARC21 author and title of an added
flexible indication of alphabetisation, since the work are placed in the same sub-field:
alphabetisation is not tied solely to the first MARC21:
sub-field in a field. 245 14 *a The analysis of the law /*c Sir Matthew
Hale. The students companion / Giles Jacob
Examples
MARC21: danMARC2:
245 03 *a Le Bureau = La Oficina = Das BuÈro 245 00 *a The ¨analysis of the law *e Sir Matthew
246 10 *a Oficina Hale *x The students companion *e Giles Jacob
246 10 *a BuÈro
With a specific sub-field for ISBN it is possible
to use this information for system functions ±
danMARC2:
such as entry on ISBN in another system:
245 00 *a Le ¨Bureau *p La ¨Oficina *p Das
¨BuÈro MARC21:
020 ## *a 0877790019 (black leather) *z
Sub-field p (*p)[1] and alphabetising symbol 0877780116 : *c $14.00
(¨) facilitate direct indexing. An equals sign
before parallel titles must be keyed-in in danMARC2:
MARC21: 021 00 *a 0877790019 *c sort lúder *d $14.00 *x
MARC21: 0877780116
245 10 *a He who hunted birds in his father's
Codes are significantly different in MARC21
village : *b the dimensions of a Haida myth /*c
Gary Snyder ; preface by Nathaniel Tarn ; edited and danMARC2. The differences apply to
by Donald Allen application areas, definitions of the individual
codes, and the values of these. In MARC21's
danMARC2: field 007, a considerable number of
245 00 *a He who hunted birds in his father's position-determined one-character codes have
village *c the dimensions of a Haida myth *e Gary been defined for the statement of physical
Snyder *f preface by Nathaniel Tarn *f edited by details. They are entirely different from
Donald Allen
danMARC2's material type codes, and seem
MARC21 does not allow for differentiation more suited for library-technical functions such
between primary authors (sub-field *e) and as choice of play-back equipment and storage
42
After MARC ± what then? Library Hi Tech
Leif Andresen Volume 22 . Number 1 . 2004 . 40-51
conditions than for the generation of texts for an advantage of the MARC formats. Gradually,
search and presentation purposes. however, the text dependency has also turned
into a disadvantage. In practice the MARC
Multi-volume works format allows for only strictly bibliographic
In Denmark, two different methods for the data. Data such as reviews, indexes in their
formatting of multi-volume works are original forms, and image and sound files
facilitated, either independent registrations of cannot be added. Supplementary information,
individual levels in separate bibliographic which now and in the future will be used to
records or independent registrations of enhance registrations in the catalogues, cannot
individual levels in one bibliographic record. be exchanged in a standardised form.
MARC21 does not offer a general instruction as Furthermore, there will often be a loss of data,
to the formatting of multi-volume works. The because local additions cannot be read by a
most common practice seems to be a one-level recipient system.
method, where the collective work is described A MARC format is by nature a strict format,
in one record. Both British Library and the since it was originally intended for the
Swedish union catalogue LIBRIS have production of printed catalogue cards;
considered various methods in connection with therefore, the interaction between different
their shift to MARC21, but they have not made library systems is not reflected. The lack of
an unambiguous choice. flexibility means that local additions might
MARC21 is not very flexible. There is no hinder exchange between local systems and
possibility for registering multi-volume works in union catalogue systems. The various library
several-records structures, which is facilitated systems in the Danish public libraries have
and frequently used in danMARC2 records. invented their own non-standardised
The previous description of the differences peculiarities in order to update local records
between the two formats has served to illustrate with national updatings without overwriting
that USMARC, which has now been local additions/corrections. The research
re-christened MARC21, is a product of a 1960s libraries in Denmark do not interplay as much
technology. The Danish danMARC2 format with the union catalogue as the public libraries.
has been modernised and adapted to the online Nevertheless, the research libraries differ
age, but is nevertheless a product of the 1980s according to cataloguing policy and practice,
technology. The implementation of a MARC which causes matching problems in the union
format is in many ways time-limited and it has catalogue, where the identification and merging
hopefully been demonstrated that MARC is of records for the same title/edition are
much more than MARC21. hampered.
Large volumes of data cannot easily be
incorporated in a MARC record, although there
Problems with MARC is no fixed limit for the amount of data. The
author has seen MARC-based database systems
This section deals with a number of problems used for large volumes of text, which have
associated with the MARC formats. It must be appeared rather clumsy. This is primarily
stressed that these problems are associated with caused by the database systems, but the
the practical application of the MARC formats construction of MARC does not really support
and the problems mentioned could possibly be the display of continuous texts.
solved with a MARC format. Nevertheless, it is The foundation of MARC on the printed
important to address the actual applications of catalogue means that the implementation of the
the MARC formats and not what might, under MARC format is directed towards description
different conditions, already have been worked and presentation of the individual document on
out. card or screen. This means that it is difficult to
MARC is used for text-based data input. The handle relations between data that are
division into fields and sub-fields supports a described in different fields. In danMARC2,
relatively specific division of data. This is sub-field *aÊ[2] is used to tack fields together,
suitable for bibliographic records and therefore which means that, for example, connected title
43
After MARC ± what then? Library Hi Tech
Leif Andresen Volume 22 . Number 1 . 2004 . 40-51
FRBR recommends a basic level for the characterising and individualising data at all
functionality of bibliographic records. The four levels, in order to facilitate the
bibliographic record should help the user: identification and qualified choice among the
(1) To find all documents containing: records in a search set.
. the works for which a given person or The MARC formats support, with their title
corporation is responsible; and document orientation, the Paris principles,
. the different representations of a given while FRBR is oriented towards the handling of
work; relations between works as well as subject-based
. works on a given topic; and access. Denton (2003) has written about the
. works in a given series. relationship between FRBR and the cataloguing
(2) To find a certain document: rules. There is no pre-packaged model for the
. when name(s) of person(s) and/or implementation of FRBR in practice, but work
corporation(s) responsible for work(s) with implementing and applying FRBR will
in the document is/are known; continue on an international level.
. when the title of the document is So far, FRBR has not been incorporated in
known; and either a format or in cataloguing rules, but the
. when the document identification is problematic issues are being studied and which
known. requirements, in relation to user interfaces in
(3) To identify a work. the library systems, FRBR would entail.
(4) To identify a representation of a work.
(5) To identify a document. XML
(6) To choose a work. Extensible Markup Language (XML) is a
(7) To choose a representation of a work. simple and flexible text format derived from
(8) To choose a document. SGML (ISO 8879). Originally XML was
(9) To gain access to a document. designed to meet the challenges of large-scale
In these functional requirements the electronic publishing. XML is also playing a
bibliographic entities are defined as: more and more important role in the exchange
. A work is a delimited intellectual or artistic of a wide variety of data on the Web and
product. elsewhere. XML was developed by The World
. A representation of a work (expression) is a Wide Web Consortium (W3C), which develops
work given an intellectual or artistic form. interoperable technologies (specifications,
. A document (manifestation) is a guidelines, software, and tools) to lead the Web
representation of a work fixed in a physical to its full potential.
form. An example of XML: a sample of a
. An item is a copy of a document. MODS-record with title information:
<mods:titleInfo>
There are some discussions about the <mods:title>Sound and fury :</mods:title>
delimitation between these bibliographic <mods:subTitle>the making of the
entities (see, for example, Jonsson, 2002). punditocracy/</mods:subTitle>
</mods:titleInfo>
FRBR and MARC
An important difference between the Paris in MARC21 would look like this:
principles and FRBR is that FRBR requires that 245 10 *a Sound and fury :*b the making of the
punditocracy
a document contain at least one representation
of one work. It is not sufficient to facilitate a XML offers a more general standard for
search for the documents' forms of title and describing data for transport. This was not the
originator data, to facilitate collocation of an case before and when tools like MARC and ISO
author's works by using controlled forms of 2709 were developed.
names, and to facilitate collocation of the One changed condition is the space on discs
representations of a work. It is also necessary for and cables. An XML-formatted record takes up
the records to contain sufficient, unambiguous far more space than a MARC record, but the
data for identification at all levels as well as for requirement for space is no longer as important
45
After MARC ± what then? Library Hi Tech
Leif Andresen Volume 22 . Number 1 . 2004 . 40-51
only requires information about formats that libraries, archives and museums is published on
are present in the record. the Internet:
The same title field in the above example Common data elements for Danish
would look like this in MODS: Archives-Libraries-Museums:
<datafield tag="245'' ind1="1" ind2="0"> Title
<subfield code="a"> Sound and fury : Creator
</subfield> Contributor
<subfield code="b"> the the making of Publisher
the punditocracy</subfield> Subject
</datafield> Description
Date ± Refinements by: Created, Modified
This kind of supplement to ISO 2709 is
Type
obviously made in XML, but offers only some
Format
of the advantages inherent in XML. Certainly,
Identifier
XML is a mainstream technology, but it is still
Source
the MARC format with fields, sub-fields and
Language
indicators that is embedded in the record.
Relation
Coverage ± Refinements by: Spatial, Temporal
Interplay with other sectors and functions Rights
Date ± Refinements by: Created, Valid, There has been an ongoing discussion among
Available, Issued, Modified, Copyrighted, Dublin Core users as well as within the
Submitted, Accepted, Captured
DCMI[6] whether DC is to be acknowledged
Type
as the simple set or whether it should be greatly
Format ± Refinements by: Extent, Medium
expanded.
Identifier ± Refinements by: Identifier Citation
Source
Language
Relation ± Refinements by: Is Version Of, Is Conclusion
Format Of, Has Format, Is Replaced By,
Replaces, Is Part Of, Has Part, Requires, Is The development of the Internet means that
Referenced By, References data from different sectors are more visible and
Coverage ± Refinements by: Spatial, Temporal accessible than before. A natural consequence is
Rights that the borders between sectors is being
Audience demolished. This also applies to libraries. It is
Edition necessary to be open to the world outside and to
Location share standards and data with other sectors.
Bibliographic records should be much easier
The question of rights management turns out to
to reuse ± by other institutions as well as other
be a major issue outside library systems. This
has not been the case for the libraries, since libraries. The question, then, is: how to create
they, as far as book material is concerned, have descriptions with several layers, which can be
been covered by a clause in the copyright law, used in various contexts and still keep the
which states that loan can take place without an duplication of data to a minimum?
agreement from the originator. One possible model could contain a division
When comparing the examples above, one of data:
suggestion for a common selection
. Core data.
(alphabetical order) could be:
. Supplement: data generally necessary for
. Creator; libraries.
. Contributor;
. Supplement: national need for libraries.
. Date (available/created/published);
. Supplement: specific needs of the
. Description; individual library.
. Edition;
. Supplement: data necessary for
. Format (technical specifications); governmental information.
. Identifier; The core data could be selected with inspiration
. Language; from Dublin Core's 15 elements and
. Publisher; applications of Dublin Core in other sectors.
. Relation; The list in the previous section might act as an
. Subject; inspiration, because it illustrates the necessity
. Title; and for a general schema to look further than
. Type (genre). Dublin Core, based as that is on Internet
This could lead to a future catalogue based on resources. For data generally necessary for the
Dublin Core. But one has to bear in mind the libraries, there are a great number of sources.
basic principle for Dublin Core: to describe FRBR is undoubtedly important, but also data
resources on the Internet for the purpose of element standards like ISO 8459 can be
searching. The idea of a general format for considered. Supplements should not be seen as
describing Internet resources has to some a layer-upon-layer superstructure on the Core.
extent been successful. An obvious danger is Output will be many-faceted, and vary
just to use the Dublin Core to reinvent MARC. depending on different contexts.
The draft for DC-Library Application Profile Document descriptions could be embedded
illustrates a Dublin Core problem, since the in a common structure that reflects relations
element Edition is neither a part of Dublin core between documents in relation to the
nor any of the acknowledged refinements. FRBR-model's expression and work. It must
49
After MARC ± what then? Library Hi Tech
Leif Andresen Volume 22 . Number 1 . 2004 . 40-51
offer the user the chance to navigate between developments for a future catalogue practice.
different concrete documents as well as being Ideally, the suggested solution should be for
able to see their mutual relations. ``the whole world'', but not for all kind of
The transition from the present formats will resources. The idea in this article is related to
of course cause some problems. But, if a new describing and discovering ``information
format is sufficiently specific, various MARC resources'' which are primarily text-based.
formats can be created based on data from such Metadata for the registration of other resources
a system, thereby ensuring that new data can be ± from persons to nails and bricks ± are also
used in old systems. On the other hand, used, but the core metadata are completely
conversion from a MARC format is not always different.
easy, since the specificity will often be lower in Talking about core metadata for the whole
the recipient format. A comparison between world of information resources is also tackling a
MARC21 and danMARC2 shows that very big objective. In the practical world, there
MARC21 has a relatively low specificity. One will be different metadata sets for different
could consider a revision of MARC21 adapted communities and not all of them will be
to the online age. Although MARC21 contains connected. Yet it will be useful to work with this
many archaic traits, it would seem more goal as a point of departure. Though not
reasonable to look forward than back to what applying to the whole world, it will still be very
might have been a good idea some ten years valuable to bring the different MARC
ago. Furthermore, a format transformation is communities together. It should be possible to
quite costly. Over the past few years XML has work out a broader solution. In any case: such a
become a magic term. But embedding of solution will have the potential for being the
MARC data in XML is not enough. Replacing basis for interoperability between sectors.
ISO 2709 with a MARC XML format would be
useful, but does not allow for data interplay
with other sectors.
It is obvious that an XML format must be Notes
developed in cooperation with other sectors. 1 The delimiter symbol is represented by a ``*'' in this
This XML format must be able to contain article.
several layers in the description and at the same 2 The letter ``aÊ'' is a special Danish letter.
time accommodate different types of data. 3 Extensible Markup Language (XML), available at:
Embedment in XML is not perfect for all types www.w3.org/XML/
4 Metadata Object Description Schema (MODS),
of data, but for the most part an embedment available at: www.loc.gov/standards/mods/
with a trivial http-reference will be quite 5 Online Information eXchange (ONIX), available at:
sufficient. It is important to get the balance www.editeur.org/onix.html
right. We need international development 6 Dublin Core Metadata Initiative (DCMI), available at:
efforts, because the definition of core data has http://dublincore.org/
to be based on broad consensus in the
international library world. This also includes
the choice of XML. The library world will have References
to leave MARC for something different and
more flexible. I have tried to suggest in which Denton, W. (2003), FRBR and Fundamental Cataloging
Rules, Miskatonic University Press, available at:
direction they ought to move. Yet, there is a
www.miskatonic.org/library/frbr.html
long way to go and it is going to take time. (Der) Deutsche Bibliothek (1961), ``Statement of principles'',
XML presents some opportunities and it adopted by the International Conference on
could certainly be developed further. Primarily, Cataloguing Principles, Paris, October, available at:
a development of the FRBR-model and a www.ddb.de/news/pdf/paris_principles_1961.pdf
resulting change of paradigm in cataloguing and DokForm-gruppen (2002), Metadata til udveksling mellem
EDH/ESDH-systemer, IT ± og Telestyrelsen,
registration practice are needed. This article
Copenhagen, available at: www.oio.dk/files/
does not claim to have given an exhaustive horingsex-efter.doc
account of either FRBR or XML, but hopefully IEEE Learning Technology Standards Committee (LTSC)
there are some useful pointers to possible Learning Object Metadata Working Group (2002),
50
After MARC ± what then? Library Hi Tech
Leif Andresen Volume 22 . Number 1 . 2004 . 40-51
Standard for Information Technology ± Education Jonsson, G. (2002), The Basis for a Record in Major
and Training Systems ± Learning Objects and Cataloguing Codes and the Relation to FRBR, IFLA,
Metadata, IEEE P1484.12, available at: http:// Glasgow, available at: www.ifla.org/IV/ifla68/papers/
ltsc.ieee.org/wg12/ 052-133e.pdf
IFLA (1994), UNIMARC Manual: Bibliographic Format 1994, KatalogdataraÊdet (1998), danMARC2. Edb-format til
available at: www.ifla.org.sg/VI/3/p1996-1/sect.htm inddatering og udveksling af bibliografiske data i
IFLA Study Group on the Functional Requirements for maskinlúsbar form, Biblioteksstyrelsen, Copenhagen,
Bibliographic Records (1998), Functional available at: www.kat-format.dk/danMARC2/
Requirements for Bibliographic Records: Final Report, default.html
K.G. Saur, MuÈnchen, available at: www.ifla.org/VII/ Library of Congress (2003), MARCXML MARC21 XML
s13/frbr/frbr.htm Schema, available at: www.loc.gov/standards/
International Organization for Standardization (ISO) marcxml/
(1996), ISO 2709:1996 Information and Library of Congress, Network Development and MARC
documentation ± Format for Information Exchange, Standards Office (2000), ONIX to MARC21
ISO, Geneva. Mapping, available at: http://lcweb.loc.gov/marc/
International Organization for Standardization (ISO) onix2marc.html
(2003), ISO 15836:2003 The Dublin Core Metadata Open Archives Initiative (2002), The Open Archives Initiative
Element Set, ISO, Geneva, available at: http:// Protocol for Metadata Harvesting Protocol Version 2.0,
dublincore.org/documents/dces/ (not ISO-style, but available at: www.openarchives.org/OAI/2.0/
DCMI-style). openarchivesprotocol.htm
51