Documente Academic
Documente Profesional
Documente Cultură
To cite this article: Hee-Sook Shin (2003) Quality of Korean Cataloging Records in Shared Databases, Cataloging &
Classification Quarterly, 36:1, 55-90, DOI: 10.1300/J104v36n01_05
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained
in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no
representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the
Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and
are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and
should be independently verified with primary sources of information. Taylor and Francis shall not be liable for
any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever
or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of
the Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://
www.tandfonline.com/page/terms-and-conditions
Quality of Korean Cataloging Records
in Shared Databases
Hee-sook Shin
Downloaded by [DePaul University] at 20:39 14 November 2014
1. INTRODUCTION
number of CJK records entered into shared databases, particularly in the two
largest online cataloging databases in the United States, OCLC Online Com-
puter Library Center and Research Libraries Information Network (RLIN). By
the year 2000, the total number of CJK records both in OCLC and RLIN
reached almost 2 million. With this influx of records into the shared cataloging
environment, the quality of the records requires attention. Although two noted
studies assessed Chinese records (one study by Zeng and one by Simpson), no
one has yet formally scrutinized Japanese and Korean records. The purpose of
this study is to examine the quality of Korean-language cataloging records in
OCLC’s WorldCat in order to identify areas where errors occur frequently and
Downloaded by [DePaul University] at 20:39 14 November 2014
detect errors that inhibit record retrieval. The study follows a methodology
similar to that used by Lei Zeng in her 1993 thesis, An Evaluation of the Qual-
ity of Chinese-Language Records in the OCLC OLUC Database and a Study of
a Rule-Based Data Validation System for Online Chinese Cataloging.1 The
current study offers comparisons to Zeng’s findings and identifies shared error
patterns.
Based on the study’s results, the author makes recommendations towards
maintain quality Korean-language records and hopes to stimulate discussion
among libraries regarding the quality in the cataloging process. The present
findings and recommendations can be directly applied to CJK catalogers, and
perhaps to other catalogers as well, as they can be used in the development of
training manuals and workshops.
2. LITERATURE REVIEW
The large volume of literature on quality in cataloging and its impact on the
user reaffirms that this issue is a fundamental concern for scholars working in
the information field.2,3,4,5,6 Quality is well defined in the literature, as several
authors have proposed evaluative criteria and methodology.7,8,9 For instance,
Graham’s article defines the essential considerations in assuring quality cata-
loging, emphasizing extent and accuracy, while Thomas establishes that qual-
ity is “dynamic and dependent on the values and needs of cataloging users,”
reviewing options such as copy cataloging, cooperative cataloging, minimal
level cataloging, and outsourcing. Reeb, in light of a case study carried out by
his university, discusses how systematically to evaluate quality, concluding
that record evaluation could potentially improve a cataloger’s skills and under-
standing of the rules; however, he cautions that evaluation could be used
merely as a grading system.
Although no earlier studies assess the quality of Korean cataloging records
in the shared databases, several researchers have assessed general database
Hee-sook Shin 57
rean the 17th largest of the more than 200 languages represented in OCLC. As
of March 2000, the RLIN database contained 2,366,077 CJK titles, of which,
185,536 corresponded to Korean materials, with 145,822 records containing
vernacular scripts.19,20
As such a large number of CJK records have been added to the databases
over such a short period of time, the quality of the shared databases could have
been compromised, as records have been acquired from diverse sources and
formats. Even given established quality mechanisms, the overall quality of the
shared databases is questionable, as many records are as yet unchecked. This
study aims partially to remedy this by evaluating the quality of Korean records,
Downloaded by [DePaul University] at 20:39 14 November 2014
identifying those factors that might diminish the effectiveness of the online
cataloging system.
5. RESEARCH METHODOLOGY
(because they generally contain only the minimum number of required fields,
and thus it is unclear whether or not missing fields are errors made by catalog-
ers); (3) juvenile literature records (because such records are generally sim-
ple); and (4) Pre-AACR2 records (to ensure that all selected records followed
the same set of cataloging rules). Thus, this study examined only full-level (I,
L-level) records contributed by OCLC or RLIN (see Appendix 2).
These criteria resulted in a sample of 623 records out of a total of 2,000 rec-
ords, as indicated in Table 1 below. As OCLC and RLIN have different input
requirements for CJK records, the sample was divided into two categories:
(1) records created by OCLC members and (2) those created by RLIN mem-
Downloaded by [DePaul University] at 20:39 14 November 2014
bers and tape-loaded into OCLC WorldCat. Although some local libraries are
members of both organizations, records were categorized according to the en-
coding level in the fixed fields. For comparative purposes, the categories were
analyzed separately.
Format errors
F1 Incorrect field tag
F2 Incorrect indicator
F3 Incorrect or missing subfield code, incorrect sequence of subfields
F4 Incorrect or missing punctuation and space (as required by ISBD)
(e.g., incorrect ending punctuation)
Content errors
M1 Missing a whole field or containing an extra field
M2 Missing a part of the entry or containing an extra part for the entry (e.g., subfield = d in 100
field)
M3 Missing the corresponding vernacular field or containing an extra entry for a non one-to-one
name
S1 Inconsistency between romanized and vernacular data (including number, subfield code,
words, punctuation)
S2 Inconsistency between corresponding fields or values (mismatching fixed and variable
fields)
R1 Incorrect content in a whole field (e.g., missing access point for a second author or using an
entry different from the authority entry)
R2 Other content error (e.g., a series used a 440 entry which is coded nna (old heading) in the
authority file)
R3 Incorrect content (e.g, incorrect subfield or qualifier is partly missing)
R4 Incorrect call number
R5 Incorrect subject heading
Editing &
Inputting errors
E1 Incorrect upper and lower case
E2 Punctuation used CJK mode
E3 Space used CJK mode
E4 Other editing and inputting errors (e.g., misspelling of English words)
Romanization errors
RO1 Incorrect or misspelling of romanization
RO2 Incorrect word division (including hyphen usage)
RO3 Incorrect or missing diacritics
RO4 Incorrect or missing vernacular character
62 CATALOGING & CLASSIFICATION QUARTERLY
Each individual record was analyzed for: (1) completeness and (2) errors
Downloaded by [DePaul University] at 20:39 14 November 2014
Data was first entered into MS Excel spreadsheets (see Appendix 4) and
then transferred into the SPSS software program. Errors were compared both
by type and by frequency, and they were analyzed within their respective fields
and within the MARC format. Graphs and tables were used to compare errors
occurring in OCLC and RLIN, and errors in Korean records were compared
with errors in Chinese records. Comparisons were not made between individ-
ual member libraries and their contributions, nor was any attempt made to
identify the errors of specific institutions.
6. RESEARCH FINDINGS
accessed and identified in the database using information within encoded and
structured fields, was affected by errors, as the errors tended to be clustered
within certain field types, most notably the title statement (245) field and im-
print (260) field.
The sample included 148 error-free records (23.8%), a much higher per-
centage than that found in other studies (only 18% of records were error-free in
Zeng’s study). Reasons for this may include this study’s exclusion of incom-
plete records and the assumption that the records accurately reflected books
represented. Out of the 623 records, 58 were duplicates, 41 (8%) in OCLC and
17 (14.8%) in RLIN. Most duplicate records occurred in the OCLC system be-
cause words were romanized differently. The major cause of record duplica-
Hee-sook Shin 65
tion was variation in word division, followed by access points that differed
from authority headings and missing matching fields.
tion and spacing errors were introduced into the RLIN records when they were
tape-loaded into WorldCat, and thus errors were the result of differences be-
tween the two systems.
As illustrated in Figure 1, which displays the frequency distribution of error
occurrences, the maximum number of errors in an individual record was 16,
while the minimum was zero.
18
16
14
12
Downloaded by [DePaul University] at 20:39 14 November 2014
10
Error
⫺2
Reviewed Records
fields and values (mismatching of fixed and variable fields), incorrect call
numbers, and missing fields (for example, omitting the 041 field). Finally, the
260 field chiefly involved incorrect punctuation and inadequate spacing ac-
cording to ISBD standards, followed by incorrect romanization and incorrect
use of upper and lower case.
For the most part, the error rate and category distribution occurring in the
OCLC records mirrors that of the sample as a whole (see Appendix 5), with
one exception. In addition to punctuation, spacing, romanization, and word di-
vision errors, improper use of upper and lower case was also a significant oc-
currence in the 245 field. The OCLC sample also differs from the total sample
in that the third largest rate of errors occurred in the 5xx (or notes) fields. These
errors were generally related to incorrect punctuation and content, missing
fields, and incorrect use of upper and lower case.
With regard to the RLIN records, the 245 field similarly exhibited the
highest error frequency. Again, punctuation, spacing, romanization, and
word division errors were the most prevalent; however, inconsistencies be-
tween romanized and vernacular fields also occurred at a significant rate. The
TABLE 5. Number of Error Occurrences by Field and Category
RO RO RO RO
Downloaded by [DePaul University] at 20:39 14 November 2014
F1 F2 F3 F4 M1 M2 M3 S1 S2 R1 R2 R3 R4 R5 E1 E2 E3 E4 1 2 3 4 Total %
Fixed & 0xx 4 1 5 10 31 0 0 0 124 2 2 2 37 0 0 0 0 1 0 0 0 0 219 0.35
1xx 7 0 0 12 6 12 3 1 0 5 0 6 0 0 0 0 0 1 4 0 1 0 58 0.09
240 2 8 0 3 12 2 2 0 0 2 2 0 0 0 2 0 0 0 1 1 1 0 38 0.06
245 0 17 12 123 1 3 0 30 0 1 3 5 0 0 39 5 0 1 85 61 25 2 413 0.66
246/740 2 10 0 1 30 2 12 0 0 0 2 0 0 0 3 0 0 0 9 12 1 0 84 0.13
250 0 0 0 2 6 1 0 0 0 0 2 0 0 0 0 0 0 0 1 11 1 0 24 0.04
260 0 0 6 73 1 1 2 10 0 1 7 5 0 0 20 1 0 0 20 11 5 1 164 0.26
300 0 0 16 42 1 11 0 0 0 0 3 12 0 0 1 0 0 2 0 0 0 0 88 0.14
4xx/8xx 16 15 0 24 15 0 4 2 0 2 3 7 1 0 10 3 0 0 8 3 4 0 117 0.19
5xx 6 1 0 82 14 0 1 3 0 2 1 0 0 0 15 0 0 6 8 8 4 0 151 0.24
6xx 1 1 8 9 2 1 12 0 0 4 0 5 0 24 6 0 0 10 3 0 0 0 86 0.14
7xx 4 3 0 20 39 13 2 2 0 4 0 6 0 0 8 0 0 2 5 0 2 0 110 0.18
Total 42 56 47 401 158 46 38 48 124 23 25 48 38 24 104 9 0 23 144 107 44 3 1552 2.49
260 field proved to be the second most problematic area, mostly involving in-
correct punctuation and inadequate spacing, followed by incorrect use of up-
per and lower case and inconsistency between corresponding fields or values.
Errors occurring in the 300 field formed the third most frequent category and
were generally related to incorrect punctuation and inadequate spacing, omit-
ting or containing an extra part in the entry, and incorrect content (see Appen-
dix 5).
Normalization was used to account for variations in required MARC fields
and access points. For instance, although the 245, 260, and 300 fields are man-
datory in all records, the 1xx is required as applicable. Additionally, fields
Downloaded by [DePaul University] at 20:39 14 November 2014
such as the 5xx, 6xx, and 7xx fields may be displayed more than once in the rec-
ord. The method of normalization for the number of errors by field is the actual
number of errors in a field (y) divided by the total instances of this field occur-
ring in all sample records (x), giving the rate of error occurrence per entry. Ta-
ble 6 displays error occurrences by fields after normalization.
As Table 6 and Figure 2 illustrate, normalization changes the error rate dis-
tribution. Although the 245 field is still the most error prone, it is followed in
error frequency by the 4xx/8xx and 240 fields. Appendix 6 describes the error
rate distribution after normalization by record type (whether OCLC or RLIN).
MARC Field X Y R z
(Entry No.) (Error No.) (Rate = y/x) (r* Sample No.)
Fixed & 0xx 623 219 0.35 219.00
1xx 409 58 0.14 88.35
240 96 38 0.40 246.60
245 623 413 0.66 413.00
246/740 217 84 0.39 241.16
250 311 24 0.08 48.08
Downloaded by [DePaul University] at 20:39 14 November 2014
7xx 1xx
6xx
240
245
5xx 246/740
440/490/8xx 250
300 260
heading would have been more suitable, but these cases were not counted as
errors.
Tables 5 reveals that there were 546 format errors present in the sample,
while editing and inputting errors occurred 136 times, most significantly as er-
rors of incorrect capitalization resulting from mis-application of AACR2. All
23 errors involved the misspelling of English words. With regards to the 9 er-
rors involving the use of CJK-mode punctuation, this error type was included
in OCLC records because CJK-mode punctuation potentially affects record re-
Hee-sook Shin 71
Overall, the record quality in this study is comparable to that found in previ-
ous studies (e.g., Zeng’s and Intner’s). Although this study discovered errors
in the Korean records, they were for the most part minor and did not affect the
user’s ability to retrieve the record. Most were ISBD punctuation errors, and
although such errors do not typically block the record from use, they do affect
the way information is arranged and displayed in the database. On the other
hand, the error categories discussed below do pose some concern, for errors in
these categories potentially inhibit access to the record.
Similar to the findings of Zeng’s study, one of the major concerns in the
present study is incorrect romanization and word division. Such errors may af-
fect user retrieval and may lead to duplicate records. This decreases time effi-
ciency, for catalogers spend time both in evaluating duplicates and in creating
new records when they cannot find an incorrectly created record.
Errors in romanization are perhaps due to the current standard rules of Ko-
rean romanization and word division according to the LC romanization table,
which uses the McCune-Reischauer sytem. These rules are not clear and are
difficult for even a native speaker to follow. The complexities inherent in
romanizing Korean may also contribute to romanization errors. To further
complicate the issue, Korean dictionaries are not consistent in their use of
word division and Korean has an elaborate phonologic and orthographic struc-
ture, which leads to enormous problems in transcribing the language. Even
Korean catalogers can become confused when faced with variations in pronun-
ciation and word division. Furthermore, catalogers can transcribe the same
72 CATALOGING & CLASSIFICATION QUARTERLY
character in different ways (e.g., sam or salm; Hanguk hyondae munhak non or
Hanguk hyondae munhangnon).
Some Korean records contain Chinese and/or Japanese names or publish-
ers, and catalogers can become confused when inputting such records, using
different romanization systems instead of McCurn-Reinshauer, which in turn
causes problems in record retrieval. In one example, concerning a book pub-
lished in China but written in Korean, the 245 field was romanized using
McCurn-Reinshauer, while the 260 field was romanized using Wade-Giles (a
system for romanizing Chinese). According to rule 1.0E in AACR2, descrip-
tive cataloging must be consistent and use the same romanization scheme
Downloaded by [DePaul University] at 20:39 14 November 2014
throughout. Thus, when romanizing CJK characters, the 245, 250, 260, and
4xx fields in any single record should all adhere to the same system regardless
of language.
Although RLIN and OCLC CJK can display vernacular script, most Online
Public Access (OPAC) systems in institutions and libraries in the U.S. do not
support these scripts, and romanization provides the only access to these col-
lections. Therefore, romanization is a significant process and should be per-
formed consistently.
used in each record match the form established in the appropriate authority
file.
Authority records that contain vernacular characters are another concern.
Since several characters share the same pronunciation, many Korean names
share the same romanization, and thus identical authority headings can refer to
several different people. Also, vernacular characters in the 245 field can con-
flict with the standard authority form of the author’s name as it appears in the
100 field, making it difficult to determine the appropriate authority form. For
example, Korean characters may be entered for an author’s name as it appears
in the item, in the 245 field. However, the authority file does not indicate
Downloaded by [DePaul University] at 20:39 14 November 2014
whether the author’s established name should be entered in the 100 field using
Chinese or Korean vernacular characters, and, as a result, the author’s name
may appear in both character sets throughout the database. Errors of this type
usually occur in Korean and Japanese materials, particularly when Chinese
characters are used to display bibliographic information. Having the appropri-
ate vernacular characters available in the authority file would alleviate both of
these problems.
Missing access points not only impair record retrieval but also potentially
result in duplicate records, and this problem should be stressed so that all ac-
cess points are present when applicable. As access points that conflict with es-
tablished authority forms also pose a serious problem, heading consistency is
also important.
The 42 tagging and 56 indicator errors found in this study included incorrect
choice of main entry, incorrect indicator used in the 246 field, and improper
use of the 440 or 490/8xx fields. The majority of tagging errors occurred in the
440 and 490/8xx fields (16 errors) and the 100 and 700 fields (11 errors). In the
440 and 490/8xx fields, the reviewers only checked the tags against the author-
Downloaded by [DePaul University] at 20:39 14 November 2014
ity file, determining whether the series was traced properly or not, using either
a 440 or a 490/8xx combination. Errors in the 100 field usually resulted from
using the added entry rather than the main entry or vice versa. These errors were
related to either a lack of understanding of AACR2 Chapter 21 and/or a failure to
check the authority headings thoroughly. Tagging and indicator errors, though
more frequent in this study than in Zeng’s (tagging errors: 0.2%, indicator er-
rors: 1%), do not affect user retrieval of Korean records in the online environ-
ment. However, indicator errors should be considered for titles and data in other
languages, as indicators would affect the retrieval of these records.
The number of fixed field errors found in this study is significant, involving
8% of the sample. Inconsistencies between fixed and variable fields occurred
Hee-sook Shin 75
more frequently in this sample than in Zeng’s, where only 3% of the records
were affected. However, as the values in “contents,” “illustration,” and “in-
dex” are optional and do not affect record retrieval, they cannot be considered
major errors.
This study, like previous studies, revealed many catalog record errors.
Moreover, some errors in this study occurred in patterns similar to those found
in other studies (for example, the average number of errors per record found in
Downloaded by [DePaul University] at 20:39 14 November 2014
the present study was 2.49, while Zeng’s study found 2.23 per record). En-
hanced records in this study had a much lower error rate than that found in
original records. As in Zeng’s study, most errors involved ISBD punctuation
and other minor errors that do not prevent users from locating the record. This
study also showed, however, that there were some major errors that did impact
the usability of the record, notably romanization and access point errors.
As described above, errors were found most often in the 245, Fixed and 0xx,
and 260 fields. This distribution is similar to that found by Zeng (see Table 7
and Figure 3). Since the 245 field is the primary source of information for the
user, the rules for transcribing this field must be emphasized. Although errors
involving the matching of fixed and variable fields do not affect retrieval, con-
sistency is important, and these fields may provide useful search keys in the fu-
ture. As the 260 field does provide a search key, errors in this field may affect
record retrieval in shared databases. This study also found many errors in the
1xx and 7xx fields, mostly involving confusion between the main and added
entry.
Again, the greatest number of errors in this study involved inadequate ISBD
punctuation, followed by missing an entire field or containing an extra field,
and missing fixed field values as compared to the variable fields. These find-
ings were similar to Zeng’s findings, where ISBD punctuation and spacing for-
mat errors were the most frequent, followed by CJK-mode punctuation errors
and missing or extra field errors. Because CJK-mode punctuation errors do not
impede record retrieval in RLIN, this study excluded such errors.
Although RLIN had a higher error rate than OCLC, overall, error patterns in
OCLC and RLIN records were similar. Because of differences between the
two systems, errors such as punctuation and spacing errors were introduced
into records as they were tape-loaded into the OCLC system. This problem
needs to be addressed at a technical level.
Although the majority of errors occurred in the content and format catego-
ries, romanization errors posed the greatest concern. Romanization and word
division errors occurred frequently, rendering such records difficult to retrieve
76 CATALOGING & CLASSIFICATION QUARTERLY
0.35
Chinese records
0.30
Korean records
0.25
0.20
0.15
0.10
0.05
0.00
Fixed 1xx 240 245 246/740 250 260 300 4xx/8xx 5xx 6xx 7xx
& 0xx
Hee-sook Shin 77
from the system. It is thus clear that understanding Korean romanization and
emphasizing consistency between romanized and vernacular fields is crucial
to improving the quality of cataloging in Korean records. Missing and incor-
rect access points created a similar problem, as several entries failed to match
established and expected authority forms, complicating record retrieval. These
were serious errors impacting the efficiency and usability of the databases as
catalogers duplicated records already in the system and users would presum-
ably have to work to locate items based on substandard records.
Downloaded by [DePaul University] at 20:39 14 November 2014
example, has twelve different authority entries, which likely describe different
individuals, and correspond to different vernacular characters. Including ver-
nacular characters would also identify which language characters (Chinese or
Korean) catalogers should use in the fields as access points.
4. A duplicate detection program is currently available to detect and elimi-
nate non-CJK duplicates in the OCLC database. The database would benefit
from a similar program to detect CJK duplicates. Future versions of OCLC and
RLIN could also provide an automatic romanization function for Korean ver-
nacular characters, modeled after that currently used in cataloging Arabic and
Downloaded by [DePaul University] at 20:39 14 November 2014
8. CONCLUSION
Quality in cataloging is a vital issue in library services for both patrons and
staff. Cataloging is reemerging as a core service in libraries, fostered by the
ever increasing body of electronic resources in which quality becomes even
more important as the record serves as the only access to the material. The gen-
Hee-sook Shin 79
eral availability of all library resources is directly affected by the quality of the
library database.
The library staff is responsible for maintaining the quality of the local data-
base and should perform regular quality searches using various strategies.
Also, individual catalogers must always be on the alert for errors that they fre-
quently find (or make) and work to remedy them. Although persistence and at-
tention to detail are required, the resulting database will be helpful both to the
public and to technical services staff and will reflect positively on the library.
Shared databases should also be regularly maintained because errors have a
wide effect on the efficient use of the system. In foreign language cataloging,
Downloaded by [DePaul University] at 20:39 14 November 2014
NOTES
1. Lei Zeng, “An Evaluation of the Quality of Chinese-Language Records in the
OCLC OLUC Database and A Study of a Rule-based Data Validation System for On-
line Chinese Cataloging.” Ph.D Diss., University of Pittsburgh, 1992.
2. Janet Swan Hill, “The Elephant in the Catalog: Cataloging Animals You Can’t
See or Touch.” Cataloging & Classification Quarterly, 23, no. 1 (1996): 5-25.
3. Joseph C. Harmon, “The Death of Quality Cataloging: Does It Make a Differ-
ence for Library Users?” Journal of Academic Librarianship, 22 (July 1996): 306-307.
4. Lydia W. Wasylenko, “Building Quality That Counts into Your Cataloging Op-
eration.” Library Collections, Acquisitions, and Technical Services, 23, no. 1 (Spring
1999): 101-104.
5. Rahmatollah Fattahi, “Super Records: an Approach Towards the Description of
Works Appearing in Various Manifestations.” Library Review, 45, no. 4, (1996):
19-29.
80 CATALOGING & CLASSIFICATION QUARTERLY
12. James R. Dwyer, “The Catalogers’ ‘Invisible College’ at Work: The Case of the
Dirty Database Test." Cataloging & Classification Quarterly, 14, no. 1 (1991): 75-82.
13. Lisa Romero, “Original Cataloging Errors: A Comparison of Errors Found in
Entry-level Cataloging with Errors Found in OCLC and RLIN.” Technical Services
Quarterly, 12, no. 2 (1994): 13-27.
14. Robert N. Bland, “Quality Control in a Shared Online Catalog Database: The
Lambda Experience.” Technical Services Quarterly, 4 (Winter 1986): 43-58.
15. Sheila S. Intner, “Quality in Bibliographic Databases: An Analysis of Mem-
ber-contributed Cataloging in OCLC and RLIN.” Advances in Library Administration
and Organization, 8 (1989): 1-24.
16. Fung-yin K. Simpson, “Quality Control of Chinese Monographic Records: A
Case Study.” Journal of East Asian Libraries, no. 116 (Oct. 1998): 31-40.
17. Karen T. Wei and Sachie Noguchi, “RLIN CJK versus OCLC CJK: the Illinois
experience.” Library Resources & Technical Services, 33, no. 2 (1989): 140-151.
18. Hisako Kotaka, interview by author. OCLC, OH, 20 November 2000.
19. CJK Database Statistics. http://www.oclc.org/oclc/cjk/statmenu.htm.
20. Karen Smith-Yoshimura, Email to author, 18 January 2001.
21. Barbara G. Preece, “Preliminary LC records for monographs in OCLC.” Infor-
mation Technology and Libraries, 11 (Mar. 1992): 3-9.
22. Library of Congress. Cataloger’s Desktop Version 2001 issue 1, Library of
Congress, Washington D.C.
23. Online Computer Library Center, “OCLC Bibliographic Formats and Stan-
dards.” Ohio: OCLC, 1999.
24. Online Computer Library Center. OCLC Authority File. OCLC, Ohio.
25. G. M. McCune, and E. O. Reischauer. The Romanization of the Korean Lan-
guage: Based upon its Phonetic Structure. Korea : s.n., 1939?.
26. Samuel E. Martin, Yang Ha Lee and Sung-Un Chang. A Korean-English dictio-
nary. New Haven: Yale University Press, 1967.
27. Kotaka see note 18.
Hee-sook Shin 81
(1:1000)
200
180
160
140
120
100 OCLC
RLIN
80
60
40
20
0
1994 1995 1996 1997 1998 1999 2000
82 CATALOGING & CLASSIFICATION QUARTERLY
1. Encoding level codes, indicated by a capital or numeric character, reflect the degree of completeness of machine-readable
(MARC) records.
2. Descriptive Cataloging codes according to the provisions of International Standard Bibliographic Description (ISBD).
• i–record has the descriptive cataloging and punctuation conventions of ISBD but is known to be a non-AACR2 rec-
ord
• j–juvenile work, indicates items intended for children (up to a 9th grade level or 15 years old).
Hee-sook Shin 83
fixed RO RO RO RO
& 0xx F1 F2 F3 F4 M1 M2 M3 S1 S2 R1 R2 R3 R4 R5 E1 E2 E3 E4 1 2 3 4 Total
1 0
2 0
3 0
4 0
5 1 1
6 0
Downloaded by [DePaul University] at 20:39 14 November 2014
7 0
8 1 1
9 1 1
10 0
11 0
12 0
13 1 1
14 3 3
15 0
16 0
17 1 1 2
18 0
19 1 1
20 0
21 1 1 2
22 0
23 1 1
24 0
25 0
26 1 1
27 1 1
28 0
29 0
30 1 1 2
31 1 1
32 0
33 0
34 0
35 1 1
36 0
37 0
38 0
Hee-sook Shin 85
RO RO RO RO
F1 F2 F3 F4 M1 M2 M3 S1 S2 R1 R2 R3 R4 R5 E1 E2 E3 E4 1 2 3 4 Total %
Fixed
&0xx 3 1 1 10 29 0 0 0 93 1 2 1 31 0 0 0 0 0 0 0 0 0 172 0.34
1xx 5 0 0 6 5 8 3 1 0 5 0 4 0 0 0 0 0 1 3 0 1 0 42 0.08
Downloaded by [DePaul University] at 20:39 14 November 2014
240 1 6 0 2 12 2 1 0 0 2 2 0 0 0 2 0 0 0 1 1 1 0 33 0.06
245 0 13 9 58 1 3 0 17 0 1 2 5 0 0 29 5 0 1 71 47 14 2 278 0.55
246/
740 2 6 0 0 28 1 8 0 0 0 2 0 0 0 3 0 0 0 9 11 1 0 71 0.14
250 0 0 0 1 4 1 0 0 0 0 2 0 0 0 0 0 0 0 1 2 1 0 12 0.02
260 0 0 3 29 0 0 2 3 0 0 4 5 0 0 12 1 0 0 15 6 4 1 85 0.17
300 0 0 4 20 1 0 0 0 0 0 3 1 0 0 0 0 0 2 0 0 0 0 31 0.06
4xx/
8xx 15 14 0 11 14 0 4 2 0 2 2 6 1 0 4 3 0 0 8 2 4 0 92 0.18
5xx 4 1 0 64 12 0 1 2 0 2 1 0 0 0 11 0 0 4 8 7 3 0 120 0.24
6xx 1 0 4 6 1 1 11 0 0 4 0 5 0 18 4 0 0 7 3 0 0 0 65 0.13
7xx 4 2 0 14 31 11 0 0 0 4 0 4 0 0 8 0 0 1 4 0 1 0 84 0.17
Total 35 43 21 221 138 27 30 25 93 21 20 31 32 18 73 9 0 16 123 76 30 3 1085 2.14
APPENDIX 5 (continued)
RO RO RO RO
F1 F2 F3 F4 M1 M2 M3 S1 S2 R1 R2 R3 R4 R5 E1 E2 E3 E4 1 2 3 4 Total %
Fixed
& 0xx 1 0 4 0 2 0 0 0 31 1 0 1 6 0 0 0 0 1 0 0 0 0 47 0.41
1xx 2 0 0 6 1 4 0 0 0 0 0 2 0 0 0 0 0 0 1 0 0 0 16 0.14
240 1 2 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0.04
245 0 4 3 65 0 0 0 13 0 0 1 0 0 0 10 0 0 0 14 14 11 0 135 1.17
Downloaded by [DePaul University] at 20:39 14 November 2014
246/
740 0 4 0 1 2 1 4 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 13 0.11
250 0 0 0 1 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 12 0.10
260 0 0 3 44 1 1 0 7 0 1 3 0 0 0 8 0 0 0 5 5 1 0 79 0.69
300 0 0 12 22 0 11 0 0 0 0 0 11 0 0 1 0 0 0 0 0 0 0 57 0.50
4xx/
8xx 1 1 0 13 1 0 0 0 0 0 1 1 0 0 6 0 0 0 0 1 0 0 25 0.22
5xx 2 0 0 18 2 0 0 1 0 0 0 0 0 0 4 0 0 2 0 1 1 0 31 0.27
6xx 0 1 4 3 1 0 1 0 0 0 0 0 0 6 2 0 0 3 0 0 0 0 21 0.18
7xx 0 1 0 6 8 2 2 2 0 0 0 2 0 0 0 0 0 1 1 0 1 0 26 0.23
Total 7 13 26 180 20 19 8 23 31 2 5 17 6 6 31 0 0 7 21 31 14 0 467 4.06
APPENDIX 5 (continued)
MARC field x y r z
(entry number) (error number) (rate = y/x) (r* sample number)
Fixed & 0xx 508 172 0.34 172.00
1xx 336 42 0.13 63.50
240 90 33 0.37 186.27
Downloaded by [DePaul University] at 20:39 14 November 2014
440/490/8xx
300 245
260 246/740
250
90 CATALOGING & CLASSIFICATION QUARTERLY
APPENDIX 6 (continued)
MARC field x y r z
(entry number) (error number) (rate = y/x) (r* sample number)
Fixed & 0xx 115 47 0.41 47.00
1xx 73 16 0.22 25.21
240 6 5 0.83 95.83
245 115 135 1.17 135.00
Downloaded by [DePaul University] at 20:39 14 November 2014
300
260 245
250 246/740