Documente Academic
Documente Profesional
Documente Cultură
To cite this article: SungKyung Kim MLIS (2006) Romanization in Cataloging of Korean Materials, Cataloging & Classification
Quarterly, 43:2, 53-76, DOI: 10.1300/J104v43n02_05
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained
in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no
representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the
Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and
are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and
should be independently verified with primary sources of information. Taylor and Francis shall not be liable for
any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever
or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of
the Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://
www.tandfonline.com/page/terms-and-conditions
Romanization
in Cataloging of Korean Materials
SungKyung Kim
Downloaded by [University of Louisville] at 06:48 01 January 2015
INTRODUCTION
Cataloging System
In the library and information science field, there are three types of
languages: writers’ language in their works, librarians’ language in in-
dexing terms and descriptors to describe the works, and users’ language
in searching for the works within catalog systems. Since the indexing
terms and descriptors are used as access points and metadata by which
users are able to retrieve information, the librarians’ language signifi-
cantly affects the rate of precision and recall in retrieving. Therefore, li-
braries and library associations should have cataloging practices that
create accurate bibliographical records using the librarians’ controlled
vocabulary.
Furthermore, in order to help users retrieve relevant information as
often as possible, libraries should consider users’ ability to create search
terms when setting cataloging standards. The users’ searching ability
can be recognized through their information-seeking behavior, search-
ing skill, and information needs. Many library and information scien-
tists have demonstrated that the users’ information-seeking behavior
including cognitive abilities is related to the quality of search results.
Thus, it is obvious that catalogers need to consider users’ searching
ability when assigning descriptors and deciding on cataloging terms in
bibliographical records. Diagram 1 shows the searching process in an
online catalog.
DIAGRAM 1
Input queries
Information
Downloaded by [University of Louisville] at 06:48 01 January 2015
Cataloging
System
Match queries to
Descriptors indexing and
cataloging terms
Retrieve
information of
Cataloguer’s FOA the topic
language
Result of search
Evaluate the
information
Search Search
termination termination
METHOD
Downloaded by [University of Louisville] at 06:48 01 January 2015
BACKGROUND OF ROMANIZATION
East Asian and Western scholars have designed unique schemes to
romanize the non-Roman alphabets of East Asian languages. Libraries
58 CATALOGING & CLASSIFICATION QUARTERLY
Many studies were done with Wade-Giles and Pinyin, and LC adopted
the Pinyin system after verifying its efficiency.
The Hepburn system8 has been used to romanize Japanese. It is an
easy system to use, but in 1983, some changes were made by the LC
when a problem was encountered. When defects were found in using
the Japanese Hepburn romanization system, the system was revised and
modified through trial and error.
There are several variant romanization schemes for Korean: the
Yale system,9 South Korean Ministry of Education system,10 and the
McCune-Reischauer (MR) system. MR, the system used by LC, was
devised by G. M. McCune and E. O. Reischauer in consultation with
Korean linguists, and first published through the Korean Branch of the
Royal Asiatic Society in 1939. Since the MR system was first designed,
it has been widely used for historical, literary, political, and military
purposes and in the general presentation of Korean proper names in a
romanized form. It is based on Korean phonetic structures, or sounds of
speech. The system is a comprehensible guide which combines “scien-
tific accuracy and practical simplicity,”11 and standardizes modern pro-
nunciation of Korean for those unfamiliar with the language.
TABLE 1
to the system’s rules, and not in the way that he or she prefers to spell his
or her name. Despite the rigidity of these rules, looking through the
RLIN and LC authority files and bibliographic records, it can be seen
that even catalogers make mistakes when romanizing Korean names.
Many records in RLIN and LC have inaccurately spelled names. Tables
2.1 and 2.2 show some mistakes in the bibliographic records of the LC
and UCLA, both of which use the MR system.
In Table 2.1, the author’s name, Lee, Duhyun, should be romanized
as “Yi, Tu-hyon,” according to the MR system. In the UCLA catalog,
however, the user can find three items under “Yi, Tu-hyon,” and four-
teen entries can be found under “Lee, Duhyun.” The authorized form of
this name in the LC/NACO Name Authority File (NAF) is also “Lee,
Duhyun, 1924-.” These results show that the MR system is inconsis-
tently applied when catalogers create bibliographic records using
AACR2 22.3C2 (Persons entered under surname), an alternative rule
which allows the author’s preferred usage in place of rigid transcription.
This rule says, “Choose the Romanized form of name that has become
well-established in English language reference sources for a person en-
tered under surname whose name is in a language written in a nonroman
script.”14
Another matter in transcribing a person’s name is the use of hyphens
between the first and second syllable of a given name. Although there
are a few exceptions, Korean names typically have three syllables: one
syllable for surname and two syllables for the given name. Because
many Koreans have the same surname–for example, one-fifth of all Ko-
reans have the surname “Kim,”–given name is an important element in
retrieving precise information in a catalog search. The two syllables of
the given name are traditionally regarded as a single word in Korean.
Some people use a hyphen between the first and second syllables to dis-
tinguish the syllables, but this practice is not common. Therefore, often
no material is retrieved when a hyphen is included in the middle of a
SungKyung Kim 61
TABLE 2.1
UCLA Record
100 1_ |6 880-01 |a Lee, Duhyun, |d 1924-
245 10 |6 880-02 |a HanGguk sin‘guk sa yHn‘gu / |c Lee Duhyun [chH.
UCLA Record
100 1_ |a Lee, Duhyun, |d 1924-
245 10 |6 880-01 |a HanGguk musok kwa yHnhIi / |c Yi Tu-hyJn chH.
UCLA Record
100 1_ |6 880-01 |a Yi, Tu-hyJn.
245 10 |6 880-02 |a HanGguk sinGgIksa yHnGgu, |c Yi Tu-hyJn chH.
Downloaded by [University of Louisville] at 06:48 01 January 2015
TABLE 2.2
UCLA
100 1_ |6 880-01 |a Park, Chung Hee, |d 1917-1979.
245 10 |6 880-02 |a Kukka wa hyHngmyHng kwa na, |c Pak ChJng-hui chH.
LC Authority Record
Control Number: n 79058355
HEADING: Park, Chung Hee, 1917-1979.
100 10 |a Park, Chung Hee, |d 1917-1979.
400 10 |a P`u, Cheng-hsi, |d 1917-1979
400 10 |a Pak, Chong-hui, |d 1917-1979
400 10 |a Pak, Jung Hi, |d 1917-1979
400 10 |w nnaa |a Pak, Chong-hui, |d 1917-
510 10 |a Korea (South). |b President (1963-1979 : Park)
turns 10,000 results, and a search for “seoul,” returns 2,458 (as retrieved
in April 2005). Since the spelling “Seoul” is more popular for most peo-
ple than “Soul” as romanized according to the MR system, a cataloger
may be confused in deciding which romanized spelling should be used.
Thus, both catalogers and users may be confused by the MR system
when romanizing well-known proper nouns.
Fourth, the MR system is an inconsistent scheme for bibliographic
records because Korean has several local dialects, and there are unique
pronunciations in each local dialect. In other words, phonetic-based
Downloaded by [University of Louisville] at 06:48 01 January 2015
TABLE 3
UCLA Record
260 __ |a Seoul : |b Korean Culture & Arts Foundation, |c 1985.
UCLA Record
260 __ |a Seoul, Korea : |b NAMAN Pub. House, |c 1994.
UCLA Record
260 __ |6 880-03 |a SJul : |b Isak Ch‘ulp‘ansa, |c 1985.
SungKyung Kim 63
TABLE 4
TABLE 5
Downloaded by [University of Louisville] at 06:48 01 January 2015
RLIN Record
100 1· ‡aPak, KyJng-ni,‡d1926-
245 10 ‡aT'oji :‡bPak KyJng-ni taeha sosJl.
260 ·· ‡aSJul
UCLA Record
245 10 |6 880-02 |a T‘oji : |b Pak KyHng-ni taeha sosHl.
260 __ |6 880-03 |a SHul : |b Nanam Ch‘ulp‘an, |c 2002.
USC Record
100:|6880-01|aPak, Ky*ong-ni,|d1926-|?UNAUTHORIZED
245:10: |6880-02|aT'oji :|bPak Ky*ong-ni taeha sos*ol.
260: |6880-03|aS*oul T'ukpy*olsi :|bSams*ong Ch'ulp'ansa,|c1988.
the beginning of a word, they are voiceless, and are romanized as “k,”
“ch,” “t,” or “p” respectively. When they occur between vowel sounds
in the middle of a word, however, they are voiced, and are romanized as
“g,” “j,” “d,” and “b.” It is hard for Korean users to distinguish between
the voiced sounds and voiceless sounds, and many Korean speakers
tend to use the voiced romanizations (“g,” “j,” “d,” and “b”) for both
voiced and voiceless sounds.
Finally, word division is also a problem in using the MR romanizing
scheme. According to ALA-LC Romanization Tables, space should be
used to separate each word or lexical unit (including noun particles)
from all other words or lexical units. But this rule is contrary to Korean
conventional practice which always joins particles to the nouns they
modify, and often omits spaces between nouns. For instance, the title
“89:;<” (“The old man and the sea”) should be written as “Noin
kwa pada” (“noin” = “the old man”; “kwa” = “and”) according to the
ALA-LC rule. Yet Koreans usually write the title as “Noinkwa pada,”
with no space between Noin and kwa. The principle of word division
may create a heavy burden for Korean users in their searching processes
64 CATALOGING & CLASSIFICATION QUARTERLY
as it is not natural. LC and other libraries have created many faulty bib-
liographical records, due to incoherent and arbitrary word division.
As shown above, the MR system has created many problems and dif-
ficulties for romanizing Korean in library catalogs. In bibliographic rec-
ords, the most significant issue is to help users to access accurate
information with simple methods, reflecting user’s information needs
and seeking behavior. From this viewpoint, the MR romanization scheme
is not satisfactorily effective in retrieving and providing access to infor-
mation.
Downloaded by [University of Louisville] at 06:48 01 January 2015
60% romanized it using the English spelling “design.” This points out
that the use of the original spellings for words borrowed from English is
more usual for Korean users than romanizing the Korean spelling.
Table 6.4 also shows the students’ romanization of widely used
proper nouns. In the MR system, the capital of South Korea, “Seoul”
should be romanized as “Soul.” In the survey, no student wrote “Soul”
(or even “Soul”), yet 90% of the students romanized the city’s name as
the commonly used “Seoul.”
Tables 7.2-1, 7.2-2, and 7.2-3 show that not a single student could re-
Downloaded by [University of Louisville] at 06:48 01 January 2015
trieve the publications when searching the names of a few of the most
famous authors in Korea.
Moreover, the students could not distinguish between voiced sounds
and voiceless sounds. When the character “6” or “5” is put at the be-
ginning of a word, it should be romanized as “t” and “ch” respectively.
However, Tables 6.5 and 6.6 show that many more students used “d” or
“j.” I asked the students individually whether they knew the difference
between voiced and voiceless sounds. According to their answers, they
knew the theoretical difference, but generally could not tell the sounds
apart or apply the distinction practically.
Table 7.5-1 shows the students’ skills in word division. None of the
students gave the correct word division.
Tables 7.2-1, 7.2-2, and 7.2-3 indicate the students’ understanding of
hyphens. No students included the hyphens between the first and sec-
ond syllables of the three authors’ given names.
Overall, through the questionnaire and interviews with Korean users,
the MR system was not found to be very effective in romanizing search
terms, because terms romanized using MR rarely matched the users’ in-
tuitive spellings. In the questionnaire, the simplest search query and the
highest retrieval result was “T’oji (BC; “The Land”), one of the best-
selling books in Korea. Even in this case, only 25% of the students gave
the correct approximation “Toji.” Users cannot be knowledgeable in cor-
rectly using MR or successfully retrieving materials using romanized
search terms until they take a specialized class on the romanization
scheme.
CONCLUSION
TABLE 6
Total 12 100.00% *
*Possible error exists within plus or minus 0.01 percent. It was rounded up to the tenth place to make 100%.
SungKyung Kim 67
etiyagi 1 8.33%
yesyiyagi 1 8.33%
non-romanizing 2 16.67%
yekeeygi 1 8.33%
Total 12 100.00%*
TABLE 7 (continued)
7.2-2. Pak, ChJng-hKi ( ): as an author’s name
No answer 1 8.33%
Total 12 100.00%*
12 100.00%*
*Possible error exists within plus or minus 0.01 percent. It was rounded up to the tenth place to make 100%.
**Possible error exists within plus or minus 0.04 percent. It was rounded up to the tenth place to make 100%.
***Possible error exists within plus or minus 0.02 percent. It was rounded up to the tenth place to make 100%.
70 CATALOGING & CLASSIFICATION QUARTERLY
The single letters “g,” “j,” “d,” and “b” are used to romanize “4,” “5,”
“6,” and “7” respectively, regardless of whether the sounds they rep-
resent are voiced or unvoiced.
Unlike the MR system, the single letters “g,” “j,” “d,” and “b” are used
to romanize “4,” “5,” “6” and “7” at the beginning of a word or be-
tween vowel sounds, regardless of whether the sounds they represent are
voiced or unvoiced. The MCT system is not a morphemic-based translit-
eration system. However, like the MR system, the new system still treats
consonants differently in situations where their sounds change percepti-
Downloaded by [University of Louisville] at 06:48 01 January 2015
bly. Table 8 compares the MR system and the new romanization system.
While the MCT recognizes the difficulties in requiring surnames to
conform to the new system, it is continuously working for standardizing
the romanization of surnames. The scholars who developed the MCT
system understood that adoption of the new system would cause much
confusion in the short term, and that it would take considerable time for
the system to be recognized around the world. Were the transition to be
delayed for fear of causing more confusion, however, the costs of prob-
lems engendered by the old system and the costs of redressing those
problems would become larger and larger, thus burdening future gener-
ations of Korean speakers.
The change in Chinese romanization had similar problems. Before
the late 20th century, the most widespread romanization scheme was
the Wade-Giles system, which was adopted by LC to romanize Chi-
nese-language materials. Yet in 1958, the government of the People’s
Republic of China approved Pinyin as a romanization system for Man-
darin,17 since the Wade-Giles system has phonetically redundant sylla-
bles. Wade-Giles cannot render the national standard pronunciation of
Mandarin with complete accuracy. On the other hand, the Pinyin system
is a well-structured and consistent romanization scheme in which Man-
darin phonemes and syllables can be relatively accurately presented.
Pinyin is also superior when used in online retrieval. Several research-
ers have reported that library users prefer the Pinyin system over the
Wade-Giles system. Such research encouraged LC to change its Chi-
nese-language records from Wade-Giles to Pinyin, and in 1997, LC de-
cided to adopt the Pinyin system. Although there were many difficulties
and much confusion, the adoption of the Pinyin system contributed to
improving bibliographic access.
Today, the South Korean government encourages many Western
countries to follow the MCT system that was introduced in 2000. Con-
sidering the previous example of Pinyin, LC and other libraries in the
72 CATALOGING & CLASSIFICATION QUARTERLY
Consonants Vowels
k g k g k a a
kk kk k kk k ae ae
Downloaded by [University of Louisville] at 06:48 01 January 2015
n n n n n ya ya
t d t d t yae yae
tt tt t tt t H eo
r r l r l e e
m m m m m yH yeo
p b p b p ye ye
pp pp p pp p o o
s s t s t wa wa
ss ss t ss t wae wae
— ng§ ng ng§ ng oe oe
ch j t j t yo yo
tch tch t jj t u u
ch' ch' t ch t wH wo
k' k' k k k we we
t' t' t t t wi wi
p' p' p p p yu yu
h h h h h I eu
Ii ui
i i
United States should consider switching from the MR system to the new
romanization system that matches the official system in use in South
Korea.
Third, as Tillett mentions,18 it is necessary to create more precise and
consistent authority control as well as access control, which can be
shared internationally. The authorized forms represent various forms of
names and titles and are used to create uniform access points to facilitate
retrieval in library catalogs. Each country has different expectations for
these access points. Libraries need to provide headings that reflect us-
ers’ conventions in their local communities and countries. As shown
Downloaded by [University of Louisville] at 06:48 01 January 2015
adopt Unicode in the future. Many users, however, may not yet know
about their libraries’ multi-language service. It is undoubtedly necessary
to inform users that they can search by using their vernacular languages.
Finally, the pattern of searching I observed would warrant and support
a change in the transcription rules. Words adopted from Roman alphabet
languages into non-Roman script should use their original spellings,
rather than being re-transliterated from the non-Roman languages.
Thus far, I have examined the history, characteristics, and use of
romanization schemes for non-Roman-alphabet materials, with a focus
on Korean romanization. The use of the MR system in library catalogs
has many drawbacks, considering users’ searching ability and accessi-
bility. To retrieve materials more precisely according to user needs, I
have introduced several methods being discussed currently. It is neces-
sary for librarians at LC and elsewhere to maintain a continuous dialog
on the effective cataloging of and rules for Korean-language materials.
By observing users’ seeking behaviors and searching skills, I can con-
clude that there should be further studies on the topic of designing and
developing alternatives for the future.
Received: January, 2006
Revised: May, 2006
Accepted: May, 2006
NOTES
1. Royal Asiatic Society-Korea Branch, “The Romanization of Korean According
to the McCune-Reischauer System,” Transactions of the Korea Branch of the Royal
Asiatic Society 38 (1961): 119-128.
2. Susie Cheng et al., Final Report on Pinyin Conversion (Washington: Pinyin Li-
aison Group, Council on East Asian Libraries, 2000). Available online at http://cealctp.
lib.uci.edu/ceal_pinyin.htm.
3. Laura Tull and Dona Straley, “Unicode: support for multiple languages at the
Ohio State University libraries,” Library Hi Tech 21, no. 4 (2003): 442.
SungKyung Kim 75
doi:10.1300/J104v43n02_05
76 CATALOGING & CLASSIFICATION QUARTERLY
APPENDIX
Survey for the Project of “Cataloguing System for Korean Collections: focusing on romanization”
Kim, Sungkyung
Department of Information Studies at UCLA
Which search terms do you use when you want to search in the USC online catalog?
Please romanize the words in “ ” (Question 1-6).
2. Fiction written by “ ”
3. Publications published by “ ”
4. Title is “ ”
6.
( Romanizing .)
7. ?
(How do you search for Korean fiction?)
Thank you.
_____________________