Documente Academic
Documente Profesional
Documente Cultură
Automation of Bibliographic
Control for Chinese Materials
in the United States
LEI ZENG”
ABSTRACT
The automation of bibliographic control for Chinese
materials (along with other East Asian language materials) is
significant because of the role this material plays in providing
swift access to information internationally, across physical,
political and linguistic boundaries. Ten years ago, the
Library of Congress (LC) and the Research libraries Group
(RLG) initiated a project for seeking means and methods
to automate Chinese, Japanese and Korean (CJK) bib-
liographic records. Today, most East Asian Libraries in
North America can process their CJK materials in vernacular
languages and scripts in library computers in the same way
as Western-language peers had done for years. This paper
attempts to review some of the important issues involved in
such a movement and the general progress during the last
decade. The review which is concerned with the automation
of bibliographic control for Chinese materials will emphasize
bibliographic utilities in the United States since the auto-
mation has been led primarily by East Asian libraries in
North America rather than by those in mainland China and
Taiwan.
GENERAL PROGRESS
Since the 196Os, there has been an increasing awareness in the United
States of the political and economic importance of East Asia. This
movement has led to an increased interest in East Asian studies, a
large increase in the number of East Asian specialists, and a dramatic
expansion of East Asian language collections. In 1930, East Asian
collections in the United States had some 400000 volumes; half a
century later, these collections have grown to about 8 000 000 volumes
distributed in 9 1 academic libraries.’ However, 10 years ago, most East
Asian libraries in the United States mainly functioned as warehouses
of books rather than as active information service organizations, let
TABLE I
Systems comparison
TABLE I continued
Authority file Display all diacritics Does not display OCLC’s authority
display diacritics file search may
result in confusion
and misleading
results
Authority searches Permits an authority OCLC’s capability is
have to be done search while necessary and
separately inputting a record convenient
Communica- Private dedicated Dial-access line Operation cost of
tion cost line. $5000 installation RLIN seems too high
$65 000 per year for a single library
(fixed)
Database size 800 000 records 450 000 records CJK records have a
(as of March (Includes lower hit rate on the
1991) duplicates) OCLC database
thanon RLIN's.
OCLC & RLIN
agreed to exchange
CJK records created
after September
1988 through LC
since OCLC merges the CJK records with all other records, each search
key has to go through the entire data base currently containing nearly
22 million records. OCLC may index the language code in the fixed
field and the 042 field so that CJK search keys could go through the
CJK records and bypass millions of other records. Another suggestion
is that OCLC should develop and validate a CJK stop-word list for
those common terms such as Chung-kuo (China), Nihon (Japan), or
Koria (Korea) so that common terms could be ignored in the indexes.
He also suggested that OCLC should develop a CJK data base subset
and maintain a separate telecommunication line for CJK users in order
to avoid a “traffic jam”; that an on-line thesaurus would be necessary;
and that OCLC CJK should change from using an 8-bit microcomputer
to 16- or 32- bit hardware.
In her MLS thesis based on a survey of East Asian libraries in the
United States, Elman ( 1990)37 made I7 suggestions for improvement
of the OCLC CJK350 system. Emphasizing the process of on-line
cataloging, she suggests improving search capability and card pro-
duction efficiency as well as a clean up of duplicate records. She had,
for the first time, mentioned the possibility of co-operation in order to
308 L. ZENG
The OCLC data base currently holds more than 450 000 unique
records of East Asian language materials (including tapeloaded records
created by LC and RLIN CJK- members); there are more than 300 000
CJK records with vernacular characters, of which over 150000 are
Chinese records.35 This number, however, is still small compared with
the volume of Chinese materials available worldwide. In order to over-
come the problem of limited CJK resources within North America, a
search for international co-operation has been a first consideration for
the major bibliographic utilities since the first implementation of CJK
systems in the mid- 1980s. Thus in issues related to the further devel-
opment of OCLC CJK on-line cataloging, concern over quantity has
prevailed over concern for quality, at least initially.
Impact of CJK slystemson the East Asian Libraries in the United States
In order to find out how academic/research libraries in the United
States incorporate the RLIN CJK and the OCLC CJK350 systems into
their local systems, Karen T. Wei ( 1986a),44 Wen-kai Kung,45 Hee-
Jung Lee ( 1985)46 and Su-erh T. Elman ( 1990)37 conducted surveys
among the East Asian libraries in the United States. Only Elman’s
survey covered both RLIN CJK users and OCLC CJK users. Elman’s
survey shows that the most important function of CJK systems is cata-
loging because of increased cataloging productivity. Authority control
is listed as the second benefit. Other important benefits include
improved staff morale and improved library image. Inter-library loan
(including record searching and verification) is the third important
function; however, it is not widely utilized.
Major problems of the two CJK systems as seen by the libraries are
different. OCLC CJK users ranked searching methods much higher
than that of RLIN, while RLIN CJK users ranked high costs much
higher than that of OCLC. Both groups selected “inadequate for public
use” as one of the three most serious problems of the systems. Quality
of records, though listed by the investigator as one of the major
problems, was not specifically discussed in the paper. However, the
responding libraries rated benefits of CJK systems higher than their
problems. This indicated that the merits of these two systems were seen
as more important than their problems.37
Nevertheless, the East Asian collections in the United States are
still not fully integrated into the main collections of their parent institu-
tions. Although most institutions include romanized CJK records in
their local library systems, they do not serve a practical purpose
because romanized records are not always legible to users or even
well-trained librarians. Card catalogs are still the major forms of
public access. The major barriers for the development plan are
hardware and software limitations, as well as insufficient funding.
BIBLIOGRAPHIC CONTROL OF CHINESE MATERIALS 311
“Low priority” and lack of expertise of personnel are also important
factors.
in the United States are still rare in the area. Their efforts in pursuit of’
library automation during the past decade or so have been directed
towards revision of cataloging codes, to make them compatible with inter-
national standards such as AACR2 and ISBDs, development of special
character sets for the automation of vernacular scripts, compilation
of national bibliographies, and establishment of national MARC for-
mats based on UNIMARC and/or USMARC.’ By 1982, Taiwan had
completed and published the Chinese MARC format based on
UNIMARC and LC MARC, a new version of Chinese cataloging
rules which incorporates AACR2 and ISBD, and a list of Chinese
subject headings.53,54 OCLC conducted tests of the adaptability of
Chinese MARC tapes to the OCLC data base and is working on
some remaining incompatibility problems. The National Library of
China in Beijing has, in recent years, been experimenting with the LC
MARC tapes in acquisitions and cataloging. Continuing projects
include the creation of a national bibliographic data base, publica-
tion in electronic form of the national bibliography of China, and
production of Chinese MARC tapes.gX55
Problems exist in the transmission of bibliographic data between
CJK data bases in East Asia and North America. For instance, unified
cataloging practices need to be established with every cataloging agency
adhering to international standards such as AACR2, ISBD and
UNIMARC. Consistency in the choice of romanization systems for
use in bibliographic records needs to be maintained. As pointed out
by Thomas H. Lee, Associate Librarian of Indiana University Library,
“the key word now is standardization” and to do this, not only East
Asian libraries in North America, but also all the national biblio-
graphic agencies in East Asia need to devote their best efforts to im-
prove the international exchange of bibliographic data and sharing of
resources.’
In conclusion, the automation of bibliographic control for Chinese
materials, along with Japanese and Korean materials, has moved into
the mainstream of library automation in the on-line environment since
the mid-1980s. Among the contributions made, the RLIN CJK system
and the OCLC CJK350 system are the most distinguished. To date,
about half a million CJK bilingual bibliographic records have been
entered on the data bases of these two library utilities, among them
over 50% are Chinese records. However, this is still a small number
compared with the huge number of Chinese materials available in the
world, Many issues related to policy and technology for the further
development of the Chinese bibliographic control become critical.
Meanwhile, although they present unique problems and represent a
significant subset of the records held by bibliographic utilities, CJK
BIBLIOGRAPHIC CONTROL OF CHINESE MATERIALS 313
bibliographic records have not been well-investigated. The literature
on CJK bibliographic control has concentrated on the introduction of
CJK systems, comparisons between the RLIN CJK and OCLC CJK
systems, and the impact that the automation of bibliographic control
has had on East Asian libraries in North America. Romanization,
retrospective cataloging, and the transferal of CJK records with or
without vernacular into the integrated library systems employed by the
parent institutions of East Asian libraries are new topics of recent years,
but literature in these areas is rare, most being reported in the form of
news, letters and notes. No comprehensive study on the quality of
CJK bibliographic records in on-line data bases has been reported.
Nevertheless, a new era has already begun, and the future of automated
bibliographical control for Chinese and other vernacular materials in
the whole world will be very exciting.
ACKNOWLEDGEMENTS
REFERENCES
‘rsuneishi, Warrent (1987) Current status of East Asian Collections in the North
America. Journal of Library and Znformation Science 13(2), pp. 199-209.
2 Martinique, Edward (1988) Summer Institute on East Asian Librarianship (1988:
Seattle). CEAL Bulletin 85, pp. 23-27.
Library of Congress. (1978) Library offers pl ans for romanization. Library of
Congress Information Bulletin 37 (43)) pp. 654-655.
4 Lee, Thomas H. (1979) Results of a survey on the romanization and automation
of East Asian catalog CEAL Bulletin 58, pp. 38-41.
records.
5 CEAL (1980) ACLS Conference on East Asian Character Processing in Auto-
mated Bibliographic Systems. CEAL Bulletin 61, pp. g--17.
Library of Congress. (1980) LC/RLG agreement concluded on cataloging of East
Asian scripts. Library of Congress Information Bulletin 39 (6)) pp. 42-43.
CEAL (1982) Research Library Group and Transtech Corporation sign contract
for RLG CJK cluster. CEAL Bulletin 67, pp. 47748.
Crawford, Walt (1984) MARCfor Library Use: Understanding the USMARC Formats.
White Plains, NY. Knowledge Industry Publications, Inc. pp. 97-102, 154.
Lee, Thomas H. (1988) The development of CJK bibliographic databases in
North America and East Asia. Cataloging & Class$cution Quarterly 8(3/4), pp. 111
126.
10 RLG (1988) RLG celebrates fifth anniversary of CJK. CEAL Bulletin 85, p. 43.
11 Tsiang, Amy Ching-Fen (1988) Experiences in OCLC CJK online cataloging at
the UCLA Oriental Library. CEAL Bulletin 85, pp. 8-.lO.
314 L. ZENG
12 Fischer, Russell G. (1984) The CJK ‘I’erminal: RLG and Transtech’s achievement.
Library Hi Tech l(4), pp. 27-34.
13 Hu, David Y. (1985a) OCLC and its CJK software. Journal of Educational Media
C!? Libray Sciences 23(I), pp. 63-75.
14 Hu, Lily H. (198513) Automation of East Asian bibliographical records in the
University of Melbourne Library. East Asian Librarians’ Group of Australia Newsletter
8(March), pp. l-4.
15 Huang, Hong-chu ( 1985) Th e d evelopment of RLIN’s automated system in East
Asian languages material. Journal OfEducationaL Media @ Library Sciences 23(l), pp,
76-85.
16 Wang, Andrew H. (1985a) OCLC CJK automated library information network.
Journal of Library and Information Science 11(2), pp. I43- 153.
17 Wang, Andrew H. ( 1985b) OCLC ‘s cataloging capability in the Chinese, Japanese
and Korean languages. Journal of Educational Media &3 Library Sciences 23( I), pp,
57-62.
18 Rogers, Gloria H. (1986) From cards ‘to online: the Asian connection. information
Technology and Libraries 5(4), pp. 280-284.
I9 Haeger, John W. (1987) The CJK enhancements to the RLIN system: a review
of basic issues. In: C. Bossmeyer and S. W. Massil (Eds) Automated Systemsfor Access
to Multilingual and Multiscript Library Materials: Problems and Solutions. New York.
K. G. Saur. pp. 156-162.
20 Smith-Yoshimura, Karen (1987) RLG’s East Asian studies program. Journal of
Library and Information Science 13(2), pp. 133-l 40.
21 Yu, Abraham J. (1989) An efficient way to copy and insert fields into OCLC CJK
records. CEAL Bulletin 88, pp. 54-57.
22 Bales, Kathleen and Tucker, Alan (1988) The RLIN database: current status,
work in progress, future developments. Cataloging 6Y ClassiJication Quarterly 8(3/4),
pp. 79-89.
23 Cain, Jack (1988) Speech presented at the plenary session of the Committee on
East Asian Libraries on March 24, 1988 in San Fransisco. Reprinted in CEAL
Bulletin 85, pp. 20-23.
24 Cain, Jack (1990) The development of Chinese ideogramic processing for a shared
cataloging system. Program 24(2), pp. 141-153.
25 Tam, Kwok-kan (1986) Automation of East Asian colIections in the U.S.: current
state and problems. Journal of Library and Information Science I2( l), pp. 34-40.
26 Yang, Chen-Chau (1985) Application and design considerations CJK for infor-
mation interchange code. Journal of Library and Information Science 11 ( l), pp. 24-32.
27 Shi, Y. (1989) Facilitating Chinese character entry and information retrieval
through regular expression searching. Library and Information Science Research 11(4),
pp. 335-356.
28 Shi, Y and Larson, R. (1990) Chinese online--problems and solutions. Znformation
Technology and Libraries 9(2), pp. 144-154.
29 Maier, John H. ( 1987) General progress in automation of Chinese and other Asian
materials: an overview. Special Libraries 78( 3)) pp. l&i- 190.
30 Miyazawa, Akira (1987) Character set and its control in an online cataloging
system: applications at the Center for Bibliographic Information. In C. Bossmeyer
and S. W. Massil (Eds) Automated Systems for Access to Multilingual and Multiscript
Library Materials: Problems and Solutions. New York. K.G. Saur. pp. 148-155.
31 OCLC (1987) Studies in automatic cataloging. In: Annual Review of OCLCResearch:
J@y 1986June 1987. Dublin, OH: OCLC.
32 Hu, Lily H. (1990) Online cataloging systems for Chinese, Japanese and Korean
BIBLIOGRAPHIC CONTROL OF CHINESE MATERIALS 315
collections: an evaluation report from the University of Melbourne. LASZE: hzr-
mation Bulletin of the Library Automated Systems information Exchange 20(4), pp. 7c-79.
33 Wei, Karen T. (1989b) RLIN CJK vs OCLC CJK. Journal of Educational Media
@ Library Sciences 24(l), pp. 82-94.
34 Wei, Karen T. and Noguchi, Sachie (1989) RLIN CJK versus OCLC CJK: the
Illinois experience. Library Resources &+’ Technical Services 33(2), pp. 140-15 1.
35 Kotaka, Hisako (1991) 1991 OCLC CJK350 updates brief notes. Distributed
at the OCLC CJK350 Users Meeting (April 12, 1991, Le Meridien, New Orleans)
36 Lee, Jay (1989) Cataloging CJK online: the ASIA experience. Journal of Librav
and Information Science 15(I), pp. 22-34.
37 Elman, Su-erh T. (1990) Automation in East Asian Libraries in the United States:
A Survey Result. (MLS Thesis). Los Angeles. Graduate School of Library and
Information Science, UCLA, 1990.
38 ‘Tao, Hanyu and Cole, Charles (1990) Wade-Giles or Hanyu pinyin: practical
issues in the transliteration of Chinese titles and proper names. Cataloging @
Clussijcation Quarterly 12 (2)) pp. 105- 117.
39 Lee, Thomas H. (1990) A letter to Rather, Lucia J., Director for Cataloging,
Collection Services, Library of Congress on July 20, 1990. Reprinted in CEAL
Bulletin 90, i-iii.
40 Library of Congress. (1990) P’ m y’ m: Possible approaches for cataloging and auto-
mation. Prepared by Collections Services, Library of Congress, March 30, 1990,
distributed at the CEAL Annual Meeting on April 3, 1990 in Chicago. Reprinted
in CEAL Bulletin 90, pp. 5&62.
41 Smith-Yoshimura, Karen (1990a) A letter to Rather Lucia, Director of Catalog-
ing, Collection Services, Library of Congress on July 20, 1990. Reprinted in CEAL
Bulletin 91, p. 35.
42 Smith-Yoshimura, Karkn (1990b) RLG response to LC’s discussion paper on
pinyin. Reprinted in CEAL Bulletin 91, pp. 36-41.
43 Avram, H.D. (1991) Memorandum to “Those interested in Chinese roman-
ization” , January 3, 1991. Reprinted in CEAL Bulletin 92, p, 32.
44 Wei, Karen T. (1986a) Current status and future trends of East Asian library
automation in North America. Information Technolqey and Libraries 5(2), pp. 14s
146.
45 Kung, Wen-kai. (1986) Computerized cataloging of East Asian vernacular
materials in non-RLIN libraries of North America. CEAL Bulletin 80, pp. 5-10.
46 Lee, Hee-Jung. (1985) A study of automated cataloging system for Chinese, Japanese and
Korean Materials (MLS thesis). Los Angeles. Graduate School of Library and
Information Science, UCLA, 1985.
47 Brown, Rowland C.W. (1988) Library Technology Applications and Inter-
national Co-operation. In: International Symposium on New Techniques and Applications
in Libraries: Sept. 8-11, 1988; Xi’an, PRC. Xi’an. Xi’an Jiao Tong University Press.
pp. 551-557.
48 Little, Thompson M. (1988) OCLC’s international initiatives and the online union
catalog. Cataloging & Clas$ca&n Quarterly 8(3/4), pp. 67-78.
49 Wang, Andrew H. (1988) The role of OCLC in promoting Chinese culture. In:
International Symposium on new Techniques and Applications in Libraries: Sept. 8-l 1,
1988; Xi’an, PRC. Xi’an. Xi’an Jiao Tong University Press. pp. 529-532.
50 Wei, Karen T. (1988) Computer applications in East Asian libraries: current
status and future trends. In: International Symposium on New Techniques and Applications
in Libraries: Sept. 8-11, 1988; Wan, PRC. Xi’an. Xi’an Jiao Tong University Press.
pp. 456-462.
316 L. ZENG
51 OCLC (1989) Use of OCLC CJK350 system grows. OCLC Newsletter 181, p. 31.
52 Martinique, Edward (1990) Report of the 1990 Plenary Session. C&IL Bulletin
90, pp. 25-29.
53 Lee, L. T.-C. et al. (1981) Chinese MARC. Its present status and future devrl-
opment. Journal of Library and Information Science 7( I), pp. 1 l- 18.
54 Wang, Chen-ku (1982) The development of library automation as a foundation
of the National Library Network in Taiwan. Journal of Educational Media &? Library
Science 20(2), pp. 106107.
55 Sun, Beixin (1985) Automation development at the National Library of China.
Library Resources and Technical Services 29(4), pp. 367-368.
APPENDIX
’ Lee, Thomas H. (1979) Results of a survey on the romanization and automation of East
Asian catalog records. CEAL Bulletin 58, pp. 38-11.
’ Library of Congress (1978) Library offers plans for romanization. Library of Congress Znformation
Bulletin 37(43), pp. 654655.
’ CEAL (1980) ACLS Conference on East Asian Character Processing in Automated Biblio-
graphic Systems. CEAL Bulletin 61, pp. 9-11.
‘Lee, Thomas H. (1988) The development of CJK bibliographic databases in North America
and East Asia. Cataloging and Clawiication Quarterly 8(3/4), pp. 11 I-126.
’ Library of Congress (1980) LC/RLG a g reement concluded on cataloging of East Asian scripts.
Library of Congress Information Bulletin 39( 6)) pp. 42243.
6 From a letter oflee, Thomas H. (Chair, CEAL) to Rather, Lucia J., Director for Cataloging,
Collection Services, Library of Congress July 20, 1990. Reprinted in CEAL Bulletin 90, i-iii.
’ CEAL (1982) Research Library Group and Transtech Corporation sign contract for RLG
CJK cluster. CEAL Bulletin 67, pp. 4748.
a Martinique, Edward (1990) Report of the I990 Plenary Session. CEAL Bulletin 90, pp. 25529.
‘Lee, Thomas H. 1988. (see Note 1).
lo Ibid
” Tsiang, Amy Ching-Fen ( 1988) E x p eriences in OCLC CJK online cataloging at the UCLA
Oriental Library. CEAL Bulletin 85, pp. 8-10.
BIBLIOGRAPHIC CONTROL OF CHINESE MATERIALS 319
I2 Kotaka, Hisako (1991) 1991 OCLC CJK350 updates brief notes. Distributed at the OCLC
CJK350 Users Meeting (April 12, 1991, Le Meridien, New Orleans).
‘s Kotaka, Hisako (1991) (see Note 12).
I4 Martinique, Edward (1988) Summer Institute on East Asian Librarianship (1988: Seattle).
CEAL Bulletin 85, pp. 23-27.
” RLG (1988) RLG celebrates fifth anniversary of CJK. CEAL Bulletin 85, p. 43.
“Ibid. p. 43.
” RLG (1989) RLIN Multiscript Station Enthusiastically Received. CEAL Bulletin 87, pp. 58
i9.
” RLG (1988) (see Note 15).
” Cain, Jack (Utlas International Canada) (1988) speech presented at the plenary session of
the Committee on East Asian Libraries on March 24, 1988 in San Francisco. Reprinted in CEAL
Bulletin 85, pp. ‘LO--23.
” RECON retrospective conversion projects.
” Tsiang, Amy Ching-Fen (1990) Retrospective conversion of East Asian materials. CEAL
Bulletin 91, pp. 10-15.
“Kotaka, Hisako (1989) OCLC 0 n 1’me Computer Library Center CJK350 Users Meeting
(San Francisco: 1988). CEAL Bulletin 86, pp. 3 l-34 (Pak, Moo-Jae reported.).
“’ Martinique, Edward (1990) (see Note 8).
24 Kotaka, Hisako (1989) (see Note 22).
” Tsiang, Amy Ching-Fen (1990) (see Note 21 j.
“’ Martinique, Edward (1990) (see Note 8).
” Library of Congress (1990) Pinyin: Possible approaches for cataloging and automation.
Prepared by Collections Services, Library of Congress, March 30, 1990. Distributed at the CEAL
Annual Meeting on April 3, 1990 in Chicago. Reprinted in CEAL Bulletin 90, pp. 56-62.
“A letter of Lee, Thomas H. (Chair, CEAL) to Rather, Lucia J., July 20, 1990. (see Note 6).
“’ Ibid.
“A letter to Lucia Rather Director of Cataloging, Collection Services, Library of Congress on
July 20, 1990 by Kaien Smith-Yoshimura, Program Officer and CJK Specialist of RLG, and a
paper prepared by him titled: RLG response to LC’s discussion paper on pinyin. Reprinted in
CEAL Bulletin 91, pp. 35, 36-41.
” Wu Yeen-mei ( 1990) University ofwashington East Asian Library acquires full-text database
of the stlndard Chinese dynastic histories CEAL Bulletin 90, pp. 44-45.
” Harvard-Yenching Library (1988) H arvard-Yenching Library Receives Grant to Acquire
Chinese Data Base. Harvard University Library Notes, no. 99 1 (Oct. 18, 1990): l-2. Reprinted in
CEAL Bulletin 92, pp. 4345.
s3 Library of Congress (1989) Library of Congress ceases distribution of CJK catalog cards.
Librav qf Congwss hfirmation Bulletin 48(45), (Nov. 6, 1989): 392 Reprinted in CEAL Bulletin
89, p. 63.
“Kotaka, Hisako (1991) (see Note 12).
sr, Avram, H.D. (1991) Memorandum to “Those interested in Chinese romanization”, January
3, 199 1. Reprinted in CEAL Bulletin 92, p. 32.