Research Guides: Resources: Subject Headings

Subject Headings

Authorities and vocabularies / Library of Congress
SKOS version: http://id.loc.gov The primary goal of this service is to enable machines to programmatically access data at the Library of Congress, but the web interface also provides simple user access. MARC version: http://authorities.loc.gov/
Authorities - VIAF / Virtual International Authority File
http://viaf.org
ClassWeb includes LCSH (Library of Congress Subject Headings)
https://classweb.org/Menu/
SACO Participants' Manual - 2nd ed. / Adam Schiff
http://www.loc.gov/catdir/pcc/saco/SACOManual2007.pdf
Subject Authority Cooperative Program (SACO)
http://www.loc.gov/catdir/pcc/saco/saco.html SACO provides a means for libraries to submit subject headings and classification numbers to the Library of Congress.

SACO proposal form, password required: http://classificationweb.net/Menu/subject.html

See guidelines at: SACO
Library of Congress Subject Headings (LCSH) approved lists (monthly)
http://www.loc.gov/aba/cataloging/subject/weeklylists/
Library of Congress Subject Headings browsing tools / B. Eversberg Explanation and links - http://www.biblio.tu-bs.de/db/lcsh/ Boolean search option - http://www.biblio.tu-bs.de/db/lcsh/detail.php
Thesaurus for Graphic Materials / Library of Congress
http://www.loc.gov/rr/print/tgm1/ "As of October 2007, the Thesaurus for Graphic Materials I: Subject Terms (TGM I) and the Thesaurus for Graphic Materials II: Genre and Physical Characteristic Terms (TGM II) were merged into a single vocabulary, the Thesaurus for Graphic Materials, and migrated to new software. Application guidelines for genre/format terms (TGM II) and a separate genre/format term list for downloading will continue to be available at the TGM II home page."
Canadian Subject Headings CSH on the Web - http://www.nlc-bnc.ca/6/23/index-e.html CSH is a list of standard subject headings (in English) on Canadian topics, which complements Library of Congress Subject Headings (LCSH).
IFLA Section on Classification and Indexing
http://www.ifla.org/en/classification-and-indexing The Section focuses on methods of providing subject access in catalogs, bibliographies, and indexes to documents of all kinds, including electronic documents. It serves as a forum for producers and users of classification and subject indexing tools.
654 Faceted subject headings - AAT application
http://www.getty.edu/research/tools/vocabulary/aat/index.html In the term records, the Hierarchy field is followed by a code in brackets, e.g., [KT]. The first letter is input in $c, preceding the term subfield to which it applies. $c is mandatory. Example: $c k $b Spanish $c h $a engineers. $2 aat
Web Resources for SACO Proposals / PCC
http://www.loc.gov/catdir/pcc/saco/resources.html
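
The 654 example given in the AAT entry above can be assembled mechanically. The following is a minimal sketch, assuming each term arrives as a (facet code, subfield, term) triple; the function name build_654 and the plain-string output are illustrative assumptions, not part of any MARC library.

```python
def build_654(facets, source="aat"):
    """Build a display string for a MARC 654 faceted heading.

    facets: list of (facet_code, subfield, term) tuples, e.g. ("k", "b", "Spanish").
    Each term is preceded by a $c carrying its facet code.
    """
    parts = []
    for facet_code, subfield, term in facets:
        if not facet_code:
            # The guide states that $c is mandatory.
            raise ValueError("$c is mandatory for every term")
        parts.append(f"$c {facet_code} ${subfield} {term}")
    # Period after the last term, then the vocabulary source code in $2.
    return " ".join(parts) + f". $2 {source}"


heading = build_654([("k", "b", "Spanish"), ("h", "a", "engineers")])
print(heading)
```

Raising an error on a missing facet code is one way to enforce the rule that $c is mandatory; a cataloging tool might instead flag the record for review.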

Fiction and Genre

GSAFD file
http://files.library.northwestern.edu/public/gsafd/ File gsafd.mrc.txt best viewed on Internet Explorer; stored by Gary Strawn at Northwestern Library's FTP server.
Demographic Group Terms Manual - draft Library of Congress Demographic Group Terms (LCDGT) manual: http://www.loc.gov/aba/publications/FreeLCDGT/freelcdgt.html/
LC Genre/Form Terms Manual - draft The instruction manual for Library of Congress Genre/Form Terms for Library and Archival Materials (LCGFT). The manual consists of guidelines and instructions for assigning genre/form terms and proposing new ones: http://www.loc.gov/aba/publications/FreeLCGFT/freelcgft.html
Juvenile literature / fiction / Materials intended for young readers LC's Descriptive Cataloging for Juvenile works: http://www.loc.gov/aba/cyac/descriptive_cyac.html Subject Headings Manual H1690:

Treat topical (non-fiction) materials intended primarily for children and young people through the age of 15, or 9th grade, as juvenile

Treat fiction intended primarily for children and young people through high school age as juvenile

Form Subdivisions

Subdivision authority records (18X) / Cataloging Policy and Support Office
http://lcweb.loc.gov/catdir/cpso/subdauth.html As announced in March 1998 by the Cataloging Policy and Support Office (CPSO), the Library of Congress has begun to create subdivision authority records to control the approximately 3,100 topical, form, and chronological free-floating subdivisions in the Library of Congress Subject Headings system. These records contain subdivision data in 18X fields and codes in 073 fields that identify their controlling instruction sheet numbers from the Subject Cataloging Manual: Subject Headings (H 1095 - H 1200).
LCSH genre form headings, GSAFD, LCSH, LCGFT / Joel Hahn
http://www.hahnlibrary.net/libraries/formgenre.html
LCGFT - MARC Genre/form code and term source codes
http://www.loc.gov/standards/sourcelist/genre-form.html
LCGFT - Library of Congress Genre-Form Thesaurus (LCGFT) for Moving Images: Best Practices / OLAC
http://www.olacinc.org/drupal/capc_files/LCGFTbestpractices.pdf
LCGFT - Music Library Association Best practices
http://c.ymcdn.com/sites/www.musiclibraryassoc.org/resource/resmgr/BCC_Resources/BPsForUsingLCMPT_14022017.pdf
LCMPT - MARC Music instrumentation and voice code source codes
http://www.loc.gov/standards/sourcelist/musical-instrumentation.html
LCMPT - Best practices / Music Library Association
http://c.ymcdn.com/sites/www.musiclibraryassoc.org/resource/resmgr/BCC_Resources/BPsForUsingLCMPT_22022016v2.pdf Terms listed in Classification Web http://classificationweb.net/Auto/
LCDGT - MARC Genre/form code and term source codes Occupation Term Source codes: http://www.loc.gov/standards/sourcelist/occupation.html Gender Code and Term Source codes: http://www.loc.gov/standards/sourcelist/gender.html Subject Heading and Term Source codes include LCDGT: http://www.loc.gov/standards/sourcelist/subject.html
Moving Image Genre-Form Guide / Motion Picture/Broadcasting/Recorded Sound Division, Library of Congress
http://lcweb.loc.gov/rr/mopic/migintro.html It defines the concepts of form and genre separately. It makes those definitions operational by providing separate lists of terms. Terms from the form and genre lists are combined in a faceted manner.
Moving Image Genre-Form headings / OLAC
http://www.olacinc.org/drupal/capc_files/GenreFormHeadingsList.pdf

LC Announcements

Summary of Decisions / Library of Congress / SACO
http://www.loc.gov/aba/pcc/saco/cpsoed/cpsoeditorial.html
Library of Congress Cataloging and Acquisitions (ABA) home including news and subject and genre/form headings information
http://www.loc.gov/aba/

Other Thesauri

BISAC
BISAC subject headings list, major subjects: http://www.bisg.org/what-we-do-0-136-bisac-subject-headings-list-major-subjects.php
Fact Sheet: UMLS® Metathesaurus®
http://www.nlm.nih.gov/pubs/factsheets/umlsmeta.html The UMLS Metathesaurus is one of three knowledge sources developed and distributed by the National Library of Medicine as part of the Unified Medical Language System® (UMLS®) project. The Metathesaurus contains information about biomedical concepts and terms from many controlled vocabularies and classifications used in patient records, administrative health data, bibliographic and full-text databases and expert systems.
Fact Sheet : Unified Medical Language System
http://www.nlm.nih.gov/pubs/factsheets/umls.html In 1986, the National Library of Medicine (NLM) began a long-term research and development project to build a Unified Medical Language System® (UMLS®). The purpose of the UMLS is to aid the development of systems that help health professionals and researchers retrieve and integrate electronic biomedical information from a variety of sources and to make it easy for users to link disparate information systems, including computer-based patient records, bibliographic databases, factual databases, and expert systems.
FAST / OCLC searchFAST
http://fast.oclc.org/searchfast/ FAST subject headings were developed by adapting the Library of Congress Subject Headings (LCSH) with a simplified syntax, retaining the very rich vocabulary of LCSH while making the schema easier to understand, control, apply, and use. Several indexes and the ability to restrict the result to a desired FAST facet increase searching accuracy. The default result ranking is by usage, giving the most likely candidate heading near the top of the result, although alphabetic and facet order options are easily available. An autosuggest makes the selection process even easier.
NASA Thesaurus
http://www.sti.nasa.gov/thesfrm1.htm Text in PDF: http://www.sti.nasa.gov/thesvol1.pdf
Thesauri and Dictionaries / ABC
thesauri

SAC Subcommittee on Semantic Interoperability

REVISED DRAFT REPORT

Compiled report through March 4, 2006 (above)

Interoperability Projects / Lois Mai Chan See PROJECTS ADDRESSING OR RELATING TO INTEROPERABILITY ISSUES (below)
Program - Orlando 2004 See Enriching Subject Access (document below)
Subject Semantic Interoperability: Final Report ALCTS Subject Analysis Committee, Subcommittee on Semantic Interoperability
http://www.ala.org/ala/mgrps/divs/alcts/mgrps/ccs/cmtes/sac/inact/semantic/sacsem_rpt.pdf

Enriching Subject Access

Articles, etc.

ALCTS/CCS/SAC Subcommittee on Form Headings : Subdivisions Implementation, 1996-
http://www.pitt.edu/~agtaylor/ala/implem.htm
ALCTS CCS Subject Analysis Committee : subcommittees on metadata
http://www.ala.org/ala/mgrps/divs/alcts/mgrps/ccs/cmtes/ats-ccssac.cfm
Application of form data to works of fiction : discussion paper / Andrew MacEwan ...
http://www.pitt.edu/~agtaylor/ala/papers/blfictio.html
Educational forum: LCSH and subfield v
http://www.pitt.edu/~agtaylor/ala/edforum.htm
HILT - High-Level Thesaurus Project : building consensus for interoperable subject access across communities / Susannah Wake
http://www.dlib.org/dlib/september01/wake/09wake.html
How Many Subdivisions Represent the Form of an Item? : Results of a Research Study / Arlene Taylor
http://www.pitt.edu/~agtaylor/ala/subfldv.htm
LC Action 2.3 Bates report summary (see document below)
MetaSearch Initiative
http://www.niso.org/workrooms/mi Metasearch, parallel search, federated search, broadcast search, cross-database search, search portal have become commonplace in the information community's vocabulary. They speak to a common theme of allowing search and retrieval to span multiple databases, sources, platforms, protocols, and vendors at once. One-search access to multiple resources holds the promise of enabling libraries to offer portal environments so their users can enjoy the same easy searching found in web-based services like Google.
Subdivision report followup
http://www.pitt.edu/~agtaylor/ala/followup.htm

SAC-LC-Action-BatesDraftRecmdn-Nov02

PROJECTS ADDRESSING OR RELATING TO INTEROPERABILITY ISSUES

(compiled by Lois Mai Chan and Marcia Lei Zeng)

ADL Thesaurus Protocol, University of California, Santa Barbara, USA
Subject coverage: no limit

Types of KOS involved: thesaurus

Languages involved: English

Website: http://www.alexandria.ucsb.edu/~gjanee/thesaurus/specification.html

Status: operational, prototype

The Alexandria Digital Library (ADL) Project at the University of California, Santa Barbara, focuses on the design and implementation of geospatial digital libraries and has been involved with the design and building of collections, services, and KOS since the beginning of the NSF DL funding in 1994 (ADL Homepage). In 2001-2002, the ADL Implementation team developed both a Gazetteer and a Thesaurus Service Protocol. They are lightweight, stateless, XML- and HTTP-based protocols. Both are designed to support programmatic searching and retrieval of distributed online resources. The Thesaurus Protocol is based on the ANSI/NISO (1993) Z39.19 thesaurus model and supports downloading, querying, and navigating thesauri. Like the Gazetteer Service Protocol, all that is required for its use is the development of a thesaurus server that can accept the specified XML-encoded queries and return the specified standard reports. Theoretically, once a server has installed the program and linked it to a potential thesaurus, it can search various thesauri distributed over the Web, and receive and process protocol-type queries sent by special chains or by thesaurus lookup embedded in other programs (Janée, Ikeda, and Hill, 2002).

CAMed, Columbia University and Kent State University, USA
Subject coverage: alternative and complementary medicine

Types of KOS involved: thesaurus

Languages involved: English, French

Website: http://circe.slis.kent.edu/mzeng/tmshome.html

Status: prototype

In the international collaborative project CAMed, a comprehensive resource for complementary and alternative medicine (CAM), researchers at Columbia University and Kent State University developed an integrated thesaurus management and cross-thesaurus search system. In the prototype, four thesauri in the areas of CAM were normalized and stored in a thesaurus repository. This system allows a database manager to manage and edit a thesaurus from a local office (in the manager's own country) through the Web interface, while the thesauri are deposited and hosted on the server at Kent State University. The cross-thesaurus search function of the system allows a user to type one term and search all or any of the thesauri in this repository. Software matches the query against the thesauri and gives back all fully or partially matched thesaurus entries. When a term is selected from the search results, a user can see the details of a thesaurus term entry (including the broader, narrower, and related terms, as well as non-preferred terms) and continue selecting among the terms displayed. The term search eventually enables a direct search in four bibliographical databases (samples) that have been integrated in the prototype. The term search function also extends to the full-text searching of all resources in the CAMed website (Zeng and Chen, 2003).
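
The cross-thesaurus search described above, which returns fully and partially matched entries, can be sketched as follows. This is a minimal illustration, assuming that the repository is a dictionary of term lists and that a partial match means substring containment; the source does not specify the matching rule, and the sample data are hypothetical rather than taken from the CAMed thesauri.

```python
def cross_thesaurus_search(query, repository):
    """Return (thesaurus, term, match_type) for full or partial matches."""
    q = query.strip().lower()
    hits = []
    for name, terms in repository.items():
        for term in terms:
            t = term.lower()
            if t == q:
                hits.append((name, term, "full"))
            elif q in t:
                hits.append((name, term, "partial"))
    # Full matches first, then partial ones, alphabetically within each group.
    return sorted(hits, key=lambda h: (h[2] != "full", h[1].lower(), h[0]))


# Hypothetical sample repository.
repository = {
    "Thesaurus A": ["Acupuncture", "Acupuncture points", "Herbal medicine"],
    "Thesaurus B": ["Acupuncture", "Massage"],
}
results = cross_thesaurus_search("acupuncture", repository)
for hit in results:
    print(hit)
```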

CARMEN, Germany
Subject coverage: mathematics, physics, social science

Types of KOS involved: classification scheme, thesaurus

Languages involved: German, English

Website: http://www.mathematik.uni-osnabrueck.de/projects/carmen/

Status: Research and development

CARMEN (Content Analysis, Retrieval, Metadata: Effective Networking), a specially funded project within the Global Info German Digital Library Project in Germany, consists of several Work Packages (WP). Its WP 12 is named "Cross concordances of classifications and thesauri." The emphasis in CARMEN is on mathematics and physics, but for methodological reasons the subject-oriented frame also includes social sciences. It began with correlating different German thesauri that are used to index social science literature through intellectual and statistical methods simultaneously. It also proposed to establish a concordance between the general classification, DDC, and special classifications, including the Regensburger Verbundklassifikation (RVK) in the areas of mathematics and physics, the American Mathematical Society (AMS) Mathematics Subject Classification (MSC), and American Institute of Physics (AIP) Physics and Astronomy Classification Scheme (PACS) (CARMEN WP12, 2000). One part of the CARMEN Project concerns itself with the association of the IZT (Informationszentrum Sozialwissenschaften = Information Centre for Social Sciences) thesaurus with the SWD (Schlagwortnormdatei). The method by which equivalencies are determined and links created is rather simple: starting from alphabetical lists which contain descriptors from a specific subject area, the relationships between the two thesauri are determined intellectually. Several thousand descriptors have been processed so that equivalent relationships between the thesaurus of the Informationszentrum Sozialwissenschaften, the thesaurus of the Deutsches Institut fuer Paedagogische Forschung (German Institute for Educational Research), and the subject word authority file which was based on a unified scheme for subject heading, namely the RSWK (Regeln für den Schlagwortkatalog), have been established and then recorded in a link management system (Kunz, 2002).

Classification Web, Library of Congress, USA
Subject coverage: general

Types of KOS involved: subject heading list, classification scheme

Languages involved: English

Website: http://classweb.loc.gov/

Status: operational

For years, the Library of Congress of the United States has attempted to provide links between Library of Congress Subject Headings (LCSH) and Library of Congress Classification (LCC) numbers where appropriate. Initially the links resided in LCSH only, but now they appear in linked LC classification authority records as well. Under valid headings in the print version of LCSH, the equivalent LC classes or specific numbers are listed. In the subject authority records for the terms in question, the corresponding class numbers are included. In Classification Web (a web-based interface under development), the user can move across the schemes through the links between the class numbers and subject headings (Classification Web website, 2002).

Finnish project, Finland

Subject coverage: general

Types of KOS involved: classification scheme, subject heading list

Languages involved: Finnish

Status: operational

Himanka and Kautto (1992) reported their work to convert assigned class numbers based on the Finnish abridged edition of UDC into General Finnish Subject headings (GFSH). First, a dictionary is created that maps UDC numbers to subject headings. Secondly, the dictionary is mechanically applied to convert the bibliographic databases.
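
The two-step conversion can be sketched as a dictionary lookup applied to every record. This is a minimal illustration, assuming records are plain dictionaries with a list of class numbers; the mapping entries and field names are hypothetical, since the source gives no actual UDC-to-GFSH pairs.

```python
# Step 1: a dictionary mapping class numbers to subject headings.
# Hypothetical entries for illustration only.
udc_to_heading = {
    "51": ["Mathematics"],
    "53": ["Physics"],
}


def convert_records(records, mapping):
    """Step 2: mechanically add subject headings based on class numbers."""
    converted = []
    for record in records:
        headings = []
        for number in record["udc"]:
            for heading in mapping.get(number, []):
                if heading not in headings:  # avoid duplicate headings
                    headings.append(heading)
        converted.append({**record, "subjects": headings})
    return converted


records = [{"id": 1, "udc": ["51", "53"]}, {"id": 2, "udc": ["99"]}]
converted = convert_records(records, udc_to_heading)
print(converted)
```

Class numbers missing from the dictionary simply yield no heading here; a production conversion would presumably log them for manual treatment.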

HEREIN, Council of Europe

Subject coverage: cultural heritage

Types of KOS involved: thesaurus

Languages involved: Spanish, French, English

Website of the project:

http://inf2.pira.co.uk/factsheets/inform/digicult/herein2.html

Status: operational through website:

http://www.european-heritage.net/sdx/herein/index.xsp

The HEREIN Project (European Heritage Information Network on cultural heritage policies) of the European Heritage Net has developed an interlingua with no direct reference to the terms or to the structure of any pre-existing thesaurus (http://www.european-heritage.net/en/index.html, select Thesaurus). Most of the terms in the thesaurus come from reports on cultural heritage policies in Europe. With these reports, each of the three teams responsible for the establishment of the thesaurus (from Spain, France, and the United Kingdom) created a separate list of terms in its own language. These lists were also supplemented with additional terms gathered from specialized documentary sources. The three teams then compared their lists so as to obtain a pool of words with linguistic equivalences in the three languages; the idea was to bring out the different classes intended to represent the first, broadest level. Each term selected was placed into the most relevant class. Within each class, terms were ordered following the same hierarchical relationship for all linguistic versions of the thesaurus. During this stage of work, terms regarded as too specific to one language and those reflecting regional idiosyncrasies were not excluded from the thesaurus but treated as equivalent terms to a concept common to all three languages.

In building the tri-lingual vocabularies within the HEREIN Project, the thesaurus follows the five types of equivalence relationships defined in the ISO (1985) 5964 standard: exact equivalence; inexact equivalence; partial equivalence; single-to-multiple term equivalence; and non-equivalence. However, HEREIN deviates from the ISO 5964 standard in that it does not designate a source language (Thérond, 2002).

The project originally involved governmental services in charge of cultural heritage in six European countries, but the network was later expanded. The www.european-heritage.net site was opened in 1999 and was concerned with collecting information on national heritage policies across Europe. In addition to the concise data, the site was also a portal to computerized databanks and selected Internet sites, and included a multilingual thesaurus in English, French and Spanish to index the databank and clarify the concepts (HEREIN 2 website).

HILT (High-Level Thesaurus Project), UK

Subject coverage: general and special

Types of KOS involved: thesaurus, classification scheme, subject heading list

Languages involved: multiple languages

Website: http://hilt.cdlr.strath.ac.uk/

Status: Research and Development

HILT (High-Level Thesaurus Project) Phase I was funded by Joint Information Systems Committee (JISC) and the Research Support Libraries Programme (RSLP) of the United Kingdom, and HILT Phase II is funded by the JISC. HILT1, a one-year project initiated in September 2000 in U.K., investigated the problem of searching and browsing across a number of distributed services using different indexing vocabularies and attempted to derive a set of recommendations to help facilitate cross-searching and browsing by subject between communities, services, and initiatives. These included archives, the Further and Higher Education sectors, libraries, museums, the National Grid for Learning, and the Resource Discovery Network, etc., which usually use different subject schemes (HILT, 2000).

After completing a series of surveys on the main literature, stakeholders, and machine solutions and interfaces, a stakeholder workshop was held in 2001 for the purpose of reaching consensus on the best approach to address the issues. A clear consensus emerged that the best way forward was to establish a pilot mapping service. The proposed approach is to map key schemes like LCSH, UNESCO, DDC, UDC (Universal Decimal Classification), AAT, and perhaps user and regional terminologies and local adaptations of standard schemes, perhaps using one of them such as DDC as the central spine of the approach (Nicholson, Wake and Currier, 2001; Nicholson and Wake, 2003). HILT1's conclusions are: many different subject schemes and practices were in use; cross searching by subject was considered of value to users and staff; and an online terminologies route map or TeRM that would map subject schemes to user terminologies and to each other was the preferred solution. HILT1 also concluded that there was a strong consensus favoring a project to create a pilot TeRM and investigate these issues. HILT Phase II moves the Phase I process into the 'Pilot Project' stage, focusing on terminology and thesauri requirements at the collection level, but also bearing in mind the need to extend this in due course to the needs of item level retrieval. The initial illustrative TeRM would be based on the RDN terminologies, on terminologies available as part of the Wordmap taxonomies set, which include, in particular, a set of terms used by general internet users, and on selective subsets of LCSH, DDC, UNESCO, and AAT. OCLC will provide an LCSH-DDC mapping, and may also be able to provide a DDC to Conspectus subject headings mapping. The aim would be a selective mapping sufficient for the purposes of the pilot in the first instance, i.e., not a comprehensive terminologies map (Nicholson, 2002).

LCSH and MeSH, Northwestern University Libraries, USA
Subject coverage: general, medicine

Types of KOS involved: subject heading list

Languages involved: English

Status: Operational

LCSH (Library of Congress subject headings) and MeSH (Medical subject headings) often co-exist in online public access catalogs (OPAC). In an attempt to facilitate cross-vocabulary searching, the Northwestern University Libraries in the United States embarked on a project to map terms between these two vocabularies. The method adopted is to first use automatic data processing methods to generate heading pairs that represent potential correspondences between LCSH and MeSH. These heading pairs are then reviewed by subject editors to verify true correspondence. The 7XX linking fields of the MARC21 authority format are used as linking mechanisms. Thus, the mapping data reside in authority records. The data is maintained by reviewing the weekly updates to LCSH and the annual updates to MeSH to find new, deleted, or changed headings, after which the mapping data is adjusted accordingly (Olson, 2003).

In mapping terms, one-to-one correspondence is preferred. These include identical/co-extensive headings, main heading to main heading/subdivision, and main heading to cross reference(s). Since not all terms can be mapped precisely between the two vocabularies, various degrees of correspondence or matching are also recognized. These include one-to-two and two-to-one correspondences (Olson, 2003).
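
The first, automatic stage (generating heading pairs for editors to review) can be sketched as below. This is a minimal illustration that pairs headings whose normalized forms are identical; the source does not describe the matching rule actually used at Northwestern, so the normalization here is an assumption, and the sample headings are hypothetical stand-ins rather than verified LCSH or MeSH data.

```python
import re


def normalize(heading):
    """Lowercase, drop punctuation, and sort words so inverted forms match."""
    words = re.findall(r"[a-z0-9]+", heading.lower())
    return " ".join(sorted(words))


def candidate_pairs(lcsh, mesh):
    """Pair headings with identical normalized forms, for editor review."""
    index = {}
    for m in mesh:
        index.setdefault(normalize(m), []).append(m)
    pairs = []
    for l in lcsh:
        for m in index.get(normalize(l), []):
            pairs.append((l, m))
    return pairs


# Hypothetical sample headings.
lcsh = ["Medicine, Chinese", "Heart", "Cookery"]
mesh = ["Chinese Medicine", "Heart"]
pairs = candidate_pairs(lcsh, mesh)
print(pairs)
```

The output is only a list of potential correspondences; as the report notes, subject editors still verify each pair before it is recorded in the authority records.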

MACS (Multilingual Access to Subjects), Europe

Subject coverage: general

Types of KOS involved: subject heading list

Languages involved: English, French, German

Website: http://infolab.kub.nl/prj/macs/

Status: operational

MACS is a European project designed to allow users to search across cataloging databases of the partner libraries in different languages: English, French, and German for the moment. The partners are: the Swiss National Library (SNL), project leader, the Bibliothèque nationale de France (BnF), The British Library (BL) and Die Deutsche Bibliothek (DDB). The project is running under the auspices of the Conference of European National Librarians (CENL). It aims "to provide multilingual subject access to library catalogs," by establishing equivalence links among three subject headings lists: SWD/RSWK (Schlagwortnormdatei / Regeln für den Schlagwortkatalog) for German, RAMEAU (Répertoire d'autorité-matière encyclopédique et alphabétique unifié) for French, and LCSH for English. The method employed for mapping consists of comparing subject headings in three monolingual lists and checking the consistency of bibliographic records retrieved with these headings. The links were analyzed on three levels: terminological level (subject heading), semantic level (authority record), and syntactic level (application). For creating and maintaining link equivalences among the three vocabularies, a "link management" interface was developed. It contains a classification field currently based on about sixty broad domains. The use of a classification ensures the creation of homogeneous groups of headings by subject (Freyre and Naudi, 2003).

Megathesaurus - H.W. Wilson Company, USA

Subject coverage: general and special

Types of KOS involved: thesauri

Languages involved: English

Status: operational

To facilitate multi-file searching across Wilson databases that have been indexed with different controlled vocabularies, the H. W. Wilson Company has developed and is maintaining a megathesaurus, in a sense, a thesaurus of thesauri. Each record in the megathesaurus contains the authority main term or megathesaurus term, which serves as the anchor to which equivalent terms, along with their respective relational terms, from twelve vocabularies are mapped and stored. An automatic switching mechanism in searching has been developed, which enables the user to search a single index, multiple indexes simultaneously, or the combined indexes in the multi-file OMNI Index in a transparent manner, by using search terms based on any of the source vocabularies (Kuhr, 2003).
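
The anchor-and-switch idea can be sketched as a lookup that translates a term from any source vocabulary into the terms used by the target indexes. This is a minimal illustration, assuming a simple record layout; the vocabulary names, terms, and function name are hypothetical and do not reflect the Wilson implementation.

```python
# Each record anchors equivalent terms from several source vocabularies.
# Vocabulary names and terms are hypothetical.
megathesaurus = [
    {
        "anchor": "Automobiles",
        "equivalents": {
            "vocab_a": "Automobiles",
            "vocab_b": "Cars",
            "vocab_c": "Motor cars",
        },
    },
]


def switch_terms(search_term, target_vocabs, records):
    """Translate a term from any source vocabulary into the target vocabularies."""
    wanted = search_term.lower()
    for record in records:
        known = {t.lower() for t in record["equivalents"].values()}
        known.add(record["anchor"].lower())
        if wanted in known:
            return {
                v: record["equivalents"][v]
                for v in target_vocabs
                if v in record["equivalents"]
            }
    return {}  # no record contains the term


switched = switch_terms("cars", ["vocab_a", "vocab_c"], megathesaurus)
print(switched)
```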

Merimee, France

Subject coverage: cultural heritage, art and architecture

Types of KOS involved: thesaurus

Languages involved: English, French

Website: http://www.culture.fr/documentation/merimee/accueil.htm

Status: operational

For the purpose of indexing complexes, buildings, and built works described in the national database "Merimee" about the French Heritage, The Thesaurus of Architecture (Le thésaurus de l'architecture) was created and mapped to the Art and Architecture Thesaurus (AAT, http://www.getty.edu/research/tools/vocabulary/aat/, published by The J. Paul Getty Trust) and the English Heritage Thesaurus (http://www.rchme.gov.uk/nmr.html, published by The National Monuments Record (NMR)). When mapping from Merimee's Thesaurus of Architecture to the AAT and NMR, Boolean operators AND and OR are used to indicate the equivalence, in addition to the conventional equivalence types (exact and partial). (See statistics reported in Doerr, 2001 and http://www.culture.gouv.fr/documentation/thesarch/pres.htm).

MSC and DDC 510 schedule, State University of New York in Albany, USA

Subject coverage: mathematics

Types of KOS involved: classification schemes

Languages involved: English

Status: Research

A project which maps the American Mathematical Society (AMS) Mathematics Subject Classification (MSC) to the DDC 20th edition Schedule 510 (mathematics) was conducted at the State University of New York in Albany, New York. The mapping rules included: exact matches, specific to general, general to specific, many to one, cyclic mapping, no matches, and specific and broad class mapping. These mapping strategies are examined in an object-oriented, frame-based analysis for implementation in the expert system shell software (Iyer and Giguere, 1995).

Polish Project, Poland

Subject coverage: general

Types of KOS involved: classification scheme, subject heading list, thesaurus

Languages involved: English, Polish

Status: research

At the Institute for Scientific, Technical and Economic Information (ISTEI) in Warsaw (Poland), a project was conducted in 1992-93 for the establishment of concordances for four controlled vocabularies: Polish Thematic Classification (PTC), descriptors based on the Thesaurus of Common Topics (TCT), Universal Decimal Classification (UDC), and Subject-Heading Language (SHL) of the National Library in Warsaw. The PTC was chosen as the master language whereas the three others served as target languages (Scibor and Tomasik-Beck, 1994).

Renardus, Europe

Subject coverage: general

Types of KOS involved: classification scheme

Languages involved: multilingual

Website: http://www.renardus.org/

Status: operational, research

Renardus is an EU project (coordinated by the National Library of the Netherlands with partners from Denmark, Finland, Germany, the Netherlands, Sweden, and the UK) with the purpose of producing a cross-browsing feature based on the Dewey Decimal Classification (DDC) and improved subject searching across distributed and heterogeneous European subject gateways. The initial investigation included the use of classification systems by Renardus partners' gateways, general mapping approaches and issues, the definition of mapping relationships, and information on technical solutions and the mapping tool. The approach adopted by the project is a harmonization process that maps local class schemes to a common scheme, thereby enabling users to browse a single subject hierarchy. DDC was chosen as the switching language and common browsing structure.

Each DDC class in Renardus presents links to "related collections" which enable the user to jump to the mapped classes in the participating local gateways and to continue browsing in the local classification structure there. In addition, a virtual browsing feature allows the merging of all local related records from all mapped classes into one common Renardus result set (Koch, Neuroth, and Day, 2003).
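
Using DDC as a switching language can be sketched with two lookups: one from a DDC class to the local classes mapped to it, and one that merges the records behind those classes. This is a minimal illustration; the gateway names, local class codes, records, and function names are hypothetical and are not drawn from the Renardus mapping tool.

```python
# Local classes mapped to DDC classes (hypothetical gateways and codes).
local_to_ddc = {
    ("gateway_x", "MATH"): "510",
    ("gateway_y", "Q2"): "510",
    ("gateway_y", "Q3"): "530",
}
# Records held under each local class (hypothetical).
local_records = {
    ("gateway_x", "MATH"): ["x-record-1", "x-record-2"],
    ("gateway_y", "Q2"): ["y-record-7"],
    ("gateway_y", "Q3"): ["y-record-9"],
}


def related_collections(ddc_class):
    """Local classes mapped to a DDC class (the 'related collections' links)."""
    return sorted(key for key, ddc in local_to_ddc.items() if ddc == ddc_class)


def merged_result_set(ddc_class):
    """Merge the records of every local class mapped to the DDC class."""
    merged = []
    for key in related_collections(ddc_class):
        merged.extend(local_records.get(key, []))
    return merged


print(related_collections("510"))
print(merged_result_set("510"))
```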

SAB and DDC, Sweden

Subject coverage: general

Types of KOS involved: classification schemes

Languages involved: Swedish, English

Status: operational

In Sweden, a concordance between Klassifikationssystem för svenska bibliotek (SAB) 7th edition, the classification system used by the Royal Library as well as most university libraries and all public libraries, and DDC 21 was presented in 2000 in the format of a booklet and an online database (IFLA, 2001:34).

UC Berkeley DARPA Unfamiliar Metadata Project, USA
Subject coverage: science, engineering

Types of KOS involved: classification scheme, thesaurus, subject heading list

Languages involved: English, French, German, Russian, Spanish

Website: http://metadata.sims.berkeley.edu/GrantSupported/unfamiliar.html

Status: prototype, research

The project "Mapping Entry Vocabulary to Unfamiliar Metadata Vocabularies" was conducted at the University of California, Berkeley. As stated on the project website, the researchers plan to develop Entry Vocabulary Indexes that accept topical statements in the searcher's terms ("Query vocabularies") and respond with a ranked list of terms in the system's vocabulary ("Entry vocabularies"). The prototype Entry Vocabulary Modules included English-language indexes to BIOSIS Concept Codes, the INSPEC Thesaurus, the U.S. Patent and Trademark Office Patent Classification, and the Standard Industrial Classification (SIC) codes, and a multilingual index (supporting queries in English, French, German, Russian, or Spanish) to the physical sciences sections of the Library of Congress Classification (LCC). When the Entry Vocabulary Module leads users to a promising term in the target metadata vocabulary, a search can be executed against a remote database using the newly found metadata (Buckland et al., 1999).
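The behaviour of an entry vocabulary index, free-text query in and ranked controlled terms out, can be illustrated with a minimal Python sketch. It assumes a small set of already indexed records and ranks terms by simple word co-occurrence counts; this is a stand-in chosen for brevity, not the association measure used by the Berkeley project, and the training data and function names are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical training data: (title words, controlled term assigned by an indexer).
TRAINING = [
    ("laser light optics", "Optics"),
    ("laser cutting of steel", "Metalworking"),
    ("fiber optics networks", "Optics"),
    ("steel alloys", "Metallurgy"),
]


def build_index(training):
    """Count how often each word co-occurs with each controlled term."""
    index = defaultdict(Counter)
    for text, term in training:
        for word in set(text.lower().split()):
            index[word][term] += 1
    return index


def rank_terms(index, query):
    """Return controlled terms ranked by summed co-occurrence with the query words."""
    scores = Counter()
    for word in query.lower().split():
        scores.update(index.get(word, {}))
    # Sort by descending score, then alphabetically for a stable order.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))
```

A searcher's query such as "laser optics" is passed to `rank_terms(build_index(TRAINING), "laser optics")`, which returns candidate terms from the target vocabulary with the most strongly associated term first.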

UMLS Metathesaurus, National Library of Medicine, USA

Subject coverage: medicine, health, biomedicine, and related areas

Types of KOS involved: thesaurus, subject heading list, classification scheme, coding system, list of controlled terms

Languages involved: multiple languages

Website: http://www.nlm.nih.gov/pubs/factsheets/umls.html

Status: operational

UMLS (Unified Medical Language System), led by the National Library of Medicine in the United States, is probably the most ambitious project in harmonizing different vocabularies. It consists of three "UMLS Knowledge Sources": the Metathesaurus, which is the core source, the SPECIALIST lexicon, and the UMLS Semantic Network. The UMLS Metathesaurus is a database containing semantic information about biomedical concepts, their various names, and the relationships among them. It is built from over 100 biomedical source vocabularies, some in multiple languages. These include thesauri, classification systems, coding systems, and lists of controlled terms that have been developed and are maintained by many different organizations. The 2003 edition of the Metathesaurus includes 875,255 concepts and 2.14 million concept names (NLM, 2001, 2003).

For mapping index terms from different thesauri, UMLS uses the Semantic Network, which, through its 134 semantic types, provides a consistent categorization of all concepts represented in the UMLS Metathesaurus. The semantic types are the nodes in the Network, and the relationships between them are the links. There are major groupings of semantic types for organisms, anatomical structures, biologic function, chemicals, events, physical objects, and concepts or ideas. The current scope of the UMLS semantic types is broad, allowing for the semantic categorization of a wide range of terminology in multiple domains. The primary link is the "isa" link. This establishes the hierarchy of types within the Network and is used for deciding on the most specific semantic type available for assignment to a Metathesaurus concept (NLM, 2003).
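The role of the "isa" link in choosing the most specific semantic type can be illustrated with a short Python sketch. The hierarchy below is a simplified, hypothetical fragment written for this example and should not be read as an extract from any release of the Semantic Network; the function names are likewise assumptions.

```python
# Illustrative "isa" links (child -> parent); a simplified, hypothetical fragment.
ISA = {
    "Physical Object": "Entity",
    "Organism": "Physical Object",
    "Animal": "Organism",
    "Mammal": "Animal",
    "Anatomical Structure": "Physical Object",
}


def ancestors(semantic_type):
    """Follow "isa" links upward and return every broader type, nearest first."""
    found = []
    while semantic_type in ISA:
        semantic_type = ISA[semantic_type]
        found.append(semantic_type)
    return found


def most_specific(candidates):
    """Keep only the candidates that are not an ancestor of another candidate."""
    broader = {a for c in candidates for a in ancestors(c)}
    return sorted(c for c in set(candidates) if c not in broader)
```

Given several candidate types for one concept, `most_specific` discards any type that is merely a broader node on the same "isa" path, leaving the most specific type or types for assignment.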

WebDewey, OCLC, USA

Subject coverage: general

Types of KOS involved: subject heading list, classification scheme

Languages involved: English

Website: http://www.oclc.org/dewey/products/webdewey/about.htm

Status: operational

In WebDewey, produced by OCLC, DDC numbers are linked to the LC subject headings assigned in MARC records, intellectually or statistically, where feasible. Such linking particularly facilitates the subject cataloging and classification process, because the cataloger needs to identify only the appropriate class number or the subject heading(s) for each document. These links, however, do not appear in LCSH (WebDewey website, 2002).
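One way to picture statistical linking is to count how often a class number and a subject heading appear in the same bibliographic record. The Python sketch below does only that. It is an assumption-laden illustration, not OCLC's method: the records are invented and already parsed, and the `min_count` threshold is a hypothetical parameter.

```python
from collections import Counter, defaultdict

# Hypothetical, already parsed records: (DDC number, LC subject headings).
RECORDS = [
    ("025.3", ["Cataloging", "Subject headings"]),
    ("025.3", ["Cataloging"]),
    ("025.4", ["Classification", "Subject headings"]),
    ("025.3", ["Cataloging", "Metadata"]),
]


def link_headings(records, min_count=2):
    """Link each DDC number to headings that co-occur with it at least min_count times."""
    counts = defaultdict(Counter)
    for ddc, headings in records:
        counts[ddc].update(set(headings))
    return {
        ddc: [
            heading
            for heading, n in sorted(c.items(), key=lambda x: (-x[1], x[0]))
            if n >= min_count
        ]
        for ddc, c in counts.items()
    }
```

With links of this kind in place, a cataloger who has identified a class number can be shown candidate headings for it, which is the convenience described in the paragraph above.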

REFERENCES

ADL Homepage. Alexandria Digital Library Project. University of California, Santa Barbara. Available: http://www.alexandria.ucsb.edu (Last accessed February 15, 2003.)

ALA (American Library Association). (2000). Committee on Cataloging: Description and Access (CC:DA) Task Force on Metadata. Final Report. Available: http://www.ala.org/alcts/organization/ccs/ccda/tf-meta6.html (Last accessed February 15, 2003.)

ANSI/NISO (American National Standards Institute/National Information Standards Organization). (1993). Z39.19 - 1993 Guidelines for the Construction, Format, and Management of Monolingual Thesauri. Bethesda, MD: National Information Standards Organization.

Buckland, M., et al. (1999). Mapping entry vocabulary to unfamiliar metadata vocabularies, D-Lib Magazine, 5(1). Available: http://www.dlib.org/dlib/january99/buckland/01buckland.html (Last accessed February 15, 2003.)

CARMEN. WP12. (2000). Cross concordances of classifications and thesauri. Available: http://www.bibliothek.uni-regensburg.de/projects/carmen12/index.html (Last accessed February 15, 2003.)

Chan, L. M. & Pollard, R. (1988). Thesauri Used in Online Databases: An Analytical Guide. New York: Greenwood Press.

Chan, L. M., Childress, E., Dean, R., O'Neill, E.T., & Vizine-Goetz, D. (2001). A Faceted Approach to Subject Data in the Dublin Core Metadata Record. Journal of Internet Cataloging 4(1/2):35-47.

Chemical titles. (1960 --). Columbus, Ohio: American Chemical Society. ISSN 0009-2711

Classification Web website (2002). Available: http://classweb.loc.gov/ (Last accessed February 15, 2003.)

Cleverdon, C. (1967). The Cranfield tests on index language devices. Aslib Proceedings, 19: 173-192.

Dachelet, R. (1997). Multilingual querying and multilingual thesauri in Aquarelle, Technical Report, INRIA-Aquarelle, March. (Indirect source, see Doerr 2001).

Doerr, M. (2001). Semantic problems of thesaurus mapping. Journal of Digital Information, 1(8). Available: http://jodi.ecs.soton.ac.uk/Articles/v01/i08/Doerr/#Nr.52 (Last accessed February 15, 2003.)

English Heritage. (1999). National Monuments Record Thesauri homepage. Available: http://www.rchme.gov.uk/thesaurus/thes_splash.htm (Last accessed February 15, 2003.)

Ferrari, R.C. (1999). The art of classification: alternative classification systems in art libraries. Cataloging and Classification Quarterly, 28(2):73-98.

Foskett, D.J. (1980). Thesaurus. In: A. Kent et al. ed.: Encyclopedia of Library and Information Science, Volume 30, pp.416-462. New York: Marcel Dekker.

Freyre, E. & Naudi, M. (2003). MACS: Subject access across languages and networks. In: I.C. McIlwaine ed.: Subject Retrieval in a Networked Environment, Proceedings of the IFLA Satellite Meeting held in Dublin, Ohio 14-16 August 2001. München: K.G. Saur. pp.3-10.

Garfield, E. (1955). Citation indexes for science. Science, 122(3159):108-111.

Getty Research Institute. (2000). Vocabulary Databases. Available at: http://www.getty.edu/research/tools/vocabulary/ (Last accessed February 15, 2003.)

Gilreath, C.T. (1992). Harmonization of terminology: an overview of principles. International Classification 19(3):135-139.

HEREIN 2 website. European Heritage Network. Summary: Project Facts & Consortium Info. Available: http://inf2.pira.co.uk/factsheets/inform/digicult/herein2.html (Last accessed February 9, 2003.)

HILT. (2000). HILT: High-Level Thesaurus Project Proposal. Available: http://hilt.cdlr.strath.ac.uk/AboutHILT/proposal.html. (Last accessed February 15, 2003.)

Himanka, J., and Kautto, V. (1992). Translation of the Finnish Abridged Edition of UDC into General Finnish Subject Headings. International Classification 19(3):131-134.

Hudon, M. (1997). Multilingual thesaurus construction: integrating the views of different cultures in one gateway to knowledge concepts. Knowledge Organization 24(2): 84-91.

IFLA (International Federation of Library Associations and Institutions). (2001). Section on Classification and Indexing. Newsletter Nr. 24, December 2001.

ISKO (International Society for Knowledge Organization). (1995). Recommendations of the Research Seminar on Compatibility and Integration of Order Systems, organized by the International Society for Knowledge Organization (ISKO) and the Society for Professional Information (TIP), Warsaw, Poland, Sept. 13-15, 1995. ISKO Press Release, Frankfurt, Sept.21, 1995.

ISO (International Organization for Standardization). (1985). Guidelines for the establishment and development of multilingual thesauri. ISO 5964.

ISO (International Organization for Standardization). (1986). Guidelines for the Establishment and Development of Monolingual Thesauri. ISO 2788.

Iyer, H. & Giguere, K. (1995). Towards designing an expert system to map mathematics classificatory structures. Knowledge Organization 22(3/4):141-147.

Janée, G., Ikeda, S., & Hill, L. L. (2002). The ADL Thesaurus Protocol. Alexandria Digital Library Project. Available: http://www.alexandria.ucsb.edu/thesaurus/protocol/ (Last accessed February 15, 2003.)

Jouguelet, S. (1995). Evolution of subject indexing practice in France. In: R.P. Holley, et al. ed.: Subject Indexing: Principles and Practices in the 90's: Proceedings of the IFLA Satellite Meeting Held in Lisbon, Portugal, 17-18 August 1993, and Sponsored by the IFLA Section of Classification and Indexing and the Instituto da Biblioteca Nacional e do Livro, Lisbon, Portugal. München: K.G. Saur.

Koch, T., Neuroth, H., & Day, M. (2003). Renardus: cross-browsing European subject gateways via a common classification system (DDC). In: I.C. McIlwaine ed.: Subject Retrieval in a Networked Environment, Proceedings of the IFLA Satellite Meeting held in Dublin, Ohio 14-16 August 2001. München: K.G. Saur. pp.25-33. Available: http://www.lub.lu.se/~traugott/drafts/preifla-final.html (Last accessed February 15, 2003.)

Kuhr, P.S. (2003). Putting the world back together: mapping multiple vocabularies into a single thesaurus. In: I.C. McIlwaine ed.: Subject Retrieval in a Networked Environment, Proceedings of the IFLA Satellite Meeting held in Dublin, Ohio 14-16 August 2001. München: K.G. Saur. pp.33-42.

Kunz, M. (2002). Sachliche Suche in verteilten Ressourcen: ein kurzer Überblick über neuere Entwicklungen [Subject retrieval in distributed resources: a short review of recent developments]. Paper presented at 68th IFLA Council and General Conference, Aug. 18-24, 2002, Glasgow, UK. Available: http://www.ifla.org/IV/ifla68/papers/007-122g.pdf English translation available: http://www.ifla.org/IV/ifla68/papers/007-122e.pdf (Last accessed February 15, 2003.)

Lancaster, F. W., & Warner, A.J. (1993). Information Retrieval Today. Arlington, VA: Information Resources Press.

Lancaster, F.W. (1969). MEDLARS: Report on the Evaluation of Its Operating Efficiency. American Documentation, 20: 119-142.

Library of Congress Cataloging Distribution Service. (1999). MARC 21 Format for Bibliographic Data (Including Guidelines for Content Designation). Washington, DC: Cataloging Distribution Service of the Library of Congress. Web-based version is also available at: http://lcweb.loc.gov/marc/ (Last accessed February 15, 2003.)

Library of Congress Thesauri website. Available: http://www.loc.gov/lexico/servlet/lexico/tgm1/brsearch.html (Last accessed February 15, 2003.)

Luhn, H.P. (1959). Potentialities of Auto-Encoding of Scientific Literature. Technical report RC-101. Yorktown Heights, NY: IBM Research Center.

Luhn, H.P. (1961). The automatic derivation of information retrieval encodements from machine-readable texts. In: A. Kent ed.: Information Retrieval and Machine Translation. Vol.3, Pt 2, pp. 1021-1028. New York: Interscience Publication.

NAS (National Academy of Sciences). (1959). Proceedings of the International Conference on Scientific Information. 2 volumes. Washington, DC: National Academy of Sciences-National Research Council.

Nicholson, D., Wake, S., & Currier, S. (2001). High-Level Thesaurus Project: investigating the problem of subject cross-searching and browsing between communities. In: Ching-Chih Chen ed.: Global Digital Library Development in the New Millennium: Fertile Ground for Distributed Cross-Disciplinary Collaboration. Beijing: Tsinghua University Press.

Nicholson, D. (2002). Subject-based interoperability: issues from the High Level Thesaurus (HILT) project. Paper presented at 68th IFLA Council and General Conference, Aug. 18-24, 2002, Glasgow, UK. Available: http://www.ifla.org/IV/ifla68/papers/006-122e.pdf (Last accessed February 15, 2003.)

Nicholson, D. & Wake, S. (2003). HILT: subject retrieval in a distributed environment. In: I.C. McIlwaine ed.: Subject Retrieval in a Networked Environment, Proceedings of the IFLA Satellite Meeting held in Dublin, Ohio 14-16 August 2001. München: K.G. Saur. pp.61-67.

Niehoff, R., & Mack, G. (1985). The Vocabulary Switching System: Description and Evaluation Studies. International Classification, 12(1):2-6.

NKOS (Networked Knowledge Organization Systems). (2000). Taxonomy of Knowledge Organization Sources/Systems. Draft June 7, 2000 (revised July 31, 2000). Based on Gail Hodge, Systems of Knowledge Organization for Digital Libraries: Beyond Traditional Authority Files. CLIR Pub91. April 2000. Available: http://nkos.slis.kent.edu/KOS_taxonomy.htm (Last accessed February 15, 2003.)

NLM (National Library of Medicine). (2003). Fact Sheet: UMLS Metathesaurus. Last updated: 13 January 2003. Available: http://www.nlm.nih.gov/pubs/factsheets/umlsmeta.html (Last accessed February 15, 2003.)

NLM (National Library of Medicine). (2001). Fact Sheet: UMLS Semantic Network. Last updated: 14 February 2001. Available: http://www.nlm.nih.gov/pubs/factsheets/umlssemn.html (Last accessed February 15, 2003.)

Noy, N.F., & Musen, M.A. (2001). Anchor-PROMPT: Using non-local context for semantic matching. Workshop on Ontologies and Information Sharing at the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-2001), Seattle, WA, 2001. Available: http://smi-web.stanford.edu/pubs/SMI_Abstracts/SMI-2001-0889.html (Last accessed February 15, 2003.)

Olson, T. (2003). Integrating LCSH and MeSH in information systems. In: I.C. McIlwaine ed.: Subject Retrieval in a Networked Environment, Proceedings of the IFLA Satellite Meeting held in Dublin, Ohio 14-16 August 2001. München: K.G. Saur. pp.21-24.

Riesthuis, G.J.A. (2003). Information languages and multilingual access. In: I.C. McIlwaine ed.: Subject Retrieval in a Networked Environment, Proceedings of the IFLA Satellite Meeting held in Dublin, Ohio 14-16 August 2001. München: K.G. Saur. pp.11-17.

Schweitzer, A. (1995). Subject access to library materials in Canada: A balancing act between conformity and divergence. In: Subject Indexing: Principles and Practices in the 90's.

Scibor, E. & Tomasik-Beck, J. (1994). On the establishment of concordances between indexing languages of universal or interdisciplinary scope (Polish Experiences). Knowledge Organization 21(4):203-212.

Sparck Jones, K. (1981). Retrieval system tests 1958-1978. In: K. Sparck Jones ed.: Information Retrieval Experiment. London: Butterworths. pp.213-255.

Sparck Jones, K. (1997). History. In: K. Sparck Jones and P. Willett ed.: Readings in Information Retrieval, San Francisco, CA: Morgan Kaufmann Publishers, Inc., 1997. pp.9-14.

Taube, M. and Associates. (1953-1959). Studies in Coordinate Indexing. Washington, DC: Documentation Incorporated.

Thérond, D. (2000). European-Heritage.Net: The European Heritage Network. Cultivate Interactive, issue 2, 16 October 2000. Available: http://www.cultivate-int.org/issue2/herein/ (Last accessed February 15, 2003.)

VRA (Visual Resources Association). (2002). VRA Core Categories. Version 3.0. A project of the Visual Resources Association Data Standards Committee, last modified on 2/20/2002. Available: http://www.vraweb.org/vracore3.htm (Last accessed February 15, 2003.)

WebDewey website. (2002). Available: http://www.oclc.org/dewey/products/webdewey/about.htm (Last accessed February 15, 2003.)

WordNet 1.7.1 Database Statistics. (2002). In: WordNet 1.7.1 Reference Manual. Available: http://www.cogsci.princeton.edu/~wn/man1.7.1/wnstats.7WN.html (Last accessed February 15, 2003.)

WordNet Homepage. Cognitive Science Laboratory, Princeton University. Available: http://www.cogsci.princeton.edu/~wn/ (Last accessed February 15, 2003.)

Zeng, L. (1992). Achieving compatibility of indexing languages in online access environment. In: A. Kent ed.: Encyclopedia of Library and Information Science, vol. 50: 1-24. New York: Marcel Dekker, Inc.

Zeng, M.L., & Chen, Y. (2003). Features of an integrated thesaurus management and search system for the networked environment. In: I.C. McIlwaine ed.: Subject Retrieval in a Networked Environment, Proceedings of the IFLA Satellite Meeting held in Dublin, Ohio 14-16 August 2001. München: K.G. Saur. pp.122-128.