"KNOWLEDGE ACCESS MANAGEMENT:
TOOLS AND CONCEPTS FOR NEXT-GENERATION CATALOGERS"

Report from the OCLC Seminar

On Nov. 17-19, 1997 I attended the second of OCLC's seminars entitled "Knowledge
Access Management: Tools and Concepts for Next-Generation Catalogers" (http://www.oclc.org/institute/seminar2.htm). Although the title sounds general, the seminar focused on the intersection of cataloging, metadata, and the Internet. There were 22 of us, many from academic libraries, both large and small, but also some from a network, a large public library, a consulting firm, a special library, and a library science faculty.

There were 3 broad themes to the seminar: libraries and the Internet, issues in the cataloging of Internet resources, and the role of different kinds of "metadata" in the online environment. I have tried to highlight below some of the major points made for each theme.

LIBRARIES AND THE INTERNET

This session was an argument that the phenomenal growth of the World Wide Web in the last year or so has brought with it many of the scholarly resources that libraries traditionally collect, and thus libraries will have to "collect" and catalog these new Web-based versions of these resources in the future.

This theme was ended by a prediction that any OPAC vendor who does not have a Web catalog interface by the end of this year will be out of business.

ISSUES IN THE CATALOGING OF INTERNET RESOURCES

These sessions were conducted by Ann Sandberg-Fox who has been involved in both national and international discussions of bibliographic description of electronic resources.

We went through every cataloging rule and MARC field used in cataloging Internet resources discussing difficulties and possible solutions all along the way. Basic bibliographic concepts are hard to apply to Internet resources (an example she used that illustrates many of the issues discussed below is http://www.oc.ca.gov/):

METADATA

Metadata is a term being used to refer to any data about other data. Catalog records are currently libraries' most familiar form of metadata but this is an area of great interest, excitement and experimentation and other forms of metadata are emerging to serve different needs.

First, the impetus is not coming from libraries, who have always recognized the importance of metadata by creating catalog records. The impetus is coming from the WWW community: the Web browsers, the search engines, and the Internet service providers.

We all know what searches on the Web are like! A recent and very good metadata survey article by Warwick Cathro of the National Library of Australia (http://www.nla.gov.au/nla/staffpaper/cathro3.html) described a Web search on the acronym IETF (Internet Engineering Task Force). Admittedly, this is an Internet body and will have a lot of hits, but, still, it's not as general as searching for a common subject word like "environment." Nevertheless, the "IETF" search retrieved 896,354 matches! Cathro went on to explain that metadata would allow search engines to target searches onto words or phrases that identify their correct role, e.g., "green" as a personal name vs. "green" as a subject. This sounds elementary and it is, but it is still more than many search engines can do.

In March, 1995, the first Metadata Workshop was convened at OCLC, and included researchers and professionals from librarianship, computer science, text encoding, and related areas, to come up with a set of core metadata elements to describe networked resources. The result was called the Dublin Core, "Dublin" because of OCLC's location in Dublin, Ohio, and "Core" because this was meant to have the minimal number of elements required to describe an electronic resource adequately.

Since then, there have been four more Dublin Core conferences with these results:

Now that the DC has become standardized, various "crosswalks" are being developed between DC and other types of metadata, especially the MARC record (http://lcweb.loc.gov/marc/dccross.html). However, there are still many questions whose answers are speculative:

In the long run, from people with a vested interest in better access to Web items, e.g., Website creators, database creators, catalogers, etc. There is also software already available for creating DC records for existing Websites (for fun with this try DC Dot: http://www.ukoln.ac.uk/metadata/dcdot/).

6 months, as a conservative estimate. For a recent report: http://www.zdnet.com/pcweek/news/1103/03rdf.html

CONCLUSION

The MARC record, as old and imperfect as it is to us, is viewed by others as "very rich metadata," with all of its coding and authority control. No one expects the DC to replace the MARC record although it is likely that catalog records and other types of metadata will merge eventually. Right now, there is room for all kinds of metadata. We may find ourselves using several kinds in one catalog: the MARC record for the most important and scholarly resources, with, perhaps, the DC becoming a new minimal level format.

The seminar opened with some memorable quotes from library history, this one from Cutter in 1904: "I cannot help thinking that the golden age of cataloging is over, and the difficulties and discussions which have furnished an innocent pleasure to so many will interest them no more." If Cutter were here to see us at this new stage of catalog development I think he would be pleased.

Celine Noel

Catalog Dept.

Davis Library

1-29-98