Quarterly Report for 2nd Quarter 1998 (June '98 - August '98)
Digital Libraries Initiative

University of Illinois at Urbana-Champaign
Federating Repositories of Scientific Literature

Bruce Schatz, PI, schatz@uiuc.edu
contact:
dli@uiuc.edu

SIGNIFICANT EVENTS

TESTBED

This quarter testbed research focused on automated delivery of testbed materials in multiple markup formats, on further enhancements of hypertext linking capabilities both within and external to the testbed, and on extending access to portions of the testbed to other university campuses.

Code was developed and tested for re-formating IEE article SGML contained in the testbed into HTML 4.0 and well-formed XML real-time at point of delivery over the web. Preliminary cascading style sheets supporting presentation of testbed IEE articles in Netscape Navigator 4.x and Microsoft Internet Explorer 4.x and 5.x were developed. Similar work was also done for a small corpus of sample SGML provided by the Naval Research Labatory (see also http://shiva.grainger.uiuc.edu/nrl/test.asp). In a separate follow-on project, this work will be extended and put into production use.

Code was developed and tested to insert forward and backward citation links into SGML and HTML versions of testbed articles at the time of request. Link data is extracted from metadata files (which are updated as new articles are added to the testbed) and inserted in appropriate locations in original source articles such that the reader can jump from article reference list to relevant abstract and indexing databases and/or full-text of relevant articles within the testbed. Previously these links were only available when viewing extended citations of testbed articles. Follow-on work will put this code into production. Code for discovering and including in metadata the links between testbed articles and AIP, APS, and UIUC Library abstract and indexing databases was also enhanced during this quarter, following up on work done previous quarter. A portion of this code (and associated database design) was ported to AIP systems as work to further develop the paralell repository at that site continued.

User logon and registration procedures were modified and streamlined in preparation for participation of other universities in the follow-on project. (Notre Dame University faculty and staff were given access to a subset of the testbed beginning in September.) Preparatory to making the testbed available at Notre Dame University, testbed personnel made a series of presentations on the Notre Dame campus in early summer (see also

http://magni.grainger.uiuc.edu/notredame/). Contacts were also made with several CIC campuses preparatory to making portions of the testbed collection available on their campuses.

Currently in DeLIver: http://dli.grainger.uiuc.edu/journals/

DeLIver usage statistics: http://dli.grainger.uiuc.edu/statistics/

EVALUATION

During the past quarter, the Social Science Team worked primarily on analyzing, integrating, and writing papers based on previous data collection activities. Team members continued their involvement in managing user registration, authentication, and transaction logging procedures for DeLIver. We analyzed results of the summative user survey that collected data covering both usability issues of DeLIver as well as how DeLIver fits into faculty and students work practices.

Use of DeLIver has continued growing and, in cooperation with the Testbed Team, we have worked on queries and reports to manipulate DeLIver registration and log data. Our final user survey was completed by 950 registered DeLIver users (for a response rate of 25%) and provided summative data on the extent and nature of testbed use, the level of user satisfaction with the system, and reasons for use and nonuse. Data from our exit poll were also analyzed. The exit poll asked users about their impressions of the system, and what they were doing with the system during the current session.

Ann Bishop continued her work with Barbara Buttenfield (Alexandria Digital Library) and Nancy Van House (Berkeley DLI project) to produce an edited volume called Digital Library Use: Social Practice in Design and Evaluation. Among the chapter contributors are Team members Leigh Star and Laura Neumann. The book will be published by MIT Press late in 1999.

Our team has continued its efforts to present to the public results of our research. The Social Science Teams web site has been carefully maintained. At the ACM DL 98 conference, Bishop presented her paper on article component use and organized a panel on DL design and evaluation. Her paper was expanded and will appear in the journal Information Processing and Management. She also produced a paper on measuring access, use, and success in digital libraries that appears in the Journal of Electronic Publishing. The entire team began working on a paper that summarizes and integrates our major findings and insights related to digital use and users.

RESEARCH

Semantic Interoperability

Type Tagging:

The UMLS semantic net is being explored as a source of type tagging information for medical information retrieval. The semantic net tend to have variable granularity and may not convey as much information as is available from other sources. Use of the Metathesarus relational information, specifically the parents of terms, may provide more information than is available in the semantic net. Of the 40 vocabularies incorporated into the Methathesaurus MeSH and SnowMed seem to have the greatest potential, they provide the most information with the least amount of noise. Identifying the MeSH and Snowmed parents of a term will allow the application of more fine grained tagging. For example, the chemical "Deet" will be type tagged as a pesticide, insecticide, organic chemical, amide, to name only a few of the parent tags available.

GNIS has been used to tag geographic information and the code modificiation to allow 1998 UMLS Specialist lexicon incorporation have been completed.

Noun Phraser:

The Noun Phrase optimization is complete. Several memory leaks were found and eliminated and the code is now fully scalable.

Interface Modifications:

Final work is being completed on the interface modifications. The algorithm for the AI Lab implementation of the 1998 UMLS metathesaurus is complete and code is being modified to allow its inclusion into the interface. A tabular interface, which seperates Noun Phrase, MeSH terms, and Authors, has been completed. The dynamic SOM has been altered to exclude the use of MeSH terms, which users found to general for this application. The new implementation of the dynamic SOM will include Noun Phrases only.

User Studies:

Results of the user study conducted with the help of the Arizona Cancer Center and the Arizona Health Sciences Library showed a definite preference for Noun Phrase and Automatic Indexing terms for searching. These two tools did significantly better than Metathesaurus relational terms from Internet Grateful Med, the Internet Grateful Med co-occurrence list (frequency based), MeSH mapped terms from OVID, and terms from the AI Lab implementation of the 1996 version of the UMLS Metathesaurus. Interestingly, users seemed to prefer terms that originated from co-occurrence tools. While terms from the Internet Grateful Med co-occurrence list did not do as well as the AI Lab Noun Phraser or Automatic indexing, they did much better that terms supplied by the relational tools.

Internet

IODyne has been redesigned to have fewer windows and to use the Z39.58 query syntax for the user query language. Users will edit all queries directly on the search document, using popup menus to select qualifiers instead of separate dialog windows. It is also being reimplemented as a stand-alone Java application, to enhance not only functionality and portability but also to use the Gazebo bibliographic gateway developed by the NCSA Emerge group.

PUBLICATIONS

Bishop, Ann P. "Digital Libraries and Knowledge Disaggregation: The Use of Journal Article Components." In Digital Libraries 98: The Third ACM Conference on Digital Libraries. New York: ACM, 1998, pp. 29-39.

Bishop, Ann P. "Mobilizing Information in Journal Articles." In Proceedings of the Annual Meeting of the American Society for Information Science. Medford, NJ: Information Today, 1998, pp.445-452.

Bishop, Ann P. "Logins and Bailouts: Measuring Access, Use, and Success in Digital Libraries." Journal of Electronic Publishing, 4 (2), December 1998. [http://www.press.umich.edu/jep/04-02/bishop.html]

Bishop, Ann P. "Document Structure and Digital Libraries: How Researchers Mobilize Information in Journal Articles." Information Processing and Management, in press.

Bishop, Ann P., Nancy Van House, and Barbara Buttenfield. (Eds.). Digital Library Use: Social Practice in Design and Evaluation. MIT Press, in production.

Bowker, Geoffrey C. and Susan Leigh Star. In press. "Invisible Mediators of Action: Classification and the Ubiquity of Standards," Mind, Culture and Activity.

Kling, Rob and Susan Leigh Star. 1998. “Human Centered Systems in the Perspective of Organizational and Social Informatics,” Computers and Society (March), 22-29.

Neumann, Laura J., and Ann P. Bishop. "From Usability to Use: Measuring Success of Testbeds in the Real World." In The 1998 Clinic on Library Applications of Data Processing: Collected papers. Champaign, IL: University of Illinois, Graduate School of Library and Information Science, in press. <June 15, 1998 draft available at:

http://anshar.grainger.uiuc.edu/dlisoc/socsci_site/dpc-paper-98.html>.

Neumann, Laura J., and Emily Ignacio. 1998. "Trial and Error as a Learning Strategy in System Use." In Proceedings of the Annual Meeting of the American Society for Information Science. Medford, NJ: Information Today, 1998, pp. 243-252.

Star, Susan Leigh, Geoffrey Bowker, and Laura Neumann, Submitted for publication. “Transparency beyond the Individual Level of Scale: Convergence between Information Artifacts and Communities of Practice." In Ann P. Bishop, Barbara P. Buttenfield, and Nancy Van House, eds. Digital Library Use: Social Practice in Design and Evaluation.

Star, Susan Leigh, in press, "Categories and Cognition: Material and Conceptual Aspects of Large-Scale Category Systems," In Sharon Derry and Morton Gernsbacher, eds. Problems and Promises of Interdisciplinary Collaboration: Perspectives from Cognitive Science. NJ: Erlbaum.

Star, Leigh Star and Anselm Strauss. 1999. “Layers of Silence, Arenas of Voice: The Ecology of Visible and Invisible Work”, Computer-Supported Cooperative Work: The Journal of Collaborative Computing, 8: 9-30.

PRESENTATIONS

Bishop, Ann P. "Hybrids and Assemblages: Understanding Digital Library Use." Panel presentation at Digital Libraries 98: The Third ACM Conference on Digital Libraries, Pittsburgh, PA, June 23-26, 1998.

Bishop, Ann P. "Mobilizing Information in Journal Articles." Paper presented at Digital Libraries 98: The Third ACM Conference on Digital Libraries, Pittsburgh, PA, June 23-26, 1998.

Chung, Yi-Ming. "Automatic Subject Indexing using an Associative Neural Network"

Digital Libraries '98: The Third ACM Conference on Digital Libraries, Pittsburg, PA, June 23-26, 1998.

Mischo, William and Timothy Cole. "The UIUC Testbed". Special Libraries Association, June 11, Urbana, IL

Pottenger, William and Dmitry Zelenko. "Automatic Methods for Determining the Semantic Difference Between Collections of Documents". Workshop on Digital Metrics, Pittsburgh, PA, June 27, 1998.

Schatz, Bruce. "High Performance Distributed Digital Libraries: Building the Interspace on the Grid", 7th IEEE Symposium on High Performance Distributed Computing, Chicago, IL, July 1998.

Star, Leigh S. "Categories and Cognition: Material and Conceptual Aspects of Large-Scale Category Systems," Invited Symposium Plenary, Cognitive Science Society, Madison, WI, July, 1998.

Visitors

July 15

Dmitry Ivanov, Altay State University Internet Center, Barnaul, Russia

Sergei Ossipov, Ural State University Internet Center, Ekaterinburg, Russia

Olga Shaposhnikova, Nizhny Novgorod State University Internet Center, Nizhny Novgorod, Russia

Alexander Fronkin, Novgorod State University Internet Center, Novgorod, Russia

Alexey Leontyer, Petrozavodsk State University Internet Center, Petrozavodzk, Russia

Rustam Usmanov, St. Petersburg State Technical University Internet Center, St. Petersburg, Russia

Konstantin Belov, Tver State University Internet Center, Tver, Russia

Irina Krupenikova, Tver Regional Library A.M. Gorky, Tver, Russia

July 20

Norihiko Uda, University of Library and Information Science Tsukuba Science City, Japan

Hitomi Saito, University of Library and Information Science Tsukuba Science City, Japan

Chinatsu Ito, University of Library and Information Science Tsukuba Science City, Japan