Personal Glossaries on the WWW

3. Detailed Background

[skip to section contents list]

  1. Extent and Focus of Review

    A search of the digital libraries at ACM and IEEE, as well as Citeseer and the HCI Bibliography Project found very few recent published work about human factors of electronic glossaries to support the use of technical or academic texts, or experiments with such tools. The most notable resources found were studies of how and why people annotate texts (e.g. works by O'Hara et al. [1998], Marshall [1998], and Jackson [2001]). We did however find several excellent articles about the development, and occasional testing, of such systems (e.g., [Kaindl et al., 1999] and [Brown & Brown, 2004]). We agree with the view that prototyping can be a good way to elucidate user needs however in this work we take a more basic approach: we are trying to determine what users need in a hypertextual glossary tool before we commit to build a full-scale one.

    In the rest of this review section we summarize what is known about the use of hypertext specifically for glossaries (not annotation in general) in discursive works.

  2. Some Human Factors of Hypertext

    Cognitive overhead can be defined as the added concentration and exertion required in order to maintain multiple tasks or trains of thought at a single time [Wright, 1991].

    It is believed that readers form a coherent mental representation of a text as they read, and that they use that so-called map to help them to comprehend and use the text [Charney, 1994; Dillon, 2000]. Since reading hypertext requires a higher cognitive overhead on readers reading than regular text it is especially important that nothing in the interface imposes more load without providing significant benefit [Blustein, 2000]. In her survey of applications of hypertext applications Charney [1994] notes that if relevant textual passages are not physically close then readers will be unlikely to make mental connections between them. Since glossaries can provide a way to bridge between two parts of a text they have the possibility to improve readers understanding of hypertextual documents. One would expect such bridging to be best achieved by having a glossary tool that was physically close to a current use of a term and include sufficient context to remind the user of the other locations where the term is used, based on Charney's observation. (For more speculation about glossaries as bridges and their relationship to open linking services you can jump to the hypertextual significance of this work section.)

  3. Hypertextual Glossaries

    Black et al. [1992] discuss several different styles of glossary tools. Their research involved thorough experimentation into the factors surrounding the usability and effectiveness of different styles of glossary tools. Varying styles of glossary entry presentation were also studied.

    Most of the tools studied by Black et al. [1992] employ the use of links embedded in the main text that the user is reading. The links are anchored to terms in the text that have corresponding entries in the glossary. Activating a link presents the corresponding definition. They call them integrated glossary links.

    1. Visual Marking

      Some form of visual marking was also typically used to identify which words in the text had associated entries [Black et al., 1992]. Wright [1991] states that simply making functionality available to a user is not enough unless the user is made explicitly aware of the presence of this functionality. This view is echoed by Norman [1998] who pontificates that the perceived functionality of a system is what is important to users, even though it may be very different from the system's actual functionality. For example when a user encounters a completely or partially unknown term that is defined in the glossary, then the user must be made aware, perhaps by highlighting the term in the text, that clicking on the term will provide the corresponding definition.

      According to Wright [1993] providing the user with the ability to access glossary entries is not always enough to be entirely useful. She expects that the functionality would be much more useful if it had visual support from within the display itself. Black et al. [1992] also mention the importance somehow marking the presence of terms that have entries in the glossary.

      The visual presentation of information affects users' consultation strategies when using a glossary tool. Readers' willingness to consult definitions was influenced by whether the definitions occluded the main text or left it visible [Black et al., 1992]. Users were more likely to consult the glossary tool when the text remained visible.

    2. Presentation Styles

      Black et al. [1992] also discuss varying presentation styles for glossary entries. Once accessed, a glossary entry may occlude the main text or it may be presented in the margins alongside the text. Alternatively, the entry may be presented to the user through auditory means. If the auditory presentation style is used, attempting to access a glossary entry would result in a sound file reciting the entry as opposed to the visual presentation of the same entry.

      An alternative to integrating glossary links into the main text is presenting a list of glossary terms separately from the main body of text. Terms in the text that is currently being viewed on the screen may be highlighted in the list to make their presence in the glossary more prominent to the user. The glossary entries can then be accessed using the list. This style of glossary is directed more at readers who wish to supplement their vocabulary either before or after reading, but do not want to interrupt their reading.

      The list style of glossary has certain advantages and disadvantages when contrasted against glossaries that employ the use of integrated glossary links. Bailey [2002] concluded that varying line lengths in online texts produce different reading speeds in users. While leaving the original text unaltered is an advantage of not having integrated glossary links, Wright [1991] found that the lack of these links produced a drop in users' willingness to access glossary entries. They speculated that this decrease in willingness was due to the increase in cognitive overhead involved .

      Despite this disadvantage, this style of glossary is preferable over other glossary styles in certain respects, including ease of implementation, as well as allowing users free access to glossary entries at any time as opposed to only when they encounter terms in the course of their reading the main text. Combining integrated glossary links with this style of glossary allows for individual differences in reading styles. As Wright [1993] states, any generalizations about users' styles of reading should be made with care.

      Individual user styles can vary depending on several factors. A user's level of expertise with a system usually has a major impact on how the user will interact with the system. Other simpler factors, such as level of fatigue and mood can also cause differences in user behavior. Because of differences in individual users, care should be taken to ensure that systems are not designed for a specific kind of user behaviour [Nielsen, 1995]. Dillon & Watson [1996, p. 633] reached a similar conclusion but note that even though there are differences amongst users that predict performance with interactive systems, appropriate design of the interface and/or training can reduce these differences.

    3. User and Task Factors

      While Black et al. [1992] state the willingness of user to access glossary entries when they encounter completely unknown terms increases when the glossary terms are visibly marked, they also note that users' willingness to look up partially known terms depends on whether the method of marking the glossary terms in the main text disrupts the users' flow of reading. It has already been established that signaling the presence of glossary terms is necessary in order to allow the glossary tool's potential to be employed more fully. Underlining these terms does not disrupt the flow of the main text as well as signaling to the users that a corresponding glossary entry exists for the underlined terms. Many readers may also be familiar with the convention of underlined hypertext links on the World Wide Web, and will therefore, be more comfortable with clicking on the glossary term to access its entry.

      Wright [1993] states that separating users' access to glossary terms from the main text (e.g. having the terms listed in another division of the window or in a separate window entirely) decreases their willingness to access the glossary entries. This drop in willingness was found to be the case even if the list of glossary terms and the main text were presented on the screen at the same time without occluding each other. This research supports the decision to integrate glossary links into the main text. In this way, users' willingness to access glossary entries will rely more heavily on their perceived level of familiarity with the terms they encounter and less heavily on the glossary tool's interface.

      Some readers may prefer to enhance their vocabulary with completely or partially unknown terms in the text either before or after they are done reading, but not during [Black et al., 1992]. If users' access to glossary entries is restricted to clicking on the integrated glossary links, users who prefer this style of reading will have to adapt to a different style of reading or will be discouraged from using the glossary tool at all. This adaptation process will cause an increase in cognitive overhead. For this reason, a list of glossary terms is provided in another division of the interface. Having a separate list of terms allows users to access entries without having to locate them in the main text. Integrating both styles of glossary term presentation caters to both styles of reading, so that neither type of reader is at a disadvantage.


References

References for works cited in this text chunk appear below. References for all works cited are available in a separate chunk.

[Bailey, 2002]
Bob Bailey. Optimal line length. UI Design Update Newsletter, November 2002. Downloaded on 18 June 2003.
<URL:http://www.humanfactors.com/downloads/nov02.asp#bobbailey>.
[Blustein, 2000]
James Blustein. Automatically generated hypertext versions of scholarly articles and their evaluation. In HT2K, pages 201 – 210, 2000.
<DOI:10.1145/336296.336364>.
[Carr et al., 1998]
L. A. Carr, W. Hall, and S. Hitchcock. Link Services of Link Agents? In HT'98, pages 113 – 122, 1998.
<DOI:10.1145/276627.276640>.
[Black , 1992]
A. Black, P. Wright, and K. Norman. Consulting on-line dictionary information while reading. Hypermedia, 4(3):145 – 169, 1992.
[Brown & Brown, 2004]
P. J. Brown and Heather Brown. Integrating Reading and Writing of Documents. Journal of Digital Information, 5(1), Article No. 237, 2004-02-03, 2004.
<URL:http://jodi.ecs.soton.ac.uk/Articles/v05/i01/Brown/>.
[Charney, 1994]
Davida Charney. The effect of hypertext on processes of reading and writing. In C. L. Selfe and S. Hilligoss (editors), Literacy and Computers: The Complications of Teaching and Learning with Technology (ISBN 0-873-52580-9), chapter 10 (pages 238 – 263). The Modern Language Association of America, New York, NY, 1994.
[Dillon & Watson, 1996]
Andrew Dillon and Charles Watson. User analysis in HCI—the historical lessons from individual differences research. International Journal of Human-Computer Studies, 45(6):619 – 637, 1996.
<DOI:10.1006/ijhc.1996.0071>
[Dillon, 2000]
Andrew Dillon. Spatial-Semantics: How Users Derive Shape from Information Space. Journal of the American Society for Information Science, 51(6):521 – 528, April, 2000.
[Golovchinsky & Marshall, 1998]
Gene Golovchinsky and Catherine C. Marshall. Hypertext interaction revisited. In HT2K, pages 171 – 179, 2000.
[Jackson, 2001]
H. J. Jackson. Marginalia: Readers Writing in Books. Yale University Press, 2001.
ISBN 0-300-08816-7.
[Kaindl et al., 1999]
Hermann Kaindl, Stefan Kramer, and Papa Samba Niang Diallo. Semiautomatic generation of glossary links: A practical solution. In HT'99, pages 3 – 12, 1999.
<DOI:10.1145/294469.294473>.
[Lewis & Reiman, 1994]
Clayton Lewis and John Reiman. Task-Centered User Interface Design: A Practical Introduction. 1994
<URL:ftp://ftp.cs.colorado.edu/pub/cs/distribs/clewis/HCI-Design-Book/> or <URL:http://www.hcibib.org/tcuid/>.
[Marshall, 1998]
Catherine C. Marshall. Annotation: from paper books to the digital library. In DL'98, pages 131 – 140, 1998.
<DOI:10.1145/263690.263806>.
[Marshall, 1999]
Catherine C. Marshall. Toward an ecology of hypertext annotation. In HT'98, pages 40 – 49, 1998.
<DOI:10.1145/276627.276632>.
[Marshall, n.d.]
Cathy Marshall. The Trouble With Scenarios. Tekka, 1(2), no date.
<URL:http://www.tekka.net/02/?Scenarios> [requires subscription].
[Nielsen, 1995]
Jakob Nielsen. Multimedia and Hypertext: The Internet and Beyond. Academic Press Limited, 1995.
ISBN 0-12-518408-5.
[Norman, 1998]
Donald A. Norman. The Psychology of Everyday Things. Basic Books, 1988.
ISBN 0-465-06709-3.
[O'Hara & Sellen, 1997]
Kenton O'Hara and Abigail Sellen. A Comparison of Reading Paper and On-Line Documents. In CHI'97, pages 335 – 342, 1997.
<DOI:10.1145/258549.258787>.
[O'Hara et al., 1998]
Kenton O'Hara, Fiona Smith, William Newman, and Abigail Sellen. Student Readers' Use of Library Documents: Implications for Library Technologies. In CHI'98, pages 233 – 240.
[Wright, 1991]
Patricia Wright. Cognitive overheads and prostheses: Some issues in evaluating hypertexts. In HT'91 pages 1 – 12. ACM Press, 1991.
<DOI:10.1145/122974.122975>.
[Wright, 1993]
Patricia Wright. To jump or not to jump: Strategy selection while reading electronic texts. In C. McKnight, A. Dillon, and J. Richardson (editors), Hypertext: A Psychological Perspective (ISBN 0-134-41643-0), chapter 6 (pages 137 – 152). Ellis Horwood, 1993.
[Non-authoritative link: <URL:http://telecaster.lboro.ac.uk/HaPP/chapter6.html>].

This document is written in valid XHTML 1.0 & This document makes use of cascading style sheets.

[Up to navigation links]