Personal Glossaries on the WWW

6. Hypertextual significance of this work


  1. Glossaries as Annotation

    We consider glossaries to be a special form of annotation. We believe that annotation is one way in which readers can, in a sense, become joint authors of a new work whose meaning depends, more than usual, on the reader. We have more to say about the similarities and differences between glossaries and annotations in the sidebar which accompanies the main text.

  2. Glossary Types

    In the text below, we use published rather than shared to avoid confusion with the well-established hypertext systems concept [Garg & Scacchi, 1987].

    Although annotations can be shared with others (e.g., Blustein's [1996] list of annotations for a textbook) and are sometimes printed in special editions (e.g., The Annotated Alice [Carroll, 1999]), they are a different class of writing. We term such annotations published, in the sense that they have been put into a form that is intended for other readers to understand.

    Saul's The Doubter's Companion [1995] is an example of a published glossary.
    It is a collection of brief essays about various topics with short titles arranged alphabetically, in the style of Ambrose Bierce's famous Devil's Dictionary. Some of Saul's essays occupy several pages of the printed book.

    On the other hand, we use the adjective personal to describe annotations which are more like the jottings of a moment and are not formally arranged for presentation to other readers.

    1. Tied Versus Floating Glossaries

      We consider two types of personal glossary using examples from traditional media:

      • those that are tied to a single document such as notes on the flyleaves or in the margins of books; and
      • those that are available for use in any document (which we term floating) such as in a commonplace book (Miller, as cited by Bernstein [2004], discusses the needs and implications of such devices).

      Tied personal glossaries, like the sort we report on in this article, may be most useful for textbooks and similar works. We can imagine a future where electronic books come equipped with glossaries with many entries drawn from the text, much as some printed books do today. The primary difference between our imagined future and today's books is that in the future all of the glossaries will be user-editable (and, we hope, compatible with each other, so that a student can amass a grand floating glossary and compare different definitions of the same term).

      Some analogies may help to make the differences clearer. The page in a student's paper notebook where the student records definitions of unfamiliar terms encountered in many books and classes is a form of floating glossary. If the student does not intend to allow other pupils (or even the teacher) to use the glossary, then it is a personal glossary. Similarly, if the teacher provides students with a list of terms and definitions to which the student adds (new terms, definitions, or both) but intends the new or updated entries to be for the student's use only, then it is also a personal glossary. The flyleaf of a textbook in which the same student records definitions that pertain only to that book or class is tied because the glossary is connected to a single document. A glossary function provided by software (e.g., a website or electronic textbook) without the ability to be exported to other programs is similarly tied.
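
      The idea of amassing a floating glossary from several tied ones, and of comparing different definitions of the same term, can be sketched in a few lines of Python. This is only an illustration under our own assumptions, not a description of the prototypes: the function names amass and conflicts, the dictionary layout, and the sample book titles and definitions are all hypothetical.

```python
from collections import defaultdict


def amass(tied_glossaries: dict[str, dict[str, str]]) -> dict[str, dict[str, str]]:
    """Combine per-document (tied) glossaries into one floating glossary.

    The input maps a document title to that document's {term: definition}.
    The output maps each term (lower-cased) to {document title: definition},
    so every definition remembers which document it came from.
    """
    floating: dict[str, dict[str, str]] = defaultdict(dict)
    for document, entries in tied_glossaries.items():
        for term, definition in entries.items():
            floating[term.lower()][document] = definition
    return dict(floating)


def conflicts(floating: dict[str, dict[str, str]]) -> dict[str, dict[str, str]]:
    """Return only the terms whose source documents give differing definitions."""
    return {
        term: by_document
        for term, by_document in floating.items()
        if len(set(by_document.values())) > 1
    }


# Hypothetical sample data: two tied glossaries kept by the same student.
tied = {
    "Textbook A": {"Link": "a pointer from one node to another"},
    "Textbook B": {"link": "a typed relation between two anchors",
                   "node": "a unit of text"},
}
grand_glossary = amass(tied)
for term, by_document in conflicts(grand_glossary).items():
    print(term, by_document)
```

      Keeping the source document alongside each definition is a design choice of this sketch: it lets the merged glossary be compared term by term without discarding the context in which each definition was recorded.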

    2. Shared Versus Personal (or Corporate) Glossaries

      The preceding examples have dealt only with personal glossaries. Shared glossaries are intended for users who might not be the authors of the entries. As such, we can consider traditional printed dictionaries to be a form of glossary. Furthermore, if a wiki may be considered to be a glossary (because its structure is of terms and definitions) then it would be a floating one, since it could be accessible to someone who was reading another document, a webpage for example. It could be a shared glossary if and only if there were no restrictions on who could read or update the entries.

    3. Summary of Types

      Differences between glossary types are summarized in the table below.

      Classification by flexibility of location
        tied
          only available for use in one document
        floating
          available for use in many documents
      Classification by user
        personal
          • intended for use only by one person or community
          • update-able personal glossaries are only for use by the persons who edit the definitions
        shared
          • available for use by anyone
          • suggests a single glossary (possibly implemented using a distributed system); cf. published
      Classification by author type
        published
          • suggests a single authoritative source of the glossary, copies of which users could modify (e.g., users can modify their own copies of printed dictionaries without altering all extant copies)
          • in a form that is intended for readers (other than the original author, or authors) to understand
        unpublished
          suggests a work which has not reached (and may never reach) the stage where it can be considered to be in a stable version and thus suitable for publication (see published, immediately above)
      Classification by interactivity
        Note: We do not distinguish here between glossaries whose entries can be edited, added, or deleted by only a proper subset of users and glossaries that allow changes by any of their users.
        static
          the entries cannot be changed by the user
        editable / update-able
          users may
          • add new entries,
          • delete existing entries, and
          • update existing entries (with new definitions, changes in definitions, and cross-references)
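
      The four classifications above are independent of one another, so one way to picture them is as four separate attributes of a single glossary record. The following Python sketch is our own illustration, not the data model of the prototypes; all class, field, and method names are hypothetical, and the classification given to the flyleaf example is our reading of the descriptions in this section.

```python
from dataclasses import dataclass, field
from enum import Enum


class Location(Enum):
    TIED = "tied"            # only available for use in one document
    FLOATING = "floating"    # available for use in many documents


class Audience(Enum):
    PERSONAL = "personal"    # one person or community
    SHARED = "shared"        # anyone


class Status(Enum):
    PUBLISHED = "published"      # in a form intended for other readers
    UNPUBLISHED = "unpublished"  # not (yet) in a stable version


class Interactivity(Enum):
    STATIC = "static"        # entries cannot be changed by the user
    EDITABLE = "editable"    # users may add, delete, and update entries


@dataclass
class Glossary:
    location: Location
    audience: Audience
    status: Status
    interactivity: Interactivity
    entries: dict[str, str] = field(default_factory=dict)

    def _require_editable(self) -> None:
        if self.interactivity is Interactivity.STATIC:
            raise PermissionError("a static glossary cannot be changed by the user")

    def update(self, term: str, definition: str) -> None:
        """Add a new entry or replace the definition of an existing one."""
        self._require_editable()
        self.entries[term] = definition

    def delete(self, term: str) -> None:
        """Remove an existing entry."""
        self._require_editable()
        del self.entries[term]


# Notes on the flyleaf of a textbook, as in the analogy earlier in this section.
flyleaf_notes = Glossary(Location.TIED, Audience.PERSONAL,
                         Status.UNPUBLISHED, Interactivity.EDITABLE)
flyleaf_notes.update("codex", "a traditional bound book")
```

      As in the note on interactivity, the sketch does not distinguish between glossaries that only some users may change and glossaries that any user may change.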

  3. Implications

    If we are correct in our supposition that personal glossaries can help individuals (and co-operating groups of people) to make sense of texts, but are unsuited for sharing with others, then there is a great opportunity for such tools in ebooks and web browsers. The results of our experiment indicate that our prototypes are good models for tied versions of such products. There is clearly much need for further research and product development in this area.

    1. Human Factors

      1. Floating versus tied glossaries

        The main difference between floating and tied glossaries, other than the obvious implementation details, is that the definitions of terms in a tied glossary are not likely to change within the work that the glossary is tied to [Furnas et al., 1987]. We expect that the use of floating glossaries will lead to users encountering conflicting definitions of terms in different documents from within the same community of discourse (as illustrated in the example of Part of Bob's Glossary in §2) and of boundary objects in largely unrelated communities [Muller & Friedman, 2000].

        The floating tools will clearly be more powerful but they may, as a consequence, require users to exercise restraint in their use lest they become unmanageable: think of how few of the infinity of possible hypertextual structures are feasible for discursive websites now that the WWW has settled into genres. The tactics promoted by Brown & Brown [2004] (namely, cultivating a community of practice and developing the techniques as the sophistication and needs of the users grow) might be an appropriate starting point for the development of a floating glossary tool. So-called lowercase semantic web efforts [Çelik & Marks, 2004] are using similar techniques with apparent success.

        (The XHTML Friends Network (XFN) is an example of a lowercase semantic web effort.)

        There are, however, more striking differences between shared and personal glossaries.

      2. Shared versus personal glossaries

        We contend that, as with shared annotation, making shared glossaries useful is more a matter of human factors than of technological sophistication. This view is supported by previous research about glossaries (see Literature Review section). Also, Blustein [2000] found that user interface factors had a greater impact on users' success with a hypertext linking system than the accuracy of the links.

        As people read and make notes they are attempting to make sense of a text, and this sense-making process alters their interpretation of the text [Dillon, 1994; McKendree et al., 1995; Tague-Sutcliffe, 1995, pp. 8–9, 12–13]. Personal annotations, of which glossary entries are a distinguished type, are the most obvious remnant of the cognitive state that brought them forth. Because it is difficult to reconstruct the meaning of such entries without access to the context in which they were created, glossaries that are not carefully and specifically constructed become increasingly difficult for anyone but their author (or compiler) to understand as the number of entries grows.

        We speculate that personal glossaries are more suitable than shared glossaries for many tasks. It is well known that readers rarely agree on where links should begin and end within a single document [Furner et al., 1999; Blustein, 2000]. It is likely that this is due to the way people cognitively process documents [McKendree et al., 1995], and for the same reasons we expect personal annotation to vary amongst individuals. Glossaries in particular are, we believe, an artifact of the reader's cognitive state at the time the definition was recorded. We know that the processes of reading and searching for information change users' cognitive state [Tague-Sutcliffe, 1995]. Just as we would not expect a student who had not read a textbook to be able to make sense of a random passage in that book out of context, we would not necessarily expect anyone to understand someone else's personal (that is to say, informal) glossary entries.

        These differences become clearer in practice. Marshall & Brush [2004] conducted a study comparing personal annotations on paper with shared annotations in a computer network-based forum. They have found … that annotators make profound changes to annotations that they share [Marshall & Brush, 2004, p. 356].

    2. Glossaries as Bridges

      A glossary maintained by a reader in a traditional (codex) text may be kept on a loose piece of paper that the reader moves from page to page as they read; indeed this is often done with mathematical texts when notation is unfamiliar to the reader. If such a glossary includes notes about the relevance of terms and concepts and not merely definitions then it can act as a type of bridge to connect (in the reader's mind) disparate parts of the text. The bridge concept is more similar to an open-linking service [Carr et al., 1998] than to a user-created hypertext link, because the bridge is not explicitly recorded as being between two parts of the text. Since glossaries can provide a way to help readers to make mental associations between two parts of a text, they have the potential to improve readers' understanding of hypertextual documents. The success of such a technique, however, may depend on the coherence of the reader's model of the text, and a discussion of the psychological factors that are believed to be involved is beyond the scope of this article.
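
      The contrast with a user-created link can be made concrete with a small Python sketch. It is an illustration under our own assumptions, not a description of the prototypes or of any open-linking service: the function name bridges, the section labels, and the sample text are hypothetical. Nothing is stored about which parts of the text a term connects; the connections are computed from the glossary and the text whenever they are wanted.

```python
import re


def bridges(sections: dict[str, str], glossary: dict[str, str]) -> dict[str, list[str]]:
    """For each glossary term, list the sections in which the term occurs.

    Matching is whole-word and case-insensitive. A term found in fewer than
    two sections connects nothing and is left out of the result.
    """
    found: dict[str, list[str]] = {}
    for term in glossary:
        pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
        where = [name for name, text in sections.items() if pattern.search(text)]
        if len(where) >= 2:
            found[term] = where
    return found


# Hypothetical sample: a reader's notes on unfamiliar notation in a mathematical text.
sections = {
    "1": "A norm measures the size of a vector.",
    "2": "Matrices are introduced here.",
    "5": "The operator norm bounds the effect of a linear map.",
}
glossary = {
    "norm": "size of a vector; needed again for operators",
    "matrices": "rectangular arrays of numbers",
}
print(bridges(sections, glossary))
```

      Because the glossary holds only terms and notes, the same entry continues to act as a bridge if the text is revised or if the glossary is carried to another document.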


References

References for works cited in this text chunk appear below. References for all works cited are available in a separate chunk.

[Bernstein, 2004]
Mark Bernstein. Daybook: High Tech and Low Tech. [blog entry for 05 March 2004]
Note: Bernstein quotes Doug Miller.
<URL:http://markBernstein.org/Mar0401/DaybookHighTechandLowTech.html>
[Blustein, 1996]
James Blustein. Annotations on K&R II. [webpage] Created 20 January 1996. Current version 04 November 2001.
<URL:http://www.csd.uwo.ca/~jamie/C/KR/annotations.html>
[Blustein, 2000]
James Blustein. Automatically generated hypertext versions of scholarly articles and their evaluation. In HT2K, pages 201 – 210, 2000.
<DOI:10.1145/336296.336364>.
[Brown & Brown, 2004]
P. J. Brown and Heather Brown. Integrating Reading and Writing of Documents. Journal of Digital Information, 5(1), Article No. 237, 2004-02-03, 2004.
<URL:http://jodi.ecs.soton.ac.uk/Articles/v05/i01/Brown/>.
[Carr et al., 1998]
L. A. Carr, W. Hall, and S. Hitchcock. Link Services or Link Agents? In HT'98, pages 113 – 122, 1998.
<DOI:10.1145/276627.276640>.
[Carroll, 1999]
Lewis Carroll (author) & Martin Gardner (annotator). The Annotated Alice: The Definitive Edition. W. W. Norton & Co., 1999.
ISBN 0-393-04847-0.
[Çelik & Marks, 2004]
Tantek Çelik and Kevin Marks. Real World Semantics. [participant session at conference] O'Reilly Emerging Technology Conference 2004. 9 – 12 February 2004, San Diego, CA.
Presentation at <URL:http://tantek.com/presentations/2004etech/realworldsemanticspres.html> [last retrieved on 08 September 2004]
[Dillon, 1994]
Andrew Dillon. Designing Usable Electronic Text: Ergonomic Aspects of Human Information Usage. Taylor & Francis, 1994. ISBN 0-7484-0112-1 (cloth) / 0-7484-0113-X (paper).
[Furner et al., 1999]
Jonathan Furner, David Ellis, and Peter Willett. Inter-linker consistency in the manual construction of hypertext documents. ACM Computing Surveys, 31(4es), December 1999.
<DOI:10.1145/345966.346008> and <URL:http://www.cs.brown.edu/memex/ACM_HypertextTestbed/papers/44.html>.
[Furnas et al., 1987]
G. W. Furnas, T. K. Landauer, L. M. Gomez and S. T. Dumais. The Vocabulary Problem in Human-System Communication. Communications of the ACM, 30(11):964 – 971, November 1987.
<DOI:10.1145/32206.32212>.
[Garg & Scacchi, 1987]
Pankaj K. Garg and Walt Scacchi. On designing intelligent hypertext systems for information management in software engineering. In HT'87, pages 409 – 432, 1987.
[Marshall & Brush, 2004]
Catherine C. Marshall and A. J. Bernheim Brush. Exploring the relationship between personal and public annotations. In JCDL'04, pages 349 – 357, 2004.
<DOI:10.1145/996350.996432>.
[McFedries, 2004]
Paul McFedries. Words About Words - John Ralston Saul. In The Word Spy [website]. Copyright © 1995-2004 Paul McFedries and Logophilia Limited. Retrieved: 14 September 2004.
<URL:http://www.wordspy.com/waw/Saul-JohnRalston.asp>.
This website lists some brief excerpts from Saul's [1995] The Doubter's Companion, for example the ironically self-referential definition of a dictionary as “Opinion presented as truth in alphabetical order.”
[McKendree et al., 1995]
Jean McKendree, Will Reader, and Nick Hammond. The “Homeopathic Fallacy” in Learning from Hypertext. interactions, ii(3), July, 1995.
<DOI:10.1145/208666.208687>.
[McKnight et al., 1991]
Cliff McKnight, Andrew Dillon, and John Richardson. Navigation Through Complex Information Spaces. In Cliff McKnight, Andrew Dillon, and John Richardson (editors). Hypertext in Context, (ISBN 0-521-37488-X) Chapter 4. Cambridge University Press, 1991.
[Non-authoritative link <URL:http://telecaster.lboro.ac.uk/HiC/chapter4.html>].
[Muller & Friedman, 2000]
Michael J. Muller and Jessica Friedman. Electronic communities: places and spaces, contents and boundaries. [workshop session] In CHI2K Extended Abstracts Pages 373 – 373, 2000.
<DOI:10.1145/633292.633520>.
[Saul, 1995]
John Ralston Saul. The Doubter's Companion. Penguin Books, 1995.
ISBN 0-14-023707-0.
The Word Spy website [McFedries, 2004] lists some brief excerpts from Saul's book.
[Tague-Sutcliffe, 1995]
Jean Tague-Sutcliffe. Measuring Information: An Information Services Perspective. Academic Press, 1995.
ISBN 0-12-682660-9.
