Dictionary Design: February 2009

1. The typographic repertoire

Book design is a utilitarian activity. It is intended to serve the reader by making the structure of the author’s text clear in a visual form, and also by making the book pleasant to handle and durable. My approach to discussing the design of dictionaries is to consider how the surface appearance of the page typography relates to the underlying structure of the author’s text. So first of all it is necessary to briefly define this awkward term typography. It has been used to describe a particular printing process (letterpress); in many languages it still just means printing; and students think of it as the art of laying out printed matter. None of these definitions express the role of visual organization in making the meaning of written language clear. It is not simply a matter of the arrangement of blocks of words, or the choice of particular typefaces to set them in, but an interactive process where rational visual choices are determined by inherent aspects of the author’s text, the expected reader, the kind of book being produced, and the production technology. If this stresses the collaborative nature of typography, then that is a good thing. Typographic design is something that grows out of a series of decisions about a text, and good typography can rarely be painted on to a text after the event. So basing its terminology on that of Twyman (1979), the working definition of typography for this paper will be ‘the rational organization of verbal graphic language’.

Tools for typographic analysis
One way of looking critically at a piece of design is to consider the genre it belongs to. Waller (1987) has given us tools for discussing typographic genre, and sees a genre as the interplay between the particular requirements of an author’s text (the ‘topic structure’), the physical format (the ‘artefact structure’), and the means by which readers gain access to the information (the ‘access structure’). This approach has the advantage of placing the reader clearly in the frame. Typographers also consider a design by looking at macro-typographic issues and micro-typographic issues. These can be categorized at those which chunk and arrange information at a level for general access (macro), and those which provide close-grain differentiation (or harmony) of elements within those larger structures (micro). Aspects of macro-typography tend to relate to document and page layout, and micro-typography to what happens within a paragraph or within a line. A further tool for typographic analysis is to consider the configuration of written language: continuous prose, headings, lists, tables, algorithms, etc. (Twyman, 1979). Recognizing typographic configurations is something we all learn at an early age, and typographic genres gain much of their effect by exploiting this recognition of configurations.

Using Waller’s model, we can describe the topic structure for dictionaries as generally an alphabetic sequence of entries with further levels of information nested within them. The configuration that this involves is usually a list structure with various nested sub-lists. The artefact structure is generally a relatively compact book with a multi-column structure. The access structure is provided at the macro-typographic level by the alphabetic sequence and page elements (alphabetic sections, headlines, headwords) which assist overall navigation and the location of entries, and at the micro-typographic level by the differentiation of elements (etymologies, cited forms, sense numbers) within an entry.

Dictionaries are by their nature structure-rich, and therefore require a potentially complex typography. Hanks (2003) has described dictionaries as ‘more highly structured than any other piece of text’. Because dictionaries have developed a repertoire of typographic presentations, it is possible to describe how these map on to commonly-found structural elements, at least at in general terms. Svensén (1993) and Jackson (2002) both list the font variants currently considered normal for standard elements. Advertising provides evidence that dictionaries constitute a clear typographic genre; it calls on the typographic forms associated with dictionaries to imply an authoritative standard for a product. A recent UK press advertisement for a Honda car ‘defined’ the vehicle’s qualities, with ‘headwords’, ‘phonetics’, and ‘sense numbers’ set in typefaces similar to those used in Oxford dictionaries.

So what are the resources a typographer can call on? He can vary the font for a particular element, so that it contrasts more or less with its surroundings, he can alter sizes, and sometimes colours. These features can be described as ‘intrinsic’. ‘Extrinsic’ features that can be used are mainly to do with the deployment of space: entries can be divided into sub-paragraphs, various forms of indentation can be used (Twyman, 1986). While a hierarchical relationship between elements can be expressed by either intrinsic features or extrinsic features alone, a combination of features is the most normal approach. Each typographic effect is reinforced by another. This reflects the important role of redundancy in written language, which has been usefully illustrated by Garland (1966: 16) [Figure 1].

Figure 1: The usefulness of redundancy in graphic language (after Garland)

Allocating a typographic format (such as a particular font variant) to give emphasis to a particular word is described by Moxon, in his seventeenth-century manual of printing:

And what Words of Emphasis come in that precedent Matter; that he [the compositor] may Set them either in Capitals, Roman, Italick, or English [black letter]. (Moxon, 1683/4: §22 ¶5)

But Moxon seems to be describing a rather ad-hoc process, and he is after all only discussing the preliminary pages of a book. The systematic mapping of typographic formats to text features, and thereby to structural elements, to differentiate them for the reader is now so normal, enforced by the rules of house style manuals (Walker, 2001), that it may need to be illustrated by an eccentric exception. It is now a generally accepted convention to distinguish book titles by italicization, so we are taken aback when we read George Bernard Shaw railing against the advice given by an early edition of Hart’s Rules:

This is deplorable. To the good printer the occurrence of two different founts on the same page is at best an unavoidable evil. … Not only should titles not be printed in italic; but the customary ugly and unnecessary inverted commas should be abolished. Let me give a specimen. 1. I was reading The Merchant of Venice. 2. I was reading “The Merchant of Venice”. 3. I was reading The Merchant of Venice. The man who cannot see that No. 1 is the best looking as well as the sufficient and sensible form, should print or write nothing but advertisements of lost dogs or ironmongers’ catalogues: literature is not for him to meddle with. (Shaw 1902)

Fortunately Hart’s prevailed in this matter. Shaw, of course, was writing at a time when an even-textured page (promoted by William Morris) was becoming the desired end of typography, and when it was felt that this could really only be achieved by returning to the closest English printing had to ‘original’ type, namely Caslon. Caslon, true to its eighteenth-century origins, had only a single weight. By using Caslon, you achieved uniformity of texture, because you could not use bold; roman, italic, and small capital fonts existed, but not bold. In general, dictionary typography has progressed towards more differentiation, not less.

2. Looking at the past

Approaching the history of dictionaries from the standpoint of typographic design is not a well-trodden path. (Hancher (1996, 1998), who considers the graphic and cultural qualities of nineteenth-century dictionaries, is an exception.) Perhaps this is because of the utilitarian nature of dictionaries, which are books with active readers who have a problem to solve, rather than contemplative readers who have an interest in the aesthetics of the book as an object. Typography develops over time, and the range of effects available in earlier centuries differed from that available today. So the questions a typographer might pose of a dictionary are: to what degree does the typographic surface map the underlying structure in a way that is logical, consistent, and revealing to the reader? To what extent does it use (or even expand) the repertoire of typographic effects for this kind of text?

Robert Estienne
The foundations of dictionary typography were laid by the Parisian scholar-printer Robert Estienne (1498–1559), who printed the Thesaurus Linguae Latina (1531) and the Dictionnaire François–Latin (1539) [Figure 2]. Estienne’s spatial organization of the page was exemplary, precisely because he mapped visual appearance to structural significance: he differentiated headwords by setting them on separate lines, in a larger point-size, indented; definitions were set full out, and examples of usage indented one em. Significantly, he introduced the differentiation of language by typeface, using italic type for French words and roman type for Latin ones (Twyman 1982). In other words, he used paragraphing and changes in type size to assist navigation, and changes of typeface to indicate structural elements within a paragraph. Estienne’s preface to the Thesaurus includes the following comment on its design, which holds good for the Dictionnaire:

Here the different senses of words have not been indiscriminately piled into one confused heap, but distinguished by making each one start on a new line throughout, even in cases of twenty or more variant meanings. Here, as a result of this elegant differentiation in sequential form, the variety not only of constructions but also of expressions has been noted, by the observation of a regular manner of expression in authoritative writers. (Brandon, 1904: 42 n3; passage translated by Leofranc Holford-Strevens)

Figure 2: Estienne, Dictionnaire François–Latin (1539)

Estienne’s achievement looks even more significant when compared to the edition of Ortus Vocabulorum (1509) printed in London by Pynson some thirty years earlier, which has only one typeface for both Latin and English text, and no indentation to articulate entries  [Figure 3]. Subsequent printed English dictionaries were slow to aspire to such accessible typography, and only gradually adopted different typefaces and indentation systems.

Figure 3: Pynson, Ortus Vocabulorum (1509)

Samuel Johnson
Relatively undeveloped typography might have been suitable when English dictionaries were little more than lists of hard words, but the development of lexicographical practice in the eighteenth century required a more considered approach. Bailey’s Dictionarium Britannicum (1730) entries contained elements and typographic presentations familiar today, such as etymologies set off in square brackets and cited forms in italic [Figure 4].

Figure 4: Bailey, Dictionarium Britannicum (1730)

Martin’s Lingua Britannica Reformata (1749) introduced numbered senses, and the appearance of his page was surprisingly modern [Figure 5].

Figure 5: Martin, Lingua Britannica Reformata (1749)

But it was Johnson who added significantly to the complexity by using not only numbered senses but also illustrative quotations. Johnson’s printer had limited typographic resources: the Caslon-like typeface that he used had only roman, italic, and roman small cap variant fonts. A comparison of folio, octavo, and quarto editions of Johnson’s Dictionary shows how much more effective it is to subdivide entries into paragraphs for senses and quotations, rather than to attempt to run elements on [Figures 6–9]. On the folio page (1755), the status of the illustrative quotations is demonstrated by their spatial arrangement, and the ranged-right italic sources provide an alternative means of access to the entries. On the quarto page (1777), senses are run on, and quotations are divorced from the senses they illustrate. The abridged London octavos (1758a, 1760) manage to provide some articulation through paragraphing; the Dublin octavo (1758b) does not.

Figure 6: Johnson, Dictionary (folio 1755)

Figure 7: Johnson, Dictionary (quarto 1777)

>Figure 8: Johnson, Dictionary (London octavo 1760)

Figure 9: Johnson, Dictionary (Dublin octavo 1758)

Figure 10: Murray, OED

James Murray and the OED
Mastery of a complex page with many fonts and paragraphed senses was achieved by Murray in the OED [Figure 10]. The OED used a particularly effective combination of typefaces. The underlying face was an Old Style, effectively a much-regularized version of Caslon. Its even and relatively light overall density on the page mean that two weights of bold-face type could be used to contrast with it. Murray also had a more complex system of illustrative quotations to contend with than Johnson – sequences of quotations demonstrate the changing use of words. The individually paragraphed quotation was not an option. The solution was to create banks of run-on quotations after each numbered sense. Bold dates were used to begin each quotation, providing both a visual catch for the start of each, and reinforcing the historical sequence of quotations.

The design of the OED depends on the variety of types used, and is almost an exact counterpoint to Johnson’s lack of variety. Murray’s particular innovation in the use of bold was to use variant typefaces systematically to identify different structural items. Bold-face types for headwords in English dictionaries had appeared in Hunter’s Encyclopædic Dictionary in 1879 (Luna 2000). Murray used four bold fonts: a large-face bold Egyptian, a large-face lighter Egyptian, a small-face lighter Egyptian – all cast on brevier (about 7½ pt) and a nonpareil bold (about 6 pt). The boldest, largest fount was used only for headwords. The lighter large-face Egyptian was used for subsidiary headwords, usually archaic forms, and also for the sense-numbering system. The lighter small-face Egyptian was used within entries for variant forms, and the nonpareil bold was used only for quotation dates.

3. Focusing on the user

How can a dictionary be made more user-friendly? The move towards greater contextual information in dictionaries is an important trend. The development of corpus-based dictionaries has begun to emphasize the company a word keeps. As Hanks reminds us, ‘innovation has been very much the exception rather than the rule in lexicography’ (2003: 61). He also points out that while conventional dictionaries offer a number of senses to define each word, little help has traditionally been given to the reader to enable them to assess which is the most likely meaning for their purposes. Confronted with a sequence of senses to wade through, a reader needs every bit of help in selecting the most relevant. School dictionaries in the UK are required to provide word class, word history, and word family information (for pedagogical, not information-retrieval purposes), but there is no specification or requirement to indicate meaning through context (DfES, 2001). Carney (2004) argues that native-speaker dictionaries ‘are about decoding, so naturally they privilege meaning over usage, semantics over syntax’, and therefore lag behind bilingual dictionaries in the classification of phrases, and in definitions that respect the true syntactic and collocational boundaries of words.

Access devices
Recent dictionaries have experimented with access devices to help readers identify the most appropriate entry or sense [Figures 11–13]. Two approaches have been tested by Tono (1997): ‘guide words’ (used in Cambridge International Dictionary of English, 1995) and ‘signposts’ (used in The Longman Dictionary of Contemporary English, 1995). Matters are complicated because CIDE follows the rule ‘one entry, one core meaning’, while LDCE entries are subdivided into word classes. Tono concludes that the general approach of what he calls ‘meaning-access indexes’ is effective. Two further examples of this approach are the Macmillan English Dictionary (2002), which uses an ruled-off ‘menu’ of senses at the start of each entry, and the Encarta dictionaries (1999, 2001), which start each sense with a ‘quick definition’ – a summary phrase in bold. The typography of these access devices could certainly be improved, and there has been a development towards more considered presentations in the Encarta Concise and MED. Recent work at The University of Reading shows that there is scope for developing the typography of meaning-access devices.

Figure 11: Longman Dictionary of Contemporary English, showing ‘signposts’

Figure 12: Macmillan English Dictionary, showing a ‘menu’

Figure 13: Encarta World Dictionary, showing ‘quick definitions’

Typographic coding
As we have seen in the historical examples, spatial arrangement is probably more important that font differentiation in revealing structure. But this is problematic when dictionaries need to be compact and portable, or include more and more features for pedagogic or marketing reasons. When space is at a premium there is a tendency to use the less efficient differentiation by font change alone. The first editions of the Concise Oxford or the Pocket Oxford were miracles of compression, but hardly offered much assistance to the reader in navigating entries. Dictionaries that were typeset by hand had the advantage of flexibility in typeface choice and combination – the OED itself shows this particularly well. Subsequent editions of the COD increased the use of coding within entries. As entries became more complex, especially though the use of cross-references, problems arose of deciding which structural feature a particular word should be mapped to typographically. An example from COD6 shows the difficulty that arose when a complex structure met the relatively inflexible range of typographic options offered by mechanical composition.

COD6 was typeset in Monotype Baskerville, using a 17x19 matrix-case which allowed seven alphabets in a single size to be used simultaneously; these would normally have been roman upper and lower case, italic upper and lower case, bold upper and lower case, and small capitals. Small capitals were only available in roman – there were no italic or bold small capitals. Problems arose when a word had two different statuses in the entry structure, for example if it were both part of a phrase and also a cross reference; of if it was a repeat of the headword and part of a phrase.

Figure 14: Comparison of specimen and printed entries in COD6

In the example [Figure 14], the two phrases given are ‘die game’ and ‘as game as Ned Kelly’. COD6 conventions replace the repeated headword with a swung dash – arguably satisfactory if the rest of the phrase retains its typographical coding to reveal its status. But in the specimen setting, die in ‘die game’ is a cross-reference, so the cross-reference formatting (roman small capitals) overrides the undefined-phrase formatting (italics), and as a result neither of the two words appear in the font that indicates a phrase. The problem was clearly noted, because the published dictionary amended ‘die game’ by the ad-hoc replacement of the swung dash with the word in full.

Digital fonts and mapping surface to structure
The ability of digital composition systems to combine many typefaces, font variants, and sizes without penalty, and the development of extended and related typeface families (groups of related families have been called ‘tribes’ by the type designer Gerard Unger) has provided a more appropriate set of resources for necessarily complex dictionaries, such as the Shorter Oxford English Dictionary. The seven-alphabet restriction has long been overcome, and the typefaces used in SOED5 (2002) represent a larger palette [Figure 15].

Figure 15: Comparison of SOED4 (top) and SOED5, showing how adjustments to micro-typography can allow intra-entry paragraphing and the expansion of abbreviations

OUP Swift and OUP Argo are related designs, with matching appearing sizes (similar sizes align optically) and a consistent approach to character shapes. OUP Swift is a seriffed family, OUP Argo a sanserif family, and both have full ranges of constituent fonts; small capitals, for example, are available in both typeface families in all weights and style variants. Figures are available in both ranging (capital-size) and non-ranging (lower-case-size) in all fonts. While there are no IPA, Greek or Cyrillic characters (yet) in these families, their appearing size relates so closely to Times New Roman that this typeface can be used for all those characters.

The seriffed–sanserif contrast can be used to separate definitions (seriffed) from metalanguage (sanserif), providing an axis of contrast analogous to the ‘bold for look-up items, normal weight for text’ that readers are used to. A further axis of differentiation can be the degree of expansion or condensation used. Specially-drawn condensed and expanded typefaces exist, but any digital fonts can be condensed and expanded in very fine increments. While this is normally avoided in good typesetting, it may be necessary in a dictionary to increase the ‘dynamic range’ of a typeface, and to optically correct very small sizes by making them relatively wider. Adjusting the size of type microscopically can ensure that a particular number of words fit on a page, or compensate for the spelling-out in full of abbreviations.

Design possibilities
Publishers may see the typography as a way of making dictionaries more ‘feature rich’: typographic quirks appear to be growing, for example the ‘associative’ use of different typefaces to differentiate notes in the New Penguin English Dictionary (2000), or the over-emphatic information boxes in the Oxford School Dictionary (2002). The real advantage of digital typefaces that can be fine-tuned more easily is to allow designers to use a wider range of typographic effects by carefully mapping form to content, so that typographic surface faithfully (and informatively) represents structure. Font variation need not be restricted to emphasis, but can distinguish definitions from metalanguage. Ambiguities, such as the use of one italic font to represent several elements, can be avoided. A word can display two statuses, for example a phrase can be in bold italic, with an embedded cross-reference in bold italic small caps. Importantly, the ability to micro-tune sizes allows a return to explicit paragraphing within entries and the expansion of abbreviations with little loss of economy or appearing size.

Acknowledgements
The preparation of this paper was made possible by a grant from the Arts & Humanities Research Board (UK) under their Research Leave scheme.

References
Dictionaries
Bailey, N. 1730. Dictionarium Britannicum. London: T. Cox.
Cambridge International Dictionary of English. 1995. P. Procter (ed.). Cambridge: Cambridge University Press.
Concise Oxford Dictionary, 6/e. 1976. J. Sykes (ed.). Oxford: Clarendon Press.
Encarta Concise English Dictionary. 2001. K. Rooney (ed.). London: Bloomsbury.
Encarta World Dictionary. 1999. K. Rooney (ed.). London: Bloomsbury.
The Encyclopædic Dictionary. 1879. R. Hunter. London: Cassell.
Estienne, R. 1539. Dictionnaire François–Latin. Paris.
Johnson, S. 1755. A Dictionary of the English Language. Folio ed. London: W. Strahan.
—— 1758a, 1760. A Dictionary of the English Language. Octavo eds. London: J. Knapton etc.
—— 1758b. A Dictionary of the English Language. Octavo ed. Dublin: Geo. and Alex. Ewing.
—— 1777. A Dictionary of the English Language. Quarto ed. London: W. Strahan.
Longman Dictionary of Contemporary English. 1995. D. Summers (ed.). Harlow: Longman.
Macmillan English Dictionary for Advanced Learners. 2002. M. Rundell (ed.). London: Macmillan.
Martin, B. 1749. Lingua Britannica Reformata. London: J. Hodges, etc.
New Penguin English Dictionary. 2000. R. E. Allen (ed.). London: Penguin Books.
Ortus Vocabulorum. 1509. London: R. Pynson.
The Oxford English Dictionary. 1933. J. A. H. Murray et al. (eds.). Oxford: Clarendon Press.
Oxford School Dictionary, 3/e. 2002. A. Delahunty (ed.). Oxford: Oxford University Press.
The Shorter Oxford English Dictionary, 4/e. 1993. L. Brown (ed.). Oxford: Clarendon Press.
Shorter Oxford English Dictionary, 5/e. 2002. W. R. Trumble and A. Stevenson (eds.). Oxford: Oxford University Press.

Other works
Brandon, E. E. 1904. Robert Estienne et le Dictionnaire française au XVIe siécle. Baltimore: J. H. Furst.
Carney, F. 2001. ‘When is a Word a Word (and When is it Something Else?)’. Available at www. lexiconplanet.com/art01_eng.html (12 March 2004).
DfES (Department for Education and Skills, UK). 2001. Framework for Teaching English: Years 7, 8 and 9, April 2001. Available at www.standards.dfes.gov.uk/keystage3/ publications/ (12 March 2004).
Garland, K. 1966. Graphics Handbook. London: Studio Vista.
Hancher, M. 1996. ‘[The Century Dictionary:] Illustrations,’ Dictionaries, 17: 79–115.
—— 1998. ‘Gazing at the Imperial Dictionary,’ Book History, 1: 156–81.
Hanks, P. 2003. ‘Lexicography’, in R. Mitkov (ed.), The Oxford Handbook of Computational Linguistics. Oxford: Oxford University Press.
Jackson, H. 2002. Lexicography: An Introduction. London: Routledge.
Luna, P. 2000. ‘Clearly Defined: Continuity and Innovation in the Design of English Dictionaries’, Typography Papers, 4: 5–56.
Moxon, J. 1683/4. Mechanick Exercises on the Whole Art of Printing, repr. 1958, H. Davies and H. Carter (eds.). London: Oxford University Press.
Shaw, G. B. 1902 [review of Hart’s Rules] repr. 1965, A. Taube (ed.), George Bernard Shaw on Language. London: Peter Owen.
Svensén, B. 1993. Practical Lexicography: Principles and Methods of Dictionary-making. Oxford: Oxford University Press
Tono, Y. 1997. ‘Guide Word or Signpost?: An Experimental Study on the Effect of Meaning Access Indexes in EFL Learners’ Dictionaries’, English Studies, 28: 55–77. Available at leo.meikai.ac.jp/~tono/userstudy/signpost.htm (13 March 2004).
Twyman, M. L. 1979. ‘A Schema for the Presentation of Graphic Language’, in P. Kolers, M. Wrolstad, and H. Bouma (eds.), Processing of Visible Language. New York, Plenum.
—— 1982. ‘The Graphic Presentation of Language’, Information Design Journal, 3: 2–22.
—— 1986. ‘Articulating Graphic Language: A Historical Perspective’, in M. E. Wrolstad & D. F. Fisher (eds.), Towards a New Understanding of Literacy. New York: Praeger.
Walker, S. F. 2001. Typography and Language in Everyday Life: Prescription and Practices. Harlow: Longman.
Waller, R. H. 1987. ‘The Typographic Contribution to Language’, PhD thesis, The University of Reading. Available at www.gem.stir.ac.uk/robwaller.html (12 February 2004).

This article first appeared in the proceedings of the Euralex conference, 2004.

Dictionary Design

Wednesday, 18 February 2009

Not Just a Pretty Face: The Contribution of Typography to Lexicography

Followers

Blog Archive

About Me