TransWikia.com

Why does usepackage[british]{babel} hyphenate the word "alternate" incorrectly?

TeX - LaTeX Asked on March 20, 2021

Now I know there are hyphenation differences between British and American English but in no dictionary, British or American, could I find the word “alternate” to be broken down to al·tern·ate instead of al·ter·nate (for example). And yet:

documentclass[a5paper]{article}
usepackage[british]{babel}
begin{document}
alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate alternate
end{document}

alternate altern-

If you remove the babel line or change it to american, the hyphenation will be correct:

many alternates, all correct

So how does the British hyphenation work? Is it a complete reimplementation that can be buggy? Or is it just a list of exceptions where British syllabification differs from American? If it is the latter, why would a wrong hyphenation of “alternate” be made part of that word list?

5 Answers

According to the Oxford dictionary the correct hyphenation in British English is

al-ter-nate

The pattern for British English were prepared in 1996 by Dominik Wujastik using a list of hyphenated words made available by Oxford University Press and is present on CTAN as ukhyph.tex. In 2008, the team in charge of maintaining hyphenation patterns for TeX Live made a reorganization of the material; here's the start of hyph-en-gb.tex:

% This file has been renamed from ukhyphen.tex to hyph-en-gb.tex in June 2008
% for consistency with other files with hyphenation patterns in hyph-utf8 package.
% No other changes made. See http://www.tug.org/tex-hyphen for more details.

% File: ukhyphen.tex
% TeX hyphenation patterns for UK English

Some lines later we can read

%       $Log: ukhyph.tex $
%       Revision 2.0  1996/09/10 15:04:04  ucgadkw
%       o  added list of hyphenation exceptions at the end of this file.
%
%
% Version 1.0a.  Released 18th October 2005/PT.
%
% Created by Dominik Wujastyk and Graham Toal using Frank Liang's PATGEN 1.0.
% Like the US patterns, these UK patterns correctly hyphenate about 90% of
% the words in the input list, and produce no hyphens not in the list
% (see TeXbook pp. 451--2).
%
% These patterns are based on a file of 114925 British-hyphenated words
% generously made available to Dominik Wujastyk by Oxford University Press.
% This list of words is copyright to the OUP and may not be redistributed.
% The hyphenation break points in the words in the abovementioned file is
% also copyright to the OUP.

so I argue that the hyphenation patterns have never changed from 1996, except for the addition of a hyphenation exception list that reads, in the original file,

hyphenation{ % Do NOT make any alterations to this list! --- DW
uni-ver-sity
uni-ver-sit-ies
how-ever
ma-nu-script
ma-nu-scripts
re-ci-pro-city
through-out
some-thing}

and is exactly the same in the reorganized files.

It is true that alternate hyphenates as

al-tern-ate

as the following file to be run with pdflatex shows:

makeatletterlanguagel@britishshowhyphens{alternate}stop

that prints

Underfull hbox (badness 10000) detected at line 0
[] OT1/cmr/m/n/10 al-tern-ate

on the terminal.

Hyphenation in TeX doesn't examine a long list of words, but rather uses a method based on patterns, described in Appendix H of the TeXbook. The patgen program distills a set of patterns based on a list of hyphenated words, but some compromise has to be made for efficiency of the algorithm, so it's surely possible that some word slips off and turns out to be hyphenated incorrectly.

That's what the hyphenation exception list is for. You can, until the problem is fixed by adding some suitable patterns or the word in the exception list, add it manually:

documentclass[a5paper]{article}
usepackage[british]{babel}

babelhyphenation[british]{al-ter-nate}

begin{document}
alternate alternate alternate alternate alternate alternate alternate 
alternate alternate alternate alternate alternate alternate alternate 
alternate alternate alternate alternate
end{document}

enter image description here

The command babelhyphenation requires babel version 3.9; for an earlier version one can use

begin{hyphenrules}{british}
hyphenation{al-ter-nate}
end{hyphenrules}

which has the same effect.

Correct answer by egreg on March 20, 2021

this answer will not be as elaborate as the one by egreg, but i have some different information.

essentially everything egreg says is correct, but the clue may lie in exactly which oxford dictionary was the basis for the list of hyphenated words that dominik used.

i have just come into possession of a copy of the dictionary that was purportedly used: "the oxford minidictionary of spelling and word division". in it, the word in question is presented as

al.tern|ate

where the period represents a broken vertical, a "less recommended" place for division.

i agree that i don't find this "attractive", and certainly would question it, but then, i'm from the left side of the pond. (i was offered this dictionary as an aid to my editing of tugboat; since i aim for consistency of style -- either british or u.s. -- i gratefully accepted. i admit to surprise in many instances looking through it, but as i said, i'm from the western side of the atlantic.)

edit: here is a scan of the relevant page of the cited dictionary. in no word beginning "altern" is there a hyphen after the "r"; if there is a hyphen, it's always after the "n" (which i don't understand), but in the case of "alternation", that location is avoided completely, with the primary hyphenation point before the "-tion". a true puzzlement.

scanned page of hyphenation dictionary

note: this image is from "the oxford minidictionary of spelling and word division", copyright by oxford university press, 1986, from a 1992 reprint. (i have neither requested nor received permission for this use.)

Update:
After discussion with a native British speaker, I was coerced into searching for an audio example of the pronunciation. I found a useful example at https://dictionary.cambridge.org/us/pronunciation/english/alternate which gives three forms (verb, adjective, and noun) in both UK and US pronunciations. The UK adjective is pronounced in this example with the stress on the second syllable. But so is the US adjective -- which is just plain wrong in my experience. (The pronunciation given for the US noun is also not what I learned, in any US regional variation.) So I concede that the UK pronunciation of the adjective may differ in the way that makes the hyphenation "al-tern-ate" appropriate. However, since the spelling of the three grammatical forms is uniform, this difference in hyphenation means that the word should be omitted entirely from resolution by the patterns, since no automatic grammatical distinction is possible. A conundrum.

Answered by barbara beeton on March 20, 2021

I believe I can perhaps add a touch of clarity to this discussion. Fowler's Modern English Usage, a standard style guide for [British] English published by the Oxford University Press, is likely to have been the source that informed the hyphenation practice. Fowler says of hyphenation that:

The problems of hyphenation at the line-end are compounded in newspapers by the narrowness of the columns and the customary assumption in most printing that the right-hand margin, like the left-hand one, should be straight (or 'justified'). Who has not encountered bad end-of-line breaks like c-/hanging, mans-/laughter, rear-/ranged? [...]

It is usually best to divide a word after a vowel, taking over the following constant to the next line. In pres. pples take over -ing, e.g., divid-/ing, sound-/ing; but chuck-/ling, trick-/ling, and similar words. Generally, when two consonants or vowels come together one should divide between them, e.g. splen-/dour, appreci-/ate. Terminations such as -cian, -sion, and -tion should not be divided when forming one sound: divide as Gre-/cian, ascen-/sion, subtrac-/tion. Hyphened words should be divded at the hyphen, and indictionaries a second hyphen may be used to clarify their spelling. This is not the end of the story: Ronald McIntosh lists thirty-three rules altogether for dividing words at the line-end. [...]

[Regarding printers] Very broadly, British practice has tended to emphasise morphological structure and word origin (as in triumph-/ant), and American practice has tended to give greater weight to the perceived pronunciation (c.f. trium-/phant).

[...] McIntosh warns us that American practice is likely to become more influential in British English as more and more technology for word-setting becomes imported across the atlantic. We have been warned. [!]

It therefore seems entirely plausible that Babel's rules for British English hyphenation do indeed differ slightly, based on the rules above, before the etymology of each word in preference to their perceived pronunciation.

Answered by Landak on March 20, 2021

Apologies for reviving this thread, but I didn't see this answer in here. I believe al-ter-nate and al-tern-ate to be two different words.

  • al-ter-nate: a verb meaning to take turns performing two different activities, e.g. I alternate between running and walking.
  • al-tern-ate: an adjective meaning every other, e.g. I go running on alternate Sundays.

They are pronounced differently, in the same way the word ate (as in to eat) is pronounced, and based on that it makes sense to hyphenate them in two different ways.

Answered by Martin on March 20, 2021

I wrote a thick book in British English. Using option british for babel, I also got many "suspicious" spellings like phys-ics, introduct-ory, proced-ure, dir-ection, stand-ards, quant-ity, intens-ity, mech-anical (it goes on and on like that).

Then I checked that against two web pages:

  • ushuaia.pl that uses hunspell agrees but
  • hyphenation24.com disagrees.

Since Libre Office is also using hunspell, I have tested in it and got the same as babel.

So this spelling is obviously used pretty widely.

All these hyphenations sound funny to me, but I am irrelevant as I am obviously not a native speaker. It would be nice to hear an opinion from a native British speaker though.

Answered by Pygmalion on March 20, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP