OLD | NEW |
| (Empty) |
1 Compound word hyphenation | |
2 | |
3 Hyphen library supports better compound word hyphenation and special | |
4 rules of compound word hyphenation of German languages and other | |
5 languages with arbitrary number of compound words. The new options, | |
6 COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN help to set the right | |
7 style for the hyphenation of compound words. | |
8 | |
9 Algorithm | |
10 | |
11 The algorithm is an extension of the original pattern based hyphenation | |
12 algorithm. It uses two hyphenation pattern sets, defined in the same | |
13 pattern file and separated by the NEXTLEVEL keyword. First pattern | |
14 set is for hyphenation only at compound word boundaries, the second one | |
15 is for hyphenation within words or word parts. | |
16 | |
17 Recursive compound level hyphenation | |
18 | |
19 The algorithm is recursive: every word parts of a successful | |
20 first (compound) level hyphenation will be rehyphenated | |
21 by the same (first) pattern set. | |
22 | |
23 Finally, when first level hyphenation is not possible, Hyphen uses | |
24 the second level hyphenation for the word or the word parts. | |
25 | |
26 Word endings and word parts | |
27 | |
28 Patterns for word endings (patterns with ellipses) match the | |
29 word parts, too. | |
30 | |
31 Options | |
32 | |
33 COMPOUNDLEFTHYPHENMIN: min. hyph. dist. from the left compound word boundary | |
34 COMPOUNDRIGHTHYPHENMIN: min. hyph. dist. from the right comp. word boundary | |
35 NEXTLEVEL: sign second level hyphenation patterns | |
36 | |
37 Default hyphenmin values | |
38 | |
39 Default values of COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN are 0, | |
40 and 0 under the hyphenation, too. ("0" values of | |
41 LEFTHYPHENMIN and RIGHTHYPHENMIN mean the default "2" under the hyphenation.) | |
42 | |
43 Examples | |
44 | |
45 See tests/compound* test files. | |
46 | |
47 Preparation of hyphenation patterns | |
48 | |
49 It hasn't been special pattern generator tool for compound hyphenation | |
50 patterns, yet. It is possible to use PATGEN to generate both of | |
51 pattern sets, concatenate it manually and set the requested HYPHENMIN values. | |
52 (But don't forget the preprocessing steps by substrings.pl before | |
53 concatenation.) One of the disadvantage of this method, that PATGEN | |
54 doesn't know recursive compound hyphenation of Hyphen. | |
55 | |
56 László Németh | |
57 <nemeth (at) openoffice.org> | |
OLD | NEW |