iso8859.html
21 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
<HTML><HEAD><TITLE>ISO 8859 Alphabet Soup</TITLE><META NAME="Description"
CONTENT="A commented graphical overview of the ISO 8859 character sets">
<META HTTP-EQUIV="Content-Type" CONTENT="text/html;charset=ISO-8859-1">
<META HTTP-EQUIV="Content-Language" CONTENT="en">
<BASE HREF="http://czyborra.com/charsets/iso8859.html">
<LINK REV="made" HREF="mailto:roman@czyborra.com">
<META HTTP-EQUIV="Reply-To" CONTENT="roman@czyborra.com">
</HEAD><BODY>
<BLOCKQUOTE><I>NEWS-1998: This page has been moved to <A
HREF="http://czyborra.com/charsets/iso8859.html"
>http://czyborra.com/charsets/iso8859.html</A>, substantially extended
and updated and is now accompanied by additional pages on <A
HREF="iso646.html" >ASCII</A>, <A HREF="codepages.html" >code
pages</A> and <A HREF="cyrillic.html">Cyrillic
charsets</A>.</I><HR></BLOCKQUOTE>
<H1><A NAME="Intro" >The ISO 8859 Alphabet Soup</A></H1>
<P> ISO 8859 is a full series of 10 (and soon even <A HREF="#Future"
>more</A>) standardized multilingual single-byte coded (8bit) graphic
character sets for writing in alphabetic languages:
<OL>
<LI><A HREF="#ISO-8859-1" >Latin1</A> (West European)
<LI><A HREF="#ISO-8859-2" >Latin2</A> (East European)
<LI><A HREF="#ISO-8859-3" >Latin3</A> (South European)
<LI><A HREF="#ISO-8859-4" >Latin4</A> (North European)
<LI><A HREF="#ISO-8859-5" >Cyrillic</A>
<LI><A HREF="#ISO-8859-6" >Arabic</A>
<LI><A HREF="#ISO-8859-7" >Greek</A>
<LI><A HREF="#ISO-8859-8" >Hebrew</A>
<LI><A HREF="#ISO-8859-9" >Latin5</A> (Turkish)
<LI><A HREF="#ISO-8859-10">Latin6</A> (Nordic)
</OL>
The ISO 8859 charsets are not even remotely as complete as the truly
great <A HREF="http://charts.unicode.org/" >Unicode</A> but they have
been around and usable for quite a while (first <A
HREF="ftp://ftp.isi.edu/in-notes/iana/assignments/character-sets"
>registered</A> <A HREF="ftp://ftp.isi.edu/in-notes/rfc2277.txt"
>Internet charsets</A> for use with <A
HREF="ftp://ftp.isi.edu/in-notes/rfc2046.txt" >MIME</A>) and have
already offered a major improvement over the plain 7bit <A
HREF="iso646.html" >US-ASCII</A>.
<P><A HREF="/unicode/standard.html" >Unicode</A> (<A
HREF="iso646.html#ISO-10646" >ISO 10646</A>) will make this whole
chaos of mutually incompatible charsets superfluous because it unifies
a superset of all established charsets and is out to cover all the
world's languages. But I still haven't seen any software to display
all of Unicode on my Unix screen. We're <A HREF="/diplom/abgabe.html"
>working</A> on it.
<P>The ISO 8859 charsets were designed in the mid-1980s by the
European Computer Manufacturer's Association (<A
HREF="http://www.ecma.ch/" >ECMA</A>) and endorsed by the
International Standards Organisation (<A HREF="http://www.iso.ch/"
>ISO</A>). The series is currently being revised by the <A
HREF="http://www.dkuug.dk/jtc1/sc2/wg3/" >ISO/IEC JTC1/SC2/WG3</A>
working group. The 1998 editions all come with Unicode numbers.
<P>This page exists because the ISO won't provide <A
HREF="ftp://ftp.informatik.uni-erlangen.de/pub/doc/ISO/std-faq" >free
copies</A> of their <A HREF="http://www.iso.ch/cate/35040.html"
>published standards</A> (the charset subcommittee <A
HREF="http://www.dkuug.dk/jtc1/sc2/" >JTC1/SC2</A> has recently called
for a free online publication in the future, though, see their <A
HREF="http://www.dkuug.dk/jtc1/sc2/open/02n3077.htm" >Redmond
resolution M08.02: Publication of SC 2 Standards on the web</A>) and
the ECMA offers them <A HREF="http://www.ecma.ch/stand/standard.htm"
>on paper only</A>.
<P>By clicking at my [TXT]-buttons you can download textual reference
tables with Unicode mappings for each of the charsets. You may want
to double-check them against more authorative sources like Keld
Simonsen's pioneering <A HREF="ftp://ftp.isi.edu/in-notes/rfc1345.txt"
>RFC 1345</A>, or his <A HREF="ftp://dkuug.dk/i18n/charmaps/" >updated
and corrected charmaps</A> for <A HREF="http://www.dkuug.dk/i18n/"
>i18n@dkuug.dk</A>, mirrored at many Linux's POSIX.2 <A
HREF="file://localhost/usr/share/i18n/charmap/"
>/usr/share/i18n/charmap/</A> directory, the <A
HREF="ftp://ftp.unicode.org/mappings/iso8859/" >mapping tables on
ftp.unicode.org</A>, or Kosta Kostis's <A
HREF="http://www.kostis.net/charsets/" >transhtm-generated tables</A>.
<P>There are <A HREF="http://www.unicode.org/unicode/Languages.html"
>ISO 639 language codes</A> for some 150 of the world's several
thousand known languages.
The 1998 editions of the ISO-8859 Latin alphabets come with a <A
HREF="http://www.hut.fi/u/jkorpela/8859.html" >table of languages
covered</A>.
A <A HREF="http://www.alvestrand.no/~hta/ietf/lang-chars.txt" >survey
of each language's characters</A> was started by <A
HREF="http://www.alvestrand.no/~hta/" >Harald Alvestrand</A>.
A more complete but less computerized survey is Akira Nakanishi's
colorful book of the "Writing Systems of the World", ISBN <A
HREF="http://www.amazon.com/exec/obidos/ISBN=0804816549"
>0-8048-1654-9</A>.
It would be interesting to merge these two into an illustrative <A
HREF="/utf/#UTF-8" >UTF-8</A> text file with <A HREF="/yudit/"
>Yudit</A>.
<P> The following bitmap GIFs show only the upper <A
HREF="http://www.ecma.ch/stand/ecma-035.htm" >G1</A> portions of the
respective charsets. Characters 0 to 127 are always identical with <A
HREF="iso646.html" >US-ASCII</A> and the positions 128 to 159 hold
some less used control characters: the so-called <A
HREF="iso6429-c1.txt.gz" >C1 set</A> from <A HREF="iso6429.html" >ISO
6429</A>.
<P>Each image is followed by a link to the textual reference table and
the matching <A HREF="http://www.ora.com/homepages/comp.fonts/FAQ/"
>public-domain bitmap font</A> source code in <A
HREF="http://www.adobe.com/supportservice/devrelations/PDFS/TN/5005.BDF_Spec.pdf"
>BDF bitmap distribution format</A> so that you can integrate support
for all charsets in your <A
HREF="http://wwwwbs.cs.tu-berlin.de/cgi/rtfm/usr/elm/man/man1/metamail.1"
>metamail</A> setup like I did in 1994 in <A
HREF="http://wwwwbs.cs.tu-berlin.de/usr/elm/"
>cs.tu-berlin.de:/usr/elm/</A> before our beloved superuser
confiscated it because he felt competed or something.
Check out the commands <A
HREF="http://wwwwbs.cs.tu-berlin.de/cgi/rtfm/usr/X11/man/mann/mkfontdir.n"
>mkfontdir</A> and <A
HREF="http://wwwwbs.cs.tu-berlin.de/cgi/rtfm/usr/X11/man/mann/xset.n"
>xset</A> to install extra fonts on your X terminal.
If anybody has converters from BDF to other bitmap formats like those
for Windows or MacOS, please send them to me!
Most glyphs were extracted from <A
HREF="ftp://ftp.x.org/contrib/fonts/" >etl16-unicode.bdf</A> and
reassembled using a bunch of perl scripts.
<P><BLOCKQUOTE>``I'm really terrified to see how difficult it can
be for a non-latin1 person to be able to print in his/her own mother
tongue!'' -- Akim Demaille, maintainer of <A
HREF="http://www-inf.enst.fr/~demaille/a2ps/" >a2ps</A>, early 1998
</BLOCKQUOTE>
<H2><A NAME="ISO-8859-1">ISO-8859-1 (Latin1)</A></H2>
<IMG SRC="iso8859-1.gif"><BR> charset=ISO-8859-1
[<A HREF="iso8859-1.txt.gz">TXT</A>]
[<A HREF="iso8859-1.bdf.gz">BDF</A>]
<P> <A HREF="http://www.hut.fi/u/jkorpela/latin1/" >Latin1</A> covers
most <B>West European</B> languages, such as French (fr), Spanish
(es), Catalan (ca), Basque (eu), Portuguese (pt), Italian (it),
Albanian (sq), Rhaeto-Romanic (rm), Dutch (nl), German (de), Danish
(da), Swedish (sv), Norwegian (no), Finnish (fi), Faroese (fo),
Icelandic (is), Irish (ga), Scottish (gd), and English (en),
incidentally also Afrikaans (af) and Swahili (sw), thus in effect also
the entire American continent, Australia and much of Africa. The most
notable exceptions are Zulu (zu) and other Bantu languages using <A
HREF="http://charts.unicode.org/Unicode.charts/normal/U0180.html"
>Latin Extended-B</A> letters, and of course Arabic in North Africa,
and <A HREF="/yudit/Guarani.kmap" >Guarani</A> (gn) missing GEIUY with
~ tilde.
The lack of the ligatures Dutch IJ, French OE and ,,German`` quotation
marks is considered tolerable. The lack of the new C=-resembling Euro
currency symbol U+20AC has opened the <A
HREF="http://www.indigo.ie/egt/standards/iso10646/euro/euromin2.html"
>discussion</A> of a new <A HREF="#ISO-8859-15">Latin0</A>.
<P>Latin1 has also been adopted as the first page of ISO 10646 (<A
HREF="http://charts.unicode.org/" >Unicode</A>). Latin1 is HTML's <A
HREF="http://www.uni-passau.de/~ramsch/iso8859-1.html" >base
charset</A> but HTML has now been globalized through <A
HREF="ftp://ftp.isi.edu/in-notes/rfc2070.txt" >RFC 2070</A>. You can
browse the <A
HREF="http://www.ics.uci.edu/pub/ietf/html/multilingual.html" >charset
smorgasbord</A> or the impressive <A
HREF="http://www.unicode.org/unicode/iuc10/x-utf8.html" >IUC10
poster</A> to test your browser or let Andy Flavell tell you more
about the <A
HREF="http://ppewww.ph.gla.ac.uk/~flavell/iso8859/iso8859-pointers.html"
>practical problems</A>.
<DL>
<DT><A NAME="DEC-MCS">DEC-MCS</A><DD>
<P><A HREF="#ISO-8859-1" >ISO-8859-1</A> was derived from the <A
HREF="http://www.digital.com/" >DEC</A> Multinational Character Set
used on the standard DEC VT-220 terminals:
<P><IMG SRC="dec-mcs.gif"><BR> charset=DEC-MCS
[<A HREF="dec-mcs.txt.gz">TXT</A>]
[<A HREF="dec-mcs.bdf.gz">BDF</A>]
<P>
<DT><A NAME="CP1252">CP1252 (WinLatin1)</A><DD>
<P>You often see Microsoft Windows users (check out my <A
HREF="codepages.html" >code page survey</A>) announcing their texts as
being in <A HREF="#ISO-8859-1" >ISO-8859-1</A> even when in fact they
contain funny characters from the CP1252 superset (and they may become
more since Microsoft has also added the Euro to their code pages), so
here you have a Unix font for them:
<P><IMG SRC="cp1252.gif"><BR> charset=Windows-1252
[<A HREF="cp1252.txt.gz">TXT</A>]
[<A HREF="cp1252.bdf.gz">BDF</A>]
</DL>
<H2><A NAME="ISO-8859-2">ISO-8859-2 (Latin2)</A></H2>
<IMG SRC="iso8859-2.gif"><BR> charset=ISO-8859-2
[<A HREF="iso8859-2.txt.gz">TXT</A>]
[<A HREF="iso8859-2.bdf.gz">BDF</A>]
<P><A HREF="http://sizif.mf.uni-lj.si/linux/cee/iso8859-2.html"
>Latin2</A> covers the languages of <B>Central and Eastern Europe</B>:
Czech (cs), Hungarian (hu), <A
HREF="http://www.agh.edu.pl/ogonki/plchars.html" >Polish</A> (pl),
Romanian (ro), Croatian (hr), Slovak (sk), Slovenian (sl), Sorbian.
For Romanian the S and T had better use commas instead of cedilla as
in Turkish: the U+015F LATIN SMALL LETTER S WITH CEDILLA at =BA ought
to be read as U+0219 LATIN SMALL LETTER S WITH COMMA BELOW etc.
<P>The German umlauts äöüß are found at exactly the same positions in
Latin1, Latin2, Latin3, Latin4, Latin5, Latin6. Thus you can write
German+Polish with Latin2 or German+Turkish with Latin5 but there is
no 8bit charset to properly mix German+Russian, for instance.
<H2><A NAME="ISO-8859-3">ISO-8859-3 (Latin3)</A></H2>
<IMG SRC="iso8859-3.gif"><BR> charset=ISO-8859-3
[<A HREF="iso8859-3.txt.gz">TXT</A>]
[<A HREF="iso8859-3.bdf.gz">BDF</A>]
<P> Latin3 is popular with authors of <A
HREF="http://www.esperanto.net/veb/faq.html" >Esperanto</A> (eo) and
Maltese (mt), and it covered Turkish before the introduction of <A
HREF="#ISO-8859-9" >Latin5</A> in 1988.
<H2><A NAME="ISO-8859-4">ISO-8859-4 (Latin4)</A></H2>
<IMG SRC="iso8859-4.gif"><BR> charset=ISO-8859-4
[<A HREF="iso8859-4.txt.gz">TXT</A>]
[<A HREF="iso8859-4.bdf.gz">BDF</A>]
<P> Latin4 introduced letters for Estonian (et), the <A
HREF="http://winnie.math.tu-berlin.de/cgi-bin/schlicke/webster?baltic"
>Baltic</A> languages Latvian (lv, Lettish) and <A
HREF="http://www.angelfire.com/me/rch/ll4.html" >Lithuanian</A> (lt),
Greenlandic (kl) and Lappish. Note that Latvian requires the cedilla
on the =BB U+0123 LATIN SMALL LETTER G WITH CEDILLA to jump on top.
Latin4 was followed by <A HREF="#ISO-8859-10" >Latin6</A>.
<H2><A NAME="ISO-8859-5">ISO-8859-5 (Cyrillic)</A></H2>
<IMG SRC="iso8859-5.gif"><BR> charset=ISO-8859-5
[<A HREF="iso8859-5.txt.gz">TXT</A>]
[<A HREF="iso8859-5.bdf.gz">BDF</A>]
<P> With these <B>Cyrillic</B> letters you can type Bulgarian (bg),
Byelorussian (be), Macedonian (mk), Russian (ru), Serbian (sr) and
pre-1990 (no <A HREF="cyrillic.html#KOI8-U" >ghe with upturn</A>)
Ukrainian (uk). The ordering is based on the (incompatibly) revised
GOST 19768 of 1987 with the Russian letters except for ë sorted by
Russian alphabet (ABVGDE).
<P> Note that <A HREF="cyrillic.html#KOI8-R" >several other Cyrillic
charsets</A> are used on the net. Have a look at my neighboring <A
HREF="cyrillic.html" >Cyrillic charsets page</A>.
<H2><A NAME="ISO-8859-6">ISO-8859-6 (Arabic)</A></H2>
<IMG SRC="iso8859-6.gif"><BR> charset=ISO-8859-6
[<A HREF="iso8859-6.txt.gz">TXT</A>]
[<A HREF="iso8859-6.bdf.gz">BDF</A>]
<P> This is the <B>Arabic</B> alphabet, unfortunately the basic
alphabet for the Arabic (ar) language only and not containing the four
extra letters for Persian (fa) nor the eight extra letters for
Pakistani Urdu (ur). This fixed font is not well-suited for text
display. Each Arabic letter occurs in up to four (2²) presentation
forms: initial, medial, final or separate. To make Arabic text
legible you'll need a display engine that analyses the context and
combines the appropriate glyphs on top of a handler for the reverse
writing direction shared with <A HREF="#ISO-8859-8" >Hebrew</A>. The
rendering algorithm is described in the <A
HREF="http://www.unicode.org/unicode/uni2book/uc20toc.html" >Unicode
book</A> and I have implemented it in my <A HREF="/arabjoin/"
>arabjoin</A> perl script.
<H2><A NAME="ISO-8859-7">ISO-8859-7 (Greek)</A></H2>
<IMG SRC="iso8859-7.gif"><BR> charset=ISO-8859-7
[<A HREF="iso8859-7.txt.gz">TXT</A>]
[<A HREF="iso8859-7.bdf.gz">BDF</A>]
<P>This is (modern monotonic) <B>Greek</B> (el) to me. ISO-8859-7 was
formerly known as <A HREF="http://www.elot.gr/" >ELOT</A>-928 or
ECMA-118:1986.
<H2><A NAME="ISO-8859-8">ISO-8859-8 (Hebrew)</A></H2>
<IMG SRC="iso8859-8.gif"><BR> charset=ISO-8859-8
[<A HREF="iso8859-8.txt.gz">TXT</A>]
[<A HREF="iso8859-8.bdf.gz">BDF</A>]
<P> And this is the <A HREF="ftp://ftp.isi.edu/in-notes/rfc1555.txt"
>Hebrew</A> script used by <B>Hebrew</B> (iw) and Yiddish (ji). Like
<A HREF="#ISO-8859-6" >Arabic</A> it is <A
HREF="ftp://ftp.isi.edu/in-notes/rfc1556.txt" >written leftwards</A>,
so get your dusty old bidirectional typewriters out of the closet! We
are promised to see a Bidirectional Algorithm Reference Implementation
published as <A
HREF="http://www.unicode.org/unicode/reports/techreports.html"
>Unicode Technical Report</A> #9 in the near future.
<H2><A NAME="ISO-8859-9">ISO-8859-9 (Latin5)</A></H2>
<IMG SRC="iso8859-9.gif"><BR> charset=ISO-8859-9
[<A HREF="iso8859-9.txt.gz">TXT</A>]
[<A HREF="iso8859-9.bdf.gz">BDF</A>]
<P>Latin5 replaces the rarely needed Icelandic letters ðýþ in <A
HREF="#ISO-8859-1" >Latin1</A> with the <B>Turkish</B> ones.
<H2><A NAME="ISO-8859-10">ISO-8859-10 (Latin6)</A></H2>
<IMG SRC="iso8859-10.gif"><BR> charset=ISO-8859-10
[<A HREF="iso8859-10.txt.gz">TXT</A>]
[<A HREF="iso8859-10.bdf.gz">BDF</A>]
<P>Introduced in 1992, Latin6 rearranged the <A HREF="#ISO-8859-4"
>Latin4</A> characters, dropped some symbols and the Latvian ŗ,
added the last missing Inuit (Greenlandic Eskimo) and non-Skolt Sami
(Lappish) letters and reintroduced the Icelandic ðýþ to cover the
entire <B>Nordic</B> area. Skolt Sami still needs a few more accents.
Note that RFC 1345 and <A
HREF="http://wwwwbs.cs.tu-berlin.de/cgi/gnuinfo/recode/" >GNU
recode</A> contain errors and use a preliminary and different latin6.
<P><HR>
<H2><A NAME="Future">Future Perspective</A></H2>
<P>From information to be found on <A HREF="http://www.indigo.ie/egt/"
>Michael Everson's website</A> and the official <A
HREF="http://www.dkuug.dk/jtc1/sc2/wg3/" >WG 3 website</A> I gathered
that in the near future we shall get to see new parts to ISO-8859
which may look like these:
<H2><A NAME="ISO-8859-11">ISO-8859-11 (Thai)</A></H2>
<IMG SRC="iso8859-11.gif"><BR>charset=ISO-8859-11
[<A HREF="iso8859-11.txt.gz">TXT</A>]
[<A HREF="iso8859-11.bdf.gz">BDF</A>]
<P>The Thai TIS620 is likely to be published as ISO-8859-11 Latin/Thai
(th). It contains some combining vowel and tone marks that have to be
written above or below the consonants.
<H2><A NAME="ISO-8859-12">ISO-8859-12</A></H2>
<P>There is currently no draft numbered ISO-8859-12. This number
might be reserved for ISCII Indian.
<P>It is unlikely that there will ever be a Vietnamese part.
Vietnamese (vi) seems to be the language using the most accentuated
letters of all languages using the Latin script. Some letters carry a
combination of two different accents. They are so many that they
simply don't fit into the model of ISO-8859. You can use <A
HREF="vietnamese.html" >VISCII</A> instead.
<H2><A NAME="ISO-8859-13">ISO-8859-13 (Latin7)</A></H2>
<IMG SRC="iso8859-13.gif"><BR>charset=ISO-8859-13
[<A HREF="iso8859-13.txt.gz">TXT</A>]
[<A HREF="iso8859-13.bdf.gz">BDF</A>]
<P>Latin7 is going to cover the <B>Baltic Rim</B> and re-establish the
Latvian (lv) support lost in Latin6 and may introduce the local
quotation marks. It resembles <A HREF="codepages.html#CP1257"
>WinBaltic</A>.
<H2><A NAME="ISO-8859-14">ISO-8859-14 (Latin8)</A></H2>
<IMG SRC="iso8859-14.gif"><BR>charset=ISO-8859-14
[<A HREF="iso8859-14.txt.gz">TXT</A>]
[<A HREF="iso8859-14.bdf.gz">BDF</A>]
<P>Latin8 adds the last Gaelic and Welsh (cy) letters to Latin1 to
cover all <B>Celtic</B> languages.
<H2><A NAME="ISO-8859-15">ISO-8859-15 (Latin9)</A></H2>
<IMG SRC="iso8859-15.gif"><BR>charset=ISO-8859-15
[<A HREF="iso8859-15.txt.gz">TXT</A>]
[<A HREF="iso8859-15.bdf.gz">BDF</A>]
<P>The new <A HREF="http://www.hut.fi/u/jkorpela/latin9.html"
>Latin9</A> nicknamed <B>Latin0</B> aims to update <A
HREF="#ISO-8859-1" >Latin1</A> by replacing the less needed symbols
¦¨´¸¼½¾ with forgotten French and Finnish letters and placing the
U+20AC Euro sign in the cell =A4 of the former international currency
sign ¤.
<P>On 1998-06-28 I suggested to heed the lesson learned and base
Latin9 on <A HREF="#ISO-8859-9" >ISO-8859-9</A> instead of Latin1
because there is a much greater use for Turkish than for Icelandic but
apparently that proposal did not sway the <A
HREF="http://www.dkuug.dk/jtc1/sc2/wg3/" >WG3 standardizers</A>.
<P><BLOCKQUOTE>
<HR>From: misha.wolf@reuters.com
<BR>Date: 22 Jun 1998
<BR>To: unicode@unicode.org
<BR>Subject: Re: Outlook & the Euro
<P>> <I>ISO 8859-15 will probably be implemented by a number of
vendors, but it will take some time until a large percentage of the
users start using those versions. Until then, it might be wise *not*
to make 8859-15 the default when sending mail.</I>
<P>We have just the place for ISO 8859-15 here in London. It is
called the Science Museum and is full of charming historical relics,
like Babagge's difference engine, used by Ada Lovelace (I think that
was her family name).
<P>What a relief that we now have Unicode and won't have to implement
this amusing piece of history.<HR></BLOCKQUOTE>
<P>But with <A HREF="/yudit/" >good Unicode support</A>, adding yet
another charset is a <A HREF="Makefile.html" >piece of cake</A>. And
the Euro will be needed on systems limited to 8bit. ISO-8859-15 fonts
and keysyms have already been included in X11 <A
HREF="ftp://ftp.x.org/pub/R6.4/fixes/fix-02" >R6.4 fix #02</A>.
<HR>
<H2>Blurb</H2>
I started this page as http://www.cs.tu-berlin.de/~czyborra/charsets/
on February 27, 1995, in reaction to a request for ISO-8859 code
charts on <A HREF="news:comp.std.internat" >comp.std.internat</A>.
Until then, there had only been lousy scans of the ISO charts floating
around on the net besides textual tables. I could easily throw this
together since I had already gathered all the necessary X11 fonts from
<A HREF="http://www.etl.go.jp/~mule/">MULE</A>'s, <A
HREF="ftp://ccsun.tuke.sk/pub/network-administration/netadmin.jpg"
>Barry Bouwsma</A>'s and <A HREF="http://www.kostis.net/" >Kosta
Kostis</A>' collections. Since then has the charsets page had more
than <IMG
SRC="http://wwwwbs.cs.tu-berlin.de/cgi/count/~czyborra/charsets/"
ALT="[a bitmapped number]"> accesses, got copied, included in books,
CD-ROMs, and even <A
HREF="http://www.isoc.org:8080/codage/iso8859/jeuxiso.fr.htm"
>translated into French</A>.
Because of network turbulences at cs.tu-berlin.de that shook the
referer database I can only offer you an old list of <A
HREF="http://wwwwbs.cs.tu-berlin.de/cgi/links/~czyborra/charsets/$"
>who referred</A> to the charsets page.
<P>Thanks go to Sven-Ove Westberg, Alexandre Khalil, Andreas Prilop,
<A HREF="http://meyer.fys.ku.dk/~sparre/" >Jacob Andersen</A>, Stavros
Macrakis, Doug Newell, Chrystopher Nehaniv, Alan Watson, Aaron Irvine,
Jonathan Rosenne, Christine Kluka, Clint Adams, Arnold Krivoruk, Van
Le, <A HREF="http://vzdmzi.zdv.uni-mainz.de/~knappen/" >Jörg
Knappen</A>, Thomas Henlich, Chris Maden, Paul Keinänen, <A
HREF="http://www.rhein-neckar.de/~mips/" NAME="naddy" >Christian
Weisgerber</A>, Kent Karlsson, <A
HREF="http://www.cl.cam.ac.uk/~mgk25/" >Markus Kuhn</A>, <A
HREF="http://www1.highway.com.py/~pinoz/" >Pino Zollo</A>, Imants
Metra, <A HREF="http://www.hut.fi/u/jkorpela/" >Jukka Korpela</A>, and
Paul Hill who provided valuable hints for corrections to this page.
You are welcome to mail your criticism to <A
HREF="mailto:roman@czyborra.com?subject=iso8859.html"
>roman@czyborra.com</A>.
<P><I><A HREF="http://czyborra.com/">Roman Czyborra</A></I>
<BR>$Date: 1998/12/01 12:39:22 $
</BODY></HTML>