summaryrefslogtreecommitdiff
path: root/sys/src/cmd/tcs/ex5.utf
diff options
context:
space:
mode:
authorTaru Karttunen <taruti@taruti.net>2011-03-30 15:46:40 +0300
committerTaru Karttunen <taruti@taruti.net>2011-03-30 15:46:40 +0300
commite5888a1ffdae813d7575f5fb02275c6bb07e5199 (patch)
treed8d51eac403f07814b9e936eed0c9a79195e2450 /sys/src/cmd/tcs/ex5.utf
Import sources from 2011-03-30 iso image
Diffstat (limited to 'sys/src/cmd/tcs/ex5.utf')
-rwxr-xr-xsys/src/cmd/tcs/ex5.utf50
1 files changed, 50 insertions, 0 deletions
diff --git a/sys/src/cmd/tcs/ex5.utf b/sys/src/cmd/tcs/ex5.utf
new file mode 100755
index 000000000..b4baca0bc
--- /dev/null
+++ b/sys/src/cmd/tcs/ex5.utf
@@ -0,0 +1,50 @@
+.tr -\(hy
+.TL
+Hello World
+.br
+or
+.br
+Καλημέρα κόσμε
+.br
+or
+.br
+こんにちは 世界
+.AU
+Rob Pike
+Ken Thompson
+.AI
+.MH
+.AB
+Plan 9 from Bell Labs has recently been converted from ASCII
+to an ASCII-compatible variant of Unicode, a 16-bit character set.
+In this paper we explain the reasons for the change,
+describe the character set and representation we chose,
+and present the programming models and software changes
+that support the new text format.
+Although we stopped short of full internationalization\(emfor
+example, system error messages are in Unixese, not Japanese\(emwe
+believe Plan 9 is the first system to treat the representation
+of all major languages on a uniform, equal footing throughout all its
+software.
+.AE
+.SH
+Introduction
+.PP
+The world is multilingual but most computer systems
+are based on English and ASCII or worse.
+The pending release of Plan 9 [Pike90], a new distributed operating
+system from Bell Laboratories, seemed a good occasion
+to correct this chauvinism.
+It is easier to make such deep changes when building new systems than
+by retrofitting old ones.
+.PP
+The ANSI C standard [ANSIC] contains some guidance on the matter of
+`wide' and `multi-byte' characters but falls far short of
+solving the myriad associated problems.
+We could find no literature on how to convert a
+.I system
+to larger character sets, although some individual
+.I programs
+have been converted.
+This paper reports what we discovered as we
+explored the problem of representing multilingual