This site is a static rendering of the Trac instance that was used by R7RS-WG1 for its work on R7RS-small (PDF), which was ratified in 2013. For more information, see Home.

Source for wiki CharacterSets version 1

author

alexshinn

comment


    

ipnr

118.21.204.67

name

CharacterSets

readonly

0

text

R5RS specifies only the characters necessary to handle parsing Scheme
source code itself, leaving almost everything else unspecified.  The
character ordering only guarantees that the characters a..z, A..Z and
0..9 are in order, and the actual integer values are left unspecified
(although in practice almost all implementations use ASCII).

Unicode is fast becoming the most widely used character set, and full
Unicode compliance was specified in the R6RS.  However, Unicode is
expensive to support on small and embedded systems, is slower and more
space intensive than most native character sets, and there are
competing unified character sets which should not be dismissed easily.
Given this, while one would expect a general-purpose featureful Scheme
implementation to support Unicode, it is not immediately clear that we
should _require_ Unicode support unconditionally in the WG1 standard.

Given this, we need to determine what and how much we specify about
characters and strings, including their integer values, orderings and
case mappings.

Current Proposals:

  * UnicodeCowan

time

2010-03-05 13:34:12

version

1