Stream type for Unicode Character Database (UCD) files
Unicode data files are semicolon ;
(U+003B) separated fields, except for
Unihan files and a few others that are tab (U+0009) separated. White spaces
around field values are not significant. Line comments are prefixed with a
number sign #
(U+0023).
Instances For
Equations
- Unicode.instInhabitedUCDStream = { default := { toSubstring := default, isUnihan := default } }
Make a UCDStream
from a substring
Equations
- Unicode.UCDStream.ofSubstring str = { toSubstring := str, isUnihan := false }
Instances For
Make a UCDStream
from a string
Equations
- Unicode.UCDStream.ofString str = { str := str, startPos := 0, stopPos := str.endPos, isUnihan := false }