Support Unicode block names.
Support Unicode character classes.
Support UTF-8 character lists and strings in syntax files.
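
As a rough illustration of what the three items above refer to, here is a minimal Python sketch. It is not this project's implementation: the `BLOCKS` table is a small hand-written excerpt of the Unicode block ranges, and the helper names `block_name`, `in_class`, and `utf8_char_list` are hypothetical, chosen only for this example.

```python
import unicodedata

# Hypothetical, partial block table: (name, first code point, last code point).
# A real implementation would load the full list from the Unicode data files.
BLOCKS = [
    ("Basic Latin", 0x0000, 0x007F),
    ("Greek and Coptic", 0x0370, 0x03FF),
    ("Cyrillic", 0x0400, 0x04FF),
]


def block_name(ch):
    """Return the block name for a character, or None if it is not in the table."""
    cp = ord(ch)
    for name, first, last in BLOCKS:
        if first <= cp <= last:
            return name
    return None


def in_class(ch, category):
    """Check a character against a general category prefix such as 'L' or 'Lu'."""
    return unicodedata.category(ch).startswith(category)


def utf8_char_list(raw):
    """Decode UTF-8 bytes into a list of characters (not a list of bytes)."""
    return list(raw.decode("utf-8"))


if __name__ == "__main__":
    for ch in utf8_char_list("aλЖ".encode("utf-8")):
        print(ch, block_name(ch), unicodedata.category(ch))
```

The point of `utf8_char_list` is that a multi-byte sequence such as the two bytes of `λ` has to be treated as one character when it appears in a character list or string.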