Character sets

All code points are in hexadecimal.

NameStartChar

NameStartChar
   : 'A'..'Z' | 'a'..'z'
   | '\u00C0'..'\u00D6'
   | '\u00D8'..'\u00F6'
   | '\u00F8'..'\u02FF'
   | '\u0370'..'\u037D'
   | '\u037F'..'\u1FFF'
   | '\u200C'..'\u200D'
   | '\u2070'..'\u218F'
   | '\u2C00'..'\u2FEF'
   | '\u3001'..'\uD7FF'
   | '\uF900'..'\uFDCF'
   | '\uFDF0'..'\uFFFD'
   ;
00C0 …​ 00D6

Latin-1 Supplement (0080 - 00FF)

not valid

00D7 Multiplication Sign

00D8 …​ 00F6

Latin-1 Supplement (0080 - 00FF)

not valid

00F7 Division Sign

00F8 …​ 02FF
0370 …​ 037D
  • Greek and Coptic (0370 - 03FF)

not valid

037E Greek Question Mark

037F …​ 1FFF
  • …​ + 61 other languages and scripts

200C …​ 200D
2070 …​ 218F
2C00 …​ 2FEF
3001 …​ D7FF
  • …​ +45 languages and scripts

F900 …​ FDCF
  • …​ +10 languages and scripts

not valid

FDD0 …​ FDEF "not a character" code points

FDF0 …​ FFFD

NameChar

NameChar::
   : NameStartChar
   | DIGIT
   | '_'
   | '-'
   | '\u00B7'
   | '\u0300'..'\u036F'
   | '\u203F'..'\u2040'
   ;
00B7

Middle Dot

0300 …​ 036F

Combining Diacritical Marks

203F …​ 2040

Undertie …​ Character Tie

DIGIT

DIGIT: [0-9];