Module Cmarkit_data

Data needed for CommonMark parsing.

Unicode data

val unicode_version : string

unicode_version is the supported Unicode version.

val is_unicode_whitespace : Stdlib.Uchar.t -> bool

is_unicode_whitespace u is true iff u is a CommonMark Unicode whitespace character.

val is_unicode_punctuation : Stdlib.Uchar.t -> bool

is_unicode_punctuation u is true iff u is a CommonMark Unicode punctuation character.
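The two predicates above can be combined into a single classification, which is the kind of test CommonMark's delimiter run rules need. The following is a minimal sketch; the char_class type and the classify function are hypothetical names introduced for illustration and are not part of Cmarkit_data.

```ocaml
(* Hypothetical helper type, not part of Cmarkit_data. *)
type char_class = Whitespace | Punctuation | Other

(* Classifies a Unicode scalar value using the two CommonMark predicates.
   Whitespace is tested first; anything matching neither predicate is Other. *)
let classify (u : Uchar.t) : char_class =
  if Cmarkit_data.is_unicode_whitespace u then Whitespace
  else if Cmarkit_data.is_unicode_punctuation u then Punctuation
  else Other
```

For example, classify (Uchar.of_char ' ') is expected to be Whitespace and classify (Uchar.of_char '!') to be Punctuation, since both characters fall under the CommonMark definitions.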

val unicode_case_fold : Stdlib.Uchar.t -> string option

unicode_case_fold u is the UTF-8 encoding of u's Unicode case fold or None if u case folds to itself.
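Because unicode_case_fold works on one character at a time and returns None when nothing changes, folding a whole string means decoding it, folding each character, and re-encoding. The following is a minimal sketch, assuming OCaml 4.14 or later for the Stdlib UTF-8 decoding functions; fold_utf_8 is a hypothetical helper, not part of Cmarkit_data.

```ocaml
(* Hypothetical helper, not part of Cmarkit_data: case folds a UTF-8
   encoded string one Unicode character at a time. Malformed byte
   sequences decode to U+FFFD, as specified by String.get_utf_8_uchar. *)
let fold_utf_8 (s : string) : string =
  let b = Buffer.create (String.length s) in
  let rec loop i =
    if i >= String.length s then Buffer.contents b else
    let d = String.get_utf_8_uchar s i in
    let u = Uchar.utf_decode_uchar d in
    (match Cmarkit_data.unicode_case_fold u with
     | Some folded -> Buffer.add_string b folded   (* already UTF-8 encoded *)
     | None -> Buffer.add_utf_8_uchar b u);        (* u folds to itself *)
    loop (i + Uchar.utf_decode_length d)
  in
  loop 0
```

A function like this could serve as one step of comparing strings case-insensitively, for instance when matching link labels.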

HTML data

val html_entity : string -> string option

html_entity e is the UTF-8 data for the HTML entity name e (without & and ;).
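As a minimal sketch of how the lookup might be used, the function below resolves an entity name and falls back to the literal source text when the lookup yields None. It assumes that None means the name is not a known entity, which the signature suggests but the description does not state. resolve_entity is a hypothetical helper, not part of Cmarkit_data.

```ocaml
(* Hypothetical helper, not part of Cmarkit_data: resolves an entity
   name given without the leading '&' and trailing ';'. When the lookup
   returns None (assumed to mean an unknown name), the original
   reference text is reconstructed and returned unchanged. *)
let resolve_entity (name : string) : string =
  match Cmarkit_data.html_entity name with
  | Some utf_8 -> utf_8
  | None -> "&" ^ name ^ ";"
```

For example, resolve_entity "amp" is expected to yield "&".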