cpython/Tools/unicode
Serhiy Storchaka bab1d7a561
gh-74902: Add Unicode Grapheme Cluster Break algorithm (GH-143076)
Add the unicodedata.iter_graphemes() function to iterate over grapheme
clusters according to rules defined in Unicode Standard Annex #29.

Add unicodedata.grapheme_cluster_break(), unicodedata.indic_conjunct_break()
and unicodedata.extended_pictographic() functions to get the properties
of the character which are related to the above algorithm.

Co-authored-by: Guillaume "Vermeille" Sanchez <guillaume.v.sanchez@gmail.com>
2026-01-14 14:37:57 +00:00
python-mappings
comparecodecs.py
dawg.py gh-96954: use a directed acyclic word graph for storing the unicodedata codepoint names (#97906) 2023-11-04 15:56:58 +01:00
gencjkcodecs.py
gencodec.py
genmap_japanese.py
genmap_korean.py
genmap_schinese.py
genmap_support.py
genmap_tchinese.py
genwincodec.py
genwincodecs.bat
listcodecs.py
Makefile
makeunicodedata.py gh-74902: Add Unicode Grapheme Cluster Break algorithm (GH-143076) 2026-01-14 14:37:57 +00:00
mkstringprep.py