cpython/Tools/unicode
Serhiy Storchaka bab1d7a561
gh-74902: Add Unicode Grapheme Cluster Break algorithm (GH-143076)
Add the unicodedata.iter_graphemes() function to iterate over grapheme
clusters according to rules defined in Unicode Standard Annex #29.

Add unicodedata.grapheme_cluster_break(), unicodedata.indic_conjunct_break()
and unicodedata.extended_pictographic() functions to get the properties
of the character which are related to the above algorithm.

Co-authored-by: Guillaume "Vermeille" Sanchez <guillaume.v.sanchez@gmail.com>
2026-01-14 14:37:57 +00:00
..
python-mappings Revert "gh-84508: Add mapping files for Korean and Japanese. (gh-93309)" (#93320) 2022-05-29 09:49:19 +09:00
comparecodecs.py
dawg.py gh-96954: use a directed acyclic word graph for storing the unicodedata codepoint names (#97906) 2023-11-04 15:56:58 +01:00
gencjkcodecs.py
gencodec.py
genmap_japanese.py Code: Update Donghee Na's name (#109744) 2023-09-25 18:17:34 +03:00
genmap_korean.py Code: Update Donghee Na's name (#109744) 2023-09-25 18:17:34 +03:00
genmap_schinese.py Code: Update Donghee Na's name (#109744) 2023-09-25 18:17:34 +03:00
genmap_support.py Code: Update Donghee Na's name (#109744) 2023-09-25 18:17:34 +03:00
genmap_tchinese.py gh-84508: tool to generate cjk traditional chinese mappings (gh-93272) 2022-06-11 23:19:41 +09:00
genwincodec.py
genwincodecs.bat
listcodecs.py
Makefile
makeunicodedata.py gh-74902: Add Unicode Grapheme Cluster Break algorithm (GH-143076) 2026-01-14 14:37:57 +00:00
mkstringprep.py bpo-37758: Cut always-constant conditionals on sys.maxunicode. (GH-15302) 2019-09-09 08:20:40 -07:00