Compound tone diacritics iii #956

Draft
wants to merge 7 commits into
base: main
15 changes: 13 additions & 2 deletions unicodetools/data/ucd/dev/DerivedAge.txt
@@ -1,5 +1,5 @@
# DerivedAge-16.0.0.txt
# Date: 2024-04-30, 21:48:12 GMT
# DerivedAge-17.0.0.txt

Check warning on line 1 in unicodetools/data/ucd/dev/DerivedAge.txt
GitHub Actions / Draft unless approved: Not in the 17.0 pipeline
These characters are neither accepted for Unicode 17.0, nor for any specific version of Unicode, nor are they provisionally assigned. The Age property values for new characters are likely incorrect right now. They will be recomputed after the UTC accepts their encoding and this pull request is updated for the target version.
# Date: 2024-10-21, 19:29:58 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2059,4 +2059,15 @@

# Total code points: 5185

# ================================================

# Age=V17_0

# Newly assigned in Unicode 17.0.0 (September, 2025)

1ADE..1ADF ; 17.0 # [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; 17.0 # [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE

# Total code points: 7

# EOF
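
The range lines added above follow the usual UCD data-file layout: a code point or `start..end` range, a semicolon, the property value, then a `#` comment. As a rough illustration only (this is not part of the pull request or of unicodetools), the following Python sketch parses such lines and recomputes the "Total code points" figure. The function name `parse_ranges` and the inlined `SAMPLE` text are assumptions made for the example.

```python
# Hypothetical helper, not taken from unicodetools: parse UCD-style range lines.
SAMPLE = """\
# Age=V17_0
1ADE..1ADF ; 17.0 # [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; 17.0 # [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
"""


def parse_ranges(text):
    """Yield (start, end, values) for each data line; comments and blanks are skipped."""
    for raw in text.splitlines():
        data = raw.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue
        fields = [f.strip() for f in data.split(";")]
        first, _, last = fields[0].partition("..")
        start = int(first, 16)
        end = int(last, 16) if last else start  # single code point: end == start
        yield start, end, fields[1:]


# Inclusive ranges, so each contributes end - start + 1 code points.
total = sum(end - start + 1 for start, end, _ in parse_ranges(SAMPLE))
print(total)  # 7, matching "# Total code points: 7"
```

The same count explains why every per-property total touched by this diff grows by exactly 7.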
24 changes: 17 additions & 7 deletions unicodetools/data/ucd/dev/DerivedCoreProperties.txt
@@ -1,5 +1,5 @@
# DerivedCoreProperties-16.0.0.txt
# Date: 2024-05-31, 18:09:32 GMT
# DerivedCoreProperties-17.0.0.txt
# Date: 2024-10-21, 19:30:46 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -3195,6 +3195,8 @@ FF41..FF5A ; Cased # L& [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN
1AB0..1ABD ; Case_Ignorable # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1ABE ; Case_Ignorable # Me COMBINING PARENTHESES OVERLAY
1ABF..1ACE ; Case_Ignorable # Mn [16] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER INSULAR T
1ADE..1ADF ; Case_Ignorable # Mn [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; Case_Ignorable # Mn [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
1B00..1B03 ; Case_Ignorable # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1B34 ; Case_Ignorable # Mn BALINESE SIGN REREKAN
1B36..1B3A ; Case_Ignorable # Mn [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA
@@ -3505,7 +3507,7 @@ E0001 ; Case_Ignorable # Cf LANGUAGE TAG
E0020..E007F ; Case_Ignorable # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; Case_Ignorable # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2749
# Total code points: 2756

# ================================================

@@ -7458,6 +7460,8 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
1AA7 ; ID_Continue # Lm TAI THAM SIGN MAI YAMOK
1AB0..1ABD ; ID_Continue # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1ABF..1ACE ; ID_Continue # Mn [16] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER INSULAR T
1ADE..1ADF ; ID_Continue # Mn [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; ID_Continue # Mn [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
1B00..1B03 ; ID_Continue # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1B04 ; ID_Continue # Mc BALINESE SIGN BISAH
1B05..1B33 ; ID_Continue # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA
@@ -8370,7 +8374,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
31350..323AF ; ID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
E0100..E01EF ; ID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 144541
# Total code points: 144548

# ================================================

@@ -9640,6 +9644,8 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
1AA7 ; XID_Continue # Lm TAI THAM SIGN MAI YAMOK
1AB0..1ABD ; XID_Continue # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1ABF..1ACE ; XID_Continue # Mn [16] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER INSULAR T
1ADE..1ADF ; XID_Continue # Mn [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; XID_Continue # Mn [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
1B00..1B03 ; XID_Continue # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1B04 ; XID_Continue # Mc BALINESE SIGN BISAH
1B05..1B33 ; XID_Continue # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA
@@ -10557,7 +10563,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
31350..323AF ; XID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
E0100..E01EF ; XID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 144522
# Total code points: 144529

# ================================================

@@ -10779,6 +10785,8 @@ E01F0..E0FFF ; Default_Ignorable_Code_Point # Cn [3600] <reserved-E01F0>..<rese
1AB0..1ABD ; Grapheme_Extend # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1ABE ; Grapheme_Extend # Me COMBINING PARENTHESES OVERLAY
1ABF..1ACE ; Grapheme_Extend # Mn [16] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER INSULAR T
1ADE..1ADF ; Grapheme_Extend # Mn [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; Grapheme_Extend # Mn [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
1B00..1B03 ; Grapheme_Extend # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1B34 ; Grapheme_Extend # Mn BALINESE SIGN REREKAN
1B35 ; Grapheme_Extend # Mc BALINESE VOWEL SIGN TEDUNG
@@ -11029,7 +11037,7 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
E0020..E007F ; Grapheme_Extend # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; Grapheme_Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2193
# Total code points: 2200

# ================================================

@@ -13106,6 +13114,8 @@ ABED ; Grapheme_Link # Mn MEETEI MAYEK APUN IYEK
1AB0..1ABD ; InCB; Extend # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1ABE ; InCB; Extend # Me COMBINING PARENTHESES OVERLAY
1ABF..1ACE ; InCB; Extend # Mn [16] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER INSULAR T
1ADE..1ADF ; InCB; Extend # Mn [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; InCB; Extend # Mn [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
1B00..1B03 ; InCB; Extend # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1B34 ; InCB; Extend # Mn BALINESE SIGN REREKAN
1B35 ; InCB; Extend # Mc BALINESE VOWEL SIGN TEDUNG
@@ -13357,6 +13367,6 @@ FF9E..FF9F ; InCB; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HA
E0020..E007F ; InCB; Extend # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; InCB; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2192
# Total code points: 2199

# EOF
6 changes: 4 additions & 2 deletions unicodetools/data/ucd/dev/EastAsianWidth.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# EastAsianWidth-16.0.0.txt
# Date: 2024-04-30, 21:48:20 GMT
# EastAsianWidth-17.0.0.txt
# Date: 2024-10-21, 19:30:58 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -807,6 +807,8 @@
1AB0..1ABD ; N # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1ABE ; N # Me COMBINING PARENTHESES OVERLAY
1ABF..1ACE ; N # Mn [16] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER INSULAR T
1ADE..1ADF ; N # Mn [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; N # Mn [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
1B00..1B03 ; N # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1B04 ; N # Mc BALINESE SIGN BISAH
1B05..1B33 ; N # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA
6 changes: 4 additions & 2 deletions unicodetools/data/ucd/dev/LineBreak.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# LineBreak-16.0.0.txt
# Date: 2024-07-29, 16:26:55 GMT
# LineBreak-17.0.0.txt
# Date: 2024-10-21, 19:24:59 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -777,6 +777,8 @@
1AB0..1ABD ; CM # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1ABE ; CM # Me COMBINING PARENTHESES OVERLAY
1ABF..1ACE ; CM # Mn [16] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER INSULAR T
1ADE..1ADF ; CM # Mn [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; CM # Mn [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
1B00..1B03 ; CM # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1B04 ; CM # Mc BALINESE SIGN BISAH
1B05..1B33 ; AK # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA
18 changes: 16 additions & 2 deletions unicodetools/data/ucd/dev/NormalizationTest.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# NormalizationTest-16.0.0.txt
# Date: 2024-04-30, 21:48:23 GMT
# NormalizationTest-17.0.0.txt
# Date: 2024-10-21, 19:31:14 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -18098,6 +18098,20 @@ FFEE;FFEE;FFEE;25CB;25CB; # (○; ○; ○; ○; ○; ) HALFWIDTH WHITE CIRCLE
0061 1ACD 0315 0300 05AE 0062;0061 05AE 1ACD 0300 0315 0062;0061 05AE 1ACD 0300 0315 0062;0061 05AE 1ACD 0300 0315 0062;0061 05AE 1ACD 0300 0315 0062; # (a◌ᫍ◌̕◌̀◌֮b; a◌֮◌ᫍ◌̀◌̕b; a◌֮◌ᫍ◌̀◌̕b; a◌֮◌ᫍ◌̀◌̕b; a◌֮◌ᫍ◌̀◌̕b; ) LATIN SMALL LETTER A, COMBINING LATIN SMALL LETTER INSULAR R, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
0061 0315 0300 05AE 1ACE 0062;00E0 05AE 1ACE 0315 0062;0061 05AE 0300 1ACE 0315 0062;00E0 05AE 1ACE 0315 0062;0061 05AE 0300 1ACE 0315 0062; # (a◌̕◌̀◌֮◌ᫎb; à◌֮◌ᫎ◌̕b; a◌֮◌̀◌ᫎ◌̕b; à◌֮◌ᫎ◌̕b; a◌֮◌̀◌ᫎ◌̕b; ) LATIN SMALL LETTER A, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, COMBINING LATIN SMALL LETTER INSULAR T, LATIN SMALL LETTER B
0061 1ACE 0315 0300 05AE 0062;0061 05AE 1ACE 0300 0315 0062;0061 05AE 1ACE 0300 0315 0062;0061 05AE 1ACE 0300 0315 0062;0061 05AE 1ACE 0300 0315 0062; # (a◌ᫎ◌̕◌̀◌֮b; a◌֮◌ᫎ◌̀◌̕b; a◌֮◌ᫎ◌̀◌̕b; a◌֮◌ᫎ◌̀◌̕b; a◌֮◌ᫎ◌̀◌̕b; ) LATIN SMALL LETTER A, COMBINING LATIN SMALL LETTER INSULAR T, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
0061 0315 0300 05AE 1ADE 0062;00E0 05AE 1ADE 0315 0062;0061 05AE 0300 1ADE 0315 0062;00E0 05AE 1ADE 0315 0062;0061 05AE 0300 1ADE 0315 0062; # (a◌̕◌̀◌֮◌᫞b; à◌֮◌᫞◌̕b; a◌֮◌̀◌᫞◌̕b; à◌֮◌᫞◌̕b; a◌֮◌̀◌᫞◌̕b; ) LATIN SMALL LETTER A, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, COMBINING GRAVE-DOT, LATIN SMALL LETTER B
0061 1ADE 0315 0300 05AE 0062;0061 05AE 1ADE 0300 0315 0062;0061 05AE 1ADE 0300 0315 0062;0061 05AE 1ADE 0300 0315 0062;0061 05AE 1ADE 0300 0315 0062; # (a◌᫞◌̕◌̀◌֮b; a◌֮◌᫞◌̀◌̕b; a◌֮◌᫞◌̀◌̕b; a◌֮◌᫞◌̀◌̕b; a◌֮◌᫞◌̀◌̕b; ) LATIN SMALL LETTER A, COMBINING GRAVE-DOT, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
0061 0315 0300 05AE 1ADF 0062;00E0 05AE 1ADF 0315 0062;0061 05AE 0300 1ADF 0315 0062;00E0 05AE 1ADF 0315 0062;0061 05AE 0300 1ADF 0315 0062; # (a◌̕◌̀◌֮◌᫟b; à◌֮◌᫟◌̕b; a◌֮◌̀◌᫟◌̕b; à◌֮◌᫟◌̕b; a◌֮◌̀◌᫟◌̕b; ) LATIN SMALL LETTER A, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, COMBINING DOT-ACUTE, LATIN SMALL LETTER B
0061 1ADF 0315 0300 05AE 0062;0061 05AE 1ADF 0300 0315 0062;0061 05AE 1ADF 0300 0315 0062;0061 05AE 1ADF 0300 0315 0062;0061 05AE 1ADF 0300 0315 0062; # (a◌᫟◌̕◌̀◌֮b; a◌֮◌᫟◌̀◌̕b; a◌֮◌᫟◌̀◌̕b; a◌֮◌᫟◌̀◌̕b; a◌֮◌᫟◌̀◌̕b; ) LATIN SMALL LETTER A, COMBINING DOT-ACUTE, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
0061 0315 0300 05AE 1AEC 0062;00E0 05AE 1AEC 0315 0062;0061 05AE 0300 1AEC 0315 0062;00E0 05AE 1AEC 0315 0062;0061 05AE 0300 1AEC 0315 0062; # (a◌̕◌̀◌֮◌᫬b; à◌֮◌᫬◌̕b; a◌֮◌̀◌᫬◌̕b; à◌֮◌᫬◌̕b; a◌֮◌̀◌᫬◌̕b; ) LATIN SMALL LETTER A, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, COMBINING CARON-ACUTE, LATIN SMALL LETTER B
0061 1AEC 0315 0300 05AE 0062;0061 05AE 1AEC 0300 0315 0062;0061 05AE 1AEC 0300 0315 0062;0061 05AE 1AEC 0300 0315 0062;0061 05AE 1AEC 0300 0315 0062; # (a◌᫬◌̕◌̀◌֮b; a◌֮◌᫬◌̀◌̕b; a◌֮◌᫬◌̀◌̕b; a◌֮◌᫬◌̀◌̕b; a◌֮◌᫬◌̀◌̕b; ) LATIN SMALL LETTER A, COMBINING CARON-ACUTE, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
0061 0315 0300 05AE 1AED 0062;00E0 05AE 1AED 0315 0062;0061 05AE 0300 1AED 0315 0062;00E0 05AE 1AED 0315 0062;0061 05AE 0300 1AED 0315 0062; # (a◌̕◌̀◌֮◌᫭b; à◌֮◌᫭◌̕b; a◌֮◌̀◌᫭◌̕b; à◌֮◌᫭◌̕b; a◌֮◌̀◌᫭◌̕b; ) LATIN SMALL LETTER A, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, COMBINING VERTICAL-LINE-DOUBLE-ACUTE, LATIN SMALL LETTER B
0061 1AED 0315 0300 05AE 0062;0061 05AE 1AED 0300 0315 0062;0061 05AE 1AED 0300 0315 0062;0061 05AE 1AED 0300 0315 0062;0061 05AE 1AED 0300 0315 0062; # (a◌᫭◌̕◌̀◌֮b; a◌֮◌᫭◌̀◌̕b; a◌֮◌᫭◌̀◌̕b; a◌֮◌᫭◌̀◌̕b; a◌֮◌᫭◌̀◌̕b; ) LATIN SMALL LETTER A, COMBINING VERTICAL-LINE-DOUBLE-ACUTE, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
0061 059A 0316 1DFA 1AEE 0062;0061 1DFA 0316 1AEE 059A 0062;0061 1DFA 0316 1AEE 059A 0062;0061 1DFA 0316 1AEE 059A 0062;0061 1DFA 0316 1AEE 059A 0062; # (a◌֚◌̖◌᷺◌᫮b; a◌᷺◌̖◌᫮◌֚b; a◌᷺◌̖◌᫮◌֚b; a◌᷺◌̖◌᫮◌֚b; a◌᷺◌̖◌᫮◌֚b; ) LATIN SMALL LETTER A, HEBREW ACCENT YETIV, COMBINING GRAVE ACCENT BELOW, COMBINING DOT BELOW LEFT, COMBINING DOUBLE GRAVE ACCENT BELOW, LATIN SMALL LETTER B
0061 1AEE 059A 0316 1DFA 0062;0061 1DFA 1AEE 0316 059A 0062;0061 1DFA 1AEE 0316 059A 0062;0061 1DFA 1AEE 0316 059A 0062;0061 1DFA 1AEE 0316 059A 0062; # (a◌᫮◌֚◌̖◌᷺b; a◌᷺◌᫮◌̖◌֚b; a◌᷺◌᫮◌̖◌֚b; a◌᷺◌᫮◌̖◌֚b; a◌᷺◌᫮◌̖◌֚b; ) LATIN SMALL LETTER A, COMBINING DOUBLE GRAVE ACCENT BELOW, HEBREW ACCENT YETIV, COMBINING GRAVE ACCENT BELOW, COMBINING DOT BELOW LEFT, LATIN SMALL LETTER B
0061 059A 0316 1DFA 1AEF 0062;0061 1DFA 0316 1AEF 059A 0062;0061 1DFA 0316 1AEF 059A 0062;0061 1DFA 0316 1AEF 059A 0062;0061 1DFA 0316 1AEF 059A 0062; # (a◌֚◌̖◌᷺◌᫯b; a◌᷺◌̖◌᫯◌֚b; a◌᷺◌̖◌᫯◌֚b; a◌᷺◌̖◌᫯◌֚b; a◌᷺◌̖◌᫯◌֚b; ) LATIN SMALL LETTER A, HEBREW ACCENT YETIV, COMBINING GRAVE ACCENT BELOW, COMBINING DOT BELOW LEFT, COMBINING DOUBLE ACUTE ACCENT BELOW, LATIN SMALL LETTER B
0061 1AEF 059A 0316 1DFA 0062;0061 1DFA 1AEF 0316 059A 0062;0061 1DFA 1AEF 0316 059A 0062;0061 1DFA 1AEF 0316 059A 0062;0061 1DFA 1AEF 0316 059A 0062; # (a◌᫯◌֚◌̖◌᷺b; a◌᷺◌᫯◌̖◌֚b; a◌᷺◌᫯◌̖◌֚b; a◌᷺◌᫯◌̖◌֚b; a◌᷺◌᫯◌̖◌֚b; ) LATIN SMALL LETTER A, COMBINING DOUBLE ACUTE ACCENT BELOW, HEBREW ACCENT YETIV, COMBINING GRAVE ACCENT BELOW, COMBINING DOT BELOW LEFT, LATIN SMALL LETTER B
0061 0315 0300 05AE 1AF0 0062;00E0 05AE 1AF0 0315 0062;0061 05AE 0300 1AF0 0315 0062;00E0 05AE 1AF0 0315 0062;0061 05AE 0300 1AF0 0315 0062; # (a◌̕◌̀◌֮◌᫰b; à◌֮◌᫰◌̕b; a◌֮◌̀◌᫰◌̕b; à◌֮◌᫰◌̕b; a◌֮◌̀◌᫰◌̕b; ) LATIN SMALL LETTER A, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, COMBINING DOUBLE COMMA ABOVE, LATIN SMALL LETTER B
0061 1AF0 0315 0300 05AE 0062;0061 05AE 1AF0 0300 0315 0062;0061 05AE 1AF0 0300 0315 0062;0061 05AE 1AF0 0300 0315 0062;0061 05AE 1AF0 0300 0315 0062; # (a◌᫰◌̕◌̀◌֮b; a◌֮◌᫰◌̀◌̕b; a◌֮◌᫰◌̀◌̕b; a◌֮◌᫰◌̀◌̕b; a◌֮◌᫰◌̀◌̕b; ) LATIN SMALL LETTER A, COMBINING DOUBLE COMMA ABOVE, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
0061 3099 093C 16FF0 1B34 0062;0061 16FF0 093C 1B34 3099 0062;0061 16FF0 093C 1B34 3099 0062;0061 16FF0 093C 1B34 3099 0062;0061 16FF0 093C 1B34 3099 0062; # (a◌゙◌𖿰़◌᬴b; a𖿰◌़◌᬴◌゙b; a𖿰◌़◌᬴◌゙b; a𖿰◌़◌᬴◌゙b; a𖿰◌़◌᬴◌゙b; ) LATIN SMALL LETTER A, COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK, DEVANAGARI SIGN NUKTA, VIETNAMESE ALTERNATE READING MARK CA, BALINESE SIGN REREKAN, LATIN SMALL LETTER B
0061 1B34 3099 093C 16FF0 0062;0061 16FF0 1B34 093C 3099 0062;0061 16FF0 1B34 093C 3099 0062;0061 16FF0 1B34 093C 3099 0062;0061 16FF0 1B34 093C 3099 0062; # (a◌᬴◌゙◌𖿰़b; a𖿰◌᬴◌़◌゙b; a𖿰◌᬴◌़◌゙b; a𖿰◌᬴◌़◌゙b; a𖿰◌᬴◌़◌゙b; ) LATIN SMALL LETTER A, BALINESE SIGN REREKAN, COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK, DEVANAGARI SIGN NUKTA, VIETNAMESE ALTERNATE READING MARK CA, LATIN SMALL LETTER B
0061 05B0 094D 3099 1B44 0062;0061 3099 094D 1B44 05B0 0062;0061 3099 094D 1B44 05B0 0062;0061 3099 094D 1B44 05B0 0062;0061 3099 094D 1B44 05B0 0062; # (a◌ְ◌्◌゙᭄b; a◌゙◌्᭄◌ְb; a◌゙◌्᭄◌ְb; a◌゙◌्᭄◌ְb; a◌゙◌्᭄◌ְb; ) LATIN SMALL LETTER A, HEBREW POINT SHEVA, DEVANAGARI SIGN VIRAMA, COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK, BALINESE ADEG ADEG, LATIN SMALL LETTER B
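
Each NormalizationTest line lists five semicolon-separated columns (source, NFC, NFD, NFKC, NFKD). For the new test lines above, the NFD column can be reproduced by canonical reordering alone: within each run of combining marks, a stable sort by Canonical_Combining_Class. The Python sketch below illustrates that step under stated assumptions. It is not the code that generated the file. The `CCC` table is hand-written for the example: the values for U+0315, U+0300, U+05AE, U+059A, U+0316 and U+1DFA are the published ones, while 230 for U+1ADE and 220 for U+1AEE are assumptions inferred from where those marks land in the test lines, not read from UnicodeData.txt.

```python
# Hand-written ccc table for the example; unlisted code points count as ccc=0.
CCC = {
    0x0315: 232, 0x0300: 230, 0x05AE: 228,   # published values
    0x059A: 222, 0x0316: 220, 0x1DFA: 218,   # published values
    0x1ADE: 230, 0x1AEE: 220,                # ASSUMED, inferred from the test lines
}


def parse(field):
    """Turn '0061 0315 0300' into a list of integer code points."""
    return [int(cp, 16) for cp in field.split()]


def fmt(cps):
    return " ".join(f"{cp:04X}" for cp in cps)


def canonical_order(cps):
    """Stable-sort each run of non-starters (ccc != 0) by combining class."""
    out, run = [], []
    for cp in cps:
        if CCC.get(cp, 0) == 0:
            out.extend(sorted(run, key=CCC.get))  # sorted() is stable
            run = []
            out.append(cp)
        else:
            run.append(cp)
    out.extend(sorted(run, key=CCC.get))
    return out


print(fmt(canonical_order(parse("0061 0315 0300 05AE 1ADE 0062"))))
# 0061 05AE 0300 1ADE 0315 0062
print(fmt(canonical_order(parse("0061 1ADE 0315 0300 05AE 0062"))))
# 0061 05AE 1ADE 0300 0315 0062
```

Because the sort is stable, U+1ADE stays after U+0300 in the first case and before it in the second, which is exactly the difference between the paired test lines. This sketch covers reordering only; it does not decompose or compose characters.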
8 changes: 5 additions & 3 deletions unicodetools/data/ucd/dev/PropList.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# PropList-16.0.0.txt
# Date: 2024-05-31, 18:09:48 GMT
# PropList-17.0.0.txt
# Date: 2024-10-21, 19:52:00 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -990,6 +990,8 @@ FA70..FAD9 ; Ideographic # Lo [106] CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COM
1AB0..1ABD ; Diacritic # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1ABE ; Diacritic # Me COMBINING PARENTHESES OVERLAY
1AC1..1ACB ; Diacritic # Mn [11] COMBINING LEFT PARENTHESIS ABOVE LEFT..COMBINING TRIPLE ACUTE ACCENT
1ADE..1ADF ; Diacritic # Mn [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
1AEC..1AF0 ; Diacritic # Mn [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
1B34 ; Diacritic # Mn BALINESE SIGN REREKAN
1B44 ; Diacritic # Mc BALINESE ADEG ADEG
1B6B..1B73 ; Diacritic # Mn [9] BALINESE MUSICAL SYMBOL COMBINING TEGEH..BALINESE MUSICAL SYMBOL COMBINING GONG
@@ -1150,7 +1152,7 @@ FFE3 ; Diacritic # Sk FULLWIDTH MACRON
1E944..1E946 ; Diacritic # Mn [3] ADLAM ALIF LENGTHENER..ADLAM GEMINATION MARK
1E948..1E94A ; Diacritic # Mn [3] ADLAM CONSONANT MODIFIER..ADLAM NUKTA

# Total code points: 1178
# Total code points: 1185

# ================================================
