Skip to content

Annotation of Easily Confused CJK Unified Ideographs

hfhchan edited this page Jul 3, 2017 · 4 revisions

Introduction

First submitted as IRGN2206 Proposal for Annotating Easily Confused Ideographs, there exist lots of easily confusable characters, especially in Extension B, that were rightfully disunified for having different semantics (IRG terminology: "non-cognate") per their original source.

Easily Confusables, i.e. characters with similar shape, in accordance to the principles set out in Annex S, which are disunified in CJK Unified Ideographs Extensions (i.e. not subject to the Source Separation Rule), may be annotated to reflect the difference in semantic meaning owing to its disunification. The semantic meaning may be applicable to all locales, or only the locales for which a representative glyph appears.

Non-exhaustive list

  • Group 1

    • 𡉚 U+2125A
      • archaic form of 5C01 封
      • confusable with 37B7 㞷
    • 㞷 U+37B7
      • phonetic of 224F8 𢓸, 22687 𢚇, 24775 𤝵, etc
      • confusable with 2125A 𡉚
  • Group 2

    • 𠱬 U+20C6C

      • archaic form of 5468 周
      • confusable with 2055B 𠕛
    • 𠕛 U+2055B

      • archaic form of 5BB3 害
      • confusable with 20C6C 𠱬
  • Group 3

    • 𠁽 U+2007D
      • archaic form of 4E38 丸
      • confusable with 51E1 凡
    • 凡 U+51E1
      • confusable with 2007D 𠁽
  • Group 4

    • 𭰉 U+2DC09
      • archaic form of 6CF2 泲
      • confusable with 6CCD 泍
    • 泍 U+6CCD
      • confusable with 2DC09 𭰉
  • Group 5

    • 𠙹 U+20679
      • alternate form of 20679 甾 vessel
      • confusable with 51F7 凷
    • 凷 U+51F7
      • alternate form of 584A 塊
      • confusable with 20679 𠙹
  • Group 6

    • 曶 U+66F6
      • variant of 忽
      • confusable with 3ADA 㫚
    • 㫚 U+3ADA
      • variant of 昒
      • confusable with 66F6 曶
  • Group 7

    • 𦥑 U+26951
      • semantic of 8207 與, 5B78 學, etc
      • confusable with 81FC 臼
    • 臼 U+81FC
      • semantic of 8202 舂, 820A 舊, etc
      • confusable with 26951 𦥑

Cognate Variants:

  • Group 1
    • 骪 U+9AAA
      • archaic form of 9AAB 骫
    • 𩨖 U+29A16
      • corrupted form of 9AAB 骫