From TORI
Jump to navigation Jump to search

04746birdFragment.jpg

02562tsuru.JPG

, X2FC3 is Unicode character number 12227.

It is interpreted as Kanji Radical (see KanjiRadical).

HTML Entry:
(& # 1 2 2 2 7 ;)
(& # x 2 F C 3 ;)

Encoding: and

The Utf8 encoding of character (, X2FC3) and that of the similar character , X9CE5 can be revealed using the dump.t program with command

php dump.t ⿃鳥

In order to execute it, files dump.t, mb_str_split.t, unichr.t, uniord.t should be loaded. The output is

⿃鳥
The array has 6 bytes; here is its splitting:
e2 bf 83 e9 b3 a5 
array(2) {
  [0]=>
  string(3) "⿃"
  [1]=>
  string(3) "鳥"
}

Unicode character number 12227 id est, [[X2FC3]]
Picture: ⿃ ; uses 3 bytes. These bytes are:
xE2 xBF x83 in the hexadecimal representation and
226 191 131 in the decimal representation

Unicode character number 40165 id est, [[X9CE5]]
Picture: 鳥 ; uses 3 bytes. These bytes are:
xE9 xB3 xA5 in the hexadecimal representation and
233 179 165 in the decimal representation

Semantic

In Chinese and Japanese languages, Character , X2FC3 can be used to denote a generic bird, without to indicate the specie.

Character , X9CE5 has similar meaning.

Phonetic

In Japanese, character , X2FC3 can be pronounced as "とり" ("Tori")

Character , X9CE5 has similar pronunciation.

Confusion

Character number 12227 ,X2FC3 (KanjiRadical) looks similar to character number 40165 ,X9CE5 (KanjiLiberal).

Characters and can be confused. This confusion is recognized and described in the literature [1][2].

References

  1. https://util.unicode.org/UnicodeJsps/character.jsp?a=2FC3&B1=Show Unicode Utilities: Character Properties 2FC3 KANGXI RADICAL BIRD Han Script id: allowed confuse:
  2. https://util.unicode.org/UnicodeJsps/character.jsp?a=9CE5 Unicode Utilities: Character Properties 9CE5 CJK UNIFIED IDEOGRAPH-9CE5 Han Script id: restricted confuse:

Keywords

Bird, Chinese, Japanese, Kanji, KanjiLiberal, KanjiRadical, SomeH, SomeU, Tori, TORI, Unicode, UtfH, X2FC3, X9CE5, ,