From TORI
Revision as of 20:28, 2 June 2021 by T (talk | contribs) (Phonetic)
Jump to: navigation, search

MargaritaOvsianka.jpg

6ppFlip.jpg

(X9CE5) is Unicode character number 40165. It appears as recommended kanju number 1405 in the List_of_jōyō_kanji [1]

Html input:
(& # 4 0 1 6 5 ;)
(& # x 9 C E 5 ;)

Encoding of and

The Utf8 encoding of character , X9CE5 and that of the similar character (, X2FC3) can be revealed by the PHP program dump.t with command

php dump.t ⿃鳥

In order to execute it, files dump.t, mb_str_split.t, unichr.t, uniord.t should be loaded. The output is

⿃鳥
The array has 6 bytes; here is its splitting:
e2 bf 83 e9 b3 a5 
array(2) {
  [0]=>
  string(3) "⿃"
  [1]=>
  string(3) "鳥"
}

Unicode character number 12227 id est, [[X2FC3]]
Picture: ⿃ ; uses 3 bytes. These bytes are:
xE2 xBF x83 in the hexadecimal representation and
226 191 131 in the decimal representation

Unicode character number 40165 id est, [[X9CE5]]
Picture: 鳥 ; uses 3 bytes. These bytes are:
xE9 xB3 xA5 in the hexadecimal representation and
233 179 165 in the decimal representation

Semantic

In Chinese and in Japanese languages, character (X9CE5) usually means a generic bird, without to indicate the specie (,, etc.).

Character &#X2FC3 (X2FC3) has similar meaning

Phonetic

in Japanese, character (X9CE5) is usually pronounced as "tori", ”とり”。

Character (X2FC3) has similar pronunciation.

Confusion

With some software, character [[[鳥]],X9CE5 looks very similar to character ,X2FC3 and may be easy confused. these characters should be considered as synomyms.

Similarity of pictures, sound and meaning of characters (X2FC3) and (X9CE5) may cause confusions: Some softwares treat them as the same, and some recognize the difference. For year 2021, these confusion are already reported [2][3].

In order to avoid confusion, character , X2FC3 is interpreted as Kanji radical (see KanjiRadical), while character , X9CE5 is interpreted as Kanji liberal (see KanjiLiberal).

In all cases when canjis and may cause a confusion, it may have sense to use hexadecimal names X2FC3 and X9CE5.

References

  1. https://en.wikipedia.org/wiki/List_of_jōyō_kanji The jōyō kanji system of representing written Japanese consists of 2,136 characters. .. 1405 鳥 鳥 11 2 bird チョウ、とり chō, tori ..
  2. https://util.unicode.org/UnicodeJsps/character.jsp?a=2FC3&B1=Show Unicode Utilities: Character Properties 2FC3 KANGXI RADICAL BIRD Han Script id: allowed confuse:
  3. https://util.unicode.org/UnicodeJsps/character.jsp?a=9CE5 Unicode Utilities: Character Properties 9CE5 CJK UNIFIED IDEOGRAPH-9CE5 Han Script id: restricted confuse:

https://www.compart.com/en/unicode/U+9CE5

Keywords

Bird, Dump.t, Japanese, Kanji, Kanji liberal, KanjiLiberal, PHP Unicode, Utf8, Utf8table, UtfH, Tori, X2FC3, X9CE5, , ,