Difference between revisions of "鳥"
(3 intermediate revisions by the same user not shown) | |||
Line 5: | Line 5: | ||
</div> |
</div> |
||
[[鳥]] ([[X9CE5]]) is [[Unicode]] character number 40165. |
[[鳥]] ([[X9CE5]]) is [[Unicode]] character number 40165. |
||
− | It appears as recommended |
+ | It appears as recommended kanji number 1405 in the List_of_jōyō_kanji |
<ref> |
<ref> |
||
https://en.wikipedia.org/wiki/List_of_jōyō_kanji |
https://en.wikipedia.org/wiki/List_of_jōyō_kanji |
||
Line 57: | Line 57: | ||
[[鳥]] ([[X9CE5]]) usually means a generic bird, without to indicate the specie ([[烏]],[[鳩]], etc.). |
[[鳥]] ([[X9CE5]]) usually means a generic bird, without to indicate the specie ([[烏]],[[鳩]], etc.). |
||
− | Character [[⿃]] ([[X2FC3]]) has similar meaning |
+ | Character [[⿃]] ([[X2FC3]]) has similar meaning |
==Phonetic== |
==Phonetic== |
||
Line 63: | Line 63: | ||
[[鳥]] ([[X9CE5]]) is usually pronounced as "tori", ”とり”。 |
[[鳥]] ([[X9CE5]]) is usually pronounced as "tori", ”とり”。 |
||
− | Character [[⿃]] ([[X2FC3]]) has similar pronunciation. |
+ | Character [[⿃]] ([[X2FC3]]) has similar pronunciation. |
==Confusion== |
==Confusion== |
||
With some software, |
With some software, |
||
− | character |
+ | character [[鳥]],[[X9CE5]] looks very similar to |
character [[⿃]],[[X2FC3]] and may be easy confused. these characters should be considered as synomyms.<!-- |
character [[⿃]],[[X2FC3]] and may be easy confused. these characters should be considered as synomyms.<!-- |
||
Character [[⿃]] ([[⿃]]) can be generated with html entry<br> |
Character [[⿃]] ([[⿃]]) can be generated with html entry<br> |
Latest revision as of 09:20, 12 August 2021
鳥 (X9CE5) is Unicode character number 40165. It appears as recommended kanji number 1405 in the List_of_jōyō_kanji [1]
Html input:
鳥 (& # 4 0 1 6 5 ;)
鳥 (& # x 9 C E 5 ;)
Encoding of ⿃ and 鳥
The Utf8 encoding of character 鳥, X9CE5 and that of the similar character (⿃, X2FC3) can be revealed by the PHP program dump.t with command
php dump.t ⿃鳥
In order to execute it, files dump.t, mb_str_split.t, unichr.t, uniord.t should be loaded. The output is
⿃鳥 The array has 6 bytes; here is its splitting: e2 bf 83 e9 b3 a5 array(2) { [0]=> string(3) "⿃" [1]=> string(3) "鳥" } Unicode character number 12227 id est, [[X2FC3]] Picture: ⿃ ; uses 3 bytes. These bytes are: xE2 xBF x83 in the hexadecimal representation and 226 191 131 in the decimal representation Unicode character number 40165 id est, [[X9CE5]] Picture: 鳥 ; uses 3 bytes. These bytes are: xE9 xB3 xA5 in the hexadecimal representation and 233 179 165 in the decimal representation
Semantic
In Chinese and in Japanese languages, character 鳥 (X9CE5) usually means a generic bird, without to indicate the specie (烏,鳩, etc.).
Character ⿃ (X2FC3) has similar meaning
Phonetic
in Japanese, character 鳥 (X9CE5) is usually pronounced as "tori", ”とり”。
Character ⿃ (X2FC3) has similar pronunciation.
Confusion
With some software, character 鳥,X9CE5 looks very similar to character ⿃,X2FC3 and may be easy confused. these characters should be considered as synomyms.
Similarity of pictures, sound and meaning of characters ⿃ (X2FC3) and 鳥 (X9CE5) may cause confusions: Some softwares treat them as the same, and some recognize the difference. For year 2021, these confusion are already reported [2][3].
In order to avoid confusion, character ⿃, X2FC3 is interpreted as Kanji radical (see KanjiRadical), while character 鳥, X9CE5 is interpreted as Kanji liberal (see KanjiLiberal).
In all cases when canjis ⿃ and 鳥 may cause a confusion, it may have sense to use hexadecimal names X2FC3 and X9CE5.
References
- ↑ https://en.wikipedia.org/wiki/List_of_jōyō_kanji The jōyō kanji system of representing written Japanese consists of 2,136 characters. .. 1405 鳥 鳥 11 2 bird チョウ、とり chō, tori ..
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=2FC3&B1=Show Unicode Utilities: Character Properties ⿃ 2FC3 KANGXI RADICAL BIRD Han Script id: allowed confuse: 鳥
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=9CE5 Unicode Utilities: Character Properties 鳥 9CE5 CJK UNIFIED IDEOGRAPH-9CE5 Han Script id: restricted confuse: ⿃
https://www.compart.com/en/unicode/U+9CE5
Keywords
Bird, Dump.t, Japanese, Kanji, Kanji liberal, KanjiLiberal, PHP Unicode, Utf8, Utf8table, UtfH, Tori, X2FC3, X9CE5, ⿃, 鳥,