Difference between revisions of "鳥"

From TORI
Jump to navigation Jump to search
(Created page with "<div style="float:right; margin:-50px -14px 0px 8px"> 200px 200px </div> (X9CE5) is Unicode character...")
 
 
(6 intermediate revisions by the same user not shown)
Line 5: Line 5:
 
</div>
 
</div>
 
[[&#X9CE5;]] ([[X9CE5]]) is [[Unicode]] character number 40165.
 
[[&#X9CE5;]] ([[X9CE5]]) is [[Unicode]] character number 40165.
It appears as recommended kanju number 1405 in the List_of_jōyō_kanji
+
It appears as recommended kanji number 1405 in the List_of_jōyō_kanji
 
<ref>
 
<ref>
 
https://en.wikipedia.org/wiki/List_of_jōyō_kanji
 
https://en.wikipedia.org/wiki/List_of_jōyō_kanji
Line 57: Line 57:
 
[[&#X9CE5;]] ([[X9CE5]]) usually means a generic bird, without to indicate the specie ([[烏]],[[鳩]], etc.).
 
[[&#X9CE5;]] ([[X9CE5]]) usually means a generic bird, without to indicate the specie ([[烏]],[[鳩]], etc.).
   
Character [[&#X2FC3]] ([[X2FC3]]) has similar meaning
+
Character [[&#X2FC3;]] ([[X2FC3]]) has similar meaning
   
 
==Phonetic==
 
==Phonetic==
Line 63: Line 63:
 
[[&#X9CE5;]] ([[X9CE5]]) is usually pronounced as "tori", ”とり”。
 
[[&#X9CE5;]] ([[X9CE5]]) is usually pronounced as "tori", ”とり”。
   
Character [[&#X2FC3]] ([[X2FC3]]) has similar pronunciation.
+
Character [[&#X2FC3;]] ([[X2FC3]]) has similar pronunciation.
   
 
==Confusion==
 
==Confusion==
  +
With some software,
With some software, character [[]] looks very similar to character [[⿃]] ([[&#x2FC3;]]) and may be easy confused. these characters should be considered as synomyms.
 
  +
character [[&#X9CE5;]],[[X9CE5]] looks very similar to
 
 
character [[&#x2FC3;]],[[X2FC3]] and may be easy confused. these characters should be considered as synomyms.<!--
 
Character [[⿃]] ([[&#x2FC3;]]) can be generated with html entry<br>
 
Character [[⿃]] ([[&#x2FC3;]]) can be generated with html entry<br>
 
[[&#12227;]] (& # 1 2 2 2 7 ;)<br>
 
[[&#12227;]] (& # 1 2 2 2 7 ;)<br>
[[&#x2FC3;]] (& # x 2 F C 3 ;)
+
[[&#x2FC3;]] (& # x 2 F C 3 ;) !-->
   
 
Similarity of pictures, sound and meaning of characters
 
Similarity of pictures, sound and meaning of characters
Line 99: Line 100:
 
while character [[&#X9CE5;]], [[X9CE5]] is interpreted as [[Kanji liberal]] (see [[KanjiLiberal]]).
 
while character [[&#X9CE5;]], [[X9CE5]] is interpreted as [[Kanji liberal]] (see [[KanjiLiberal]]).
   
In all cases when use of [[canji]]s [[&#x2FC3;]] and
+
In all cases when [[canji]]s [[&#x2FC3;]] and
[[&#X9CE5;]] may cause a confusion,
+
[[&#X9CE5;]] may cause a confusion,
 
it may have sense to use hexadecimal names [[X2FC3]] and [[X9CE5]].
 
it may have sense to use hexadecimal names [[X2FC3]] and [[X9CE5]].
   
Line 121: Line 122:
 
[[UtfH]],
 
[[UtfH]],
 
[[Tori]],
 
[[Tori]],
[[X2FC3;]],
+
[[X2FC3]],
[[X9CE5;]],
+
[[X9CE5]],
 
[[&#X2FC3;]],
 
[[&#X2FC3;]],
 
[[&#X9CE5;]]
 
[[&#X9CE5;]]
Line 132: Line 133:
 
[[Category:Unicode]]
 
[[Category:Unicode]]
 
[[Category:Tori]],
 
[[Category:Tori]],
[[Category:X2FC3;]],
+
[[Category:X2FC3]]
[[Category:X9CE5;]],
+
[[Category:X9CE5]]
[[Category:&#X2FC3;]],
+
[[Category:&#X2FC3]]
[[Category:&#X9CE5;]]
+
[[Category:&#X9CE5]]

Latest revision as of 09:20, 12 August 2021

MargaritaOvsianka.jpg

6ppFlip.jpg

(X9CE5) is Unicode character number 40165. It appears as recommended kanji number 1405 in the List_of_jōyō_kanji [1]

Html input:
(& # 4 0 1 6 5 ;)
(& # x 9 C E 5 ;)

Encoding of and

The Utf8 encoding of character , X9CE5 and that of the similar character (, X2FC3) can be revealed by the PHP program dump.t with command

php dump.t ⿃鳥

In order to execute it, files dump.t, mb_str_split.t, unichr.t, uniord.t should be loaded. The output is

⿃鳥
The array has 6 bytes; here is its splitting:
e2 bf 83 e9 b3 a5 
array(2) {
  [0]=>
  string(3) "⿃"
  [1]=>
  string(3) "鳥"
}

Unicode character number 12227 id est, [[X2FC3]]
Picture: ⿃ ; uses 3 bytes. These bytes are:
xE2 xBF x83 in the hexadecimal representation and
226 191 131 in the decimal representation

Unicode character number 40165 id est, [[X9CE5]]
Picture: 鳥 ; uses 3 bytes. These bytes are:
xE9 xB3 xA5 in the hexadecimal representation and
233 179 165 in the decimal representation

Semantic

In Chinese and in Japanese languages, character (X9CE5) usually means a generic bird, without to indicate the specie (,, etc.).

Character (X2FC3) has similar meaning

Phonetic

in Japanese, character (X9CE5) is usually pronounced as "tori", ”とり”。

Character (X2FC3) has similar pronunciation.

Confusion

With some software, character ,X9CE5 looks very similar to character ,X2FC3 and may be easy confused. these characters should be considered as synomyms.

Similarity of pictures, sound and meaning of characters (X2FC3) and (X9CE5) may cause confusions: Some softwares treat them as the same, and some recognize the difference. For year 2021, these confusion are already reported [2][3].

In order to avoid confusion, character , X2FC3 is interpreted as Kanji radical (see KanjiRadical), while character , X9CE5 is interpreted as Kanji liberal (see KanjiLiberal).

In all cases when canjis and may cause a confusion, it may have sense to use hexadecimal names X2FC3 and X9CE5.

References

  1. https://en.wikipedia.org/wiki/List_of_jōyō_kanji The jōyō kanji system of representing written Japanese consists of 2,136 characters. .. 1405 鳥 鳥 11 2 bird チョウ、とり chō, tori ..
  2. https://util.unicode.org/UnicodeJsps/character.jsp?a=2FC3&B1=Show Unicode Utilities: Character Properties 2FC3 KANGXI RADICAL BIRD Han Script id: allowed confuse:
  3. https://util.unicode.org/UnicodeJsps/character.jsp?a=9CE5 Unicode Utilities: Character Properties 9CE5 CJK UNIFIED IDEOGRAPH-9CE5 Han Script id: restricted confuse:

https://www.compart.com/en/unicode/U+9CE5

Keywords

Bird, Dump.t, Japanese, Kanji, Kanji liberal, KanjiLiberal, PHP Unicode, Utf8, Utf8table, UtfH, Tori, X2FC3, X9CE5, , ,