Difference between revisions of "北"

From TORI
Jump to navigation Jump to search
 
(4 intermediate revisions by the same user not shown)
Line 21: Line 21:
 
</center>
 
</center>
 
</div></div>
 
</div></div>
[[&#X5317;]] [[X5317]] is [[inicode]] character, [[KanjiLiberal]].
+
[[&#X5317;]] [[X5317]] is [[Unicode]] character number 21271 , [[KanjiLiberal]].
   
 
In [[Japanese]], [[&#X5317;]] can be pronounced as [[kita]] and mean [[North]].
 
In [[Japanese]], [[&#X5317;]] can be pronounced as [[kita]] and mean [[North]].
Line 57: Line 57:
   
 
== Encoding ==
 
== Encoding ==
Character [[北]] ([[&#X5317;]], [[X5317]]) can be confused with other characters ([[&#XF963;]], [[XF963]]), that use similar pictures. To avoid confusions, some programming is necessary.
+
Character [[北]] ([[&#X5317;]], [[X5317]]) can be confused with other characters ([[&#XF963;]], [[XF963]]), that use similar pictures.
  +
Some software silently replace ([[&#XF963;]], [[XF963]]) to ([[&#X5317;]], [[X5317]]);
  +
no warning appears at the search or the copypast. This may cause mistakes, confusions:
  +
the two characters are sometimes treated as the same.
  +
To avoid confusions, some programming is necessary.
   
 
Encoding of character [[北]] and its analogies can be revealed with the [[PHP]] program [[ud.t]] . <br>
 
Encoding of character [[北]] and its analogies can be revealed with the [[PHP]] program [[ud.t]] . <br>
Line 104: Line 108:
 
</poem>
 
</poem>
   
The programs mentioned are designed to handle the [[Japanese]] characters in the [[Utf8]] endoding. The cover the [[unicode]] characters until [[XFFFF]] are covered; they all are encoded with 3 bytes (or less). Characters that require 4 bytes cannot be treated in the same a way: the programs should be upgraded.
+
The programs mentioned are designed to handle the [[Japanese]] characters in the [[Utf8]] endoding. The [[unicode]] characters until [[XFFFF]] are covered; they all are encoded with 3 bytes (or less).
  +
Unicode character number 194603 (&#194603; [[&#X2F82B;]], [[X2F82B]]) is reported
Perhaps, [[X2F82B]] (also reported to confuse with [[北]]) cannot be considered here.
 
  +
<ref>https://www.compart.com/en/unicode/U+2F82B
  +
Unicode Character “北” (U+2F82B)
  +
  +
Name: CJK Compatibility Ideograph-2F82B[1]
  +
Unicode Version: 3.1 (March 2001)[2]
  +
Block: CJK Compatibility Ideographs Supplement, U+2F800 - U+2FA1F[3]
  +
Plane: Supplementary Ideographic Plane, U+20000 - U+2FFFF[3]
  +
Script: Han (Hanzi, Kanji, Hanja) (Hani) [4]
  +
Category: Other Letter (Lo) [1]
  +
Bidirectional Class: Left To Right (L) [1]
  +
Combining Class: Not Reordered (0) [1]
  +
Character is Mirrored: No [1]
  +
HTML Entity:
  +
&#194603;
  +
&#x2F82B;
  +
UTF-8 Encoding: 0xF0 0xAF 0xA0 0xAB
  +
UTF-16 Encoding: 0xD87E 0xDC2B
  +
UTF-32 Encoding: 0x0002F82B
  +
Decomposition: 北 (U+5317)[1]
  +
Based on "北" (U+5317)
  +
U+F963
  +
  +
CJK Compatibility Ideograph-F963
  +
U+2F82B
  +
  +
CJK Compatibility Ideograph-2F82B
  +
</ref><ref name="x2f82b">
  +
https://util.unicode.org/UnicodeJsps/character.jsp?a=2F82B
  +
[[北]]
  +
2F82B
  +
CJK COMPATIBILITY IDEOGRAPH-2F82B
  +
Han Script
  +
id: allowed
  +
confuse: [[北]] , [[北]]
  +
</ref> to confuse with [[北]], but is not covered by the same algorithms.
   
 
==Semantic==
 
==Semantic==
Line 177: Line 216:
   
 
Unicode character [[X2F82B]] [[&#X2F82B;]]
 
Unicode character [[X2F82B]] [[&#X2F82B;]]
<ref>
+
<ref name="x2f82b">
 
https://util.unicode.org/UnicodeJsps/character.jsp?a=2F82B
 
https://util.unicode.org/UnicodeJsps/character.jsp?a=2F82B
 
[[北]]
 
[[北]]
Line 191: Line 230:
   
 
==Keywords==
 
==Keywords==
  +
[[Jisho]],
 
 
[[Japanese]],
 
[[Japanese]],
 
[[Kanji]],
 
[[Kanji]],
Line 207: Line 246:
   
 
[[Category:Japanese]]
 
[[Category:Japanese]]
  +
[[Category:Jisho]]
 
[[Category:Kanji]]
 
[[Category:Kanji]]
 
[[Category:KanjiConfudal]]
 
[[Category:KanjiConfudal]]

Latest revision as of 19:27, 6 August 2021

北西 北東
西 RoVe100.png
南西 南東

Fig.1.

X5317 is Unicode character number 21271 , KanjiLiberal.

In Japanese, can be pronounced as kita and mean North.

KanjiLiberal (X5317) is easy to confuse with
KanjiConfudal XF963; attempts to access redirect here.

KitaDraw.png

Fig.2. Draw of by Jisho [1]

Encoding

Character (, X5317) can be confused with other characters (, XF963), that use similar pictures. Some software silently replace (, XF963) to (, X5317); no warning appears at the search or the copypast. This may cause mistakes, confusions: the two characters are sometimes treated as the same. To avoid confusions, some programming is necessary.

Encoding of character and its analogies can be revealed with the PHP program ud.t .
For the execution, file uni.t also may be required.
then, command

php ud.t F963 5317 5357 897F 6771

returns the following output:


K= 6

F963 63843
Unicode character number 63843 id est, XF963
Picture:  ; uses 3 bytes. These bytes are:
XEF XA5 XA3 in the hexadecimal representation and
239 165 163 in the decimal representation

5317 21271
Unicode character number 21271 id est, X5317
Picture:  ; uses 3 bytes. These bytes are:
XE5 X8C X97 in the hexadecimal representation and
229 140 151 in the decimal representation

5357 21335
Unicode character number 21335 id est, X5357
Picture:  ; uses 3 bytes. These bytes are:
XE5 X8D X97 in the hexadecimal representation and
229 141 151 in the decimal representation

897F 35199 西 西
Unicode character number 35199 id est, X897F
Picture: 西 ; uses 3 bytes. These bytes are:
XE8 XA5 XBF in the hexadecimal representation and
232 165 191 in the decimal representation

6771 26481
Unicode character number 26481 id est, X6771
Picture:  ; uses 3 bytes. These bytes are:
XE6 X9D XB1 in the hexadecimal representation and
230 157 177 in the decimal representation

The programs mentioned are designed to handle the Japanese characters in the Utf8 endoding. The unicode characters until XFFFF are covered; they all are encoded with 3 bytes (or less). Unicode character number 194603 (北 北, X2F82B) is reported [2][3] to confuse with , but is not covered by the same algorithms.

Semantic

Hou1a.jpg

Fig.3. at top of [4]

indicates the specific direction, North. This is shown in Fig.1 and Fig.3.

Other 3 basic directions are denoted with kanjis 西,, .

Usually, at maps, is at the top.

Phonetic

can be pronounced as "Kita", きた ; but also as "Hoku", ホク [1].

Graphic

Jisho [1] suggests the drawing of shown in Fig.2.

Confusion

is easy to confuse with other Unicode characters. Pictures of the following characters are similar, and some silent (without any warning) redirections occurs at the search and/or copypasts:

KanjiLiberal X5317 [5]

KanjiConfudal XF963 [6]

Unicode character X2F82B 北 [3] (It takes more than 3 bytes in the Utf8 and cannot be shown at this site).

References

  1. 1.0 1.1 1.2 https://jisho.org/search/%23kanji%20%E5%8C%97
    https://jisho.org/search/%23kanji%20北
    5 strokes Radical: spoon 匕 Parts: 匕 爿 north Kun: きた On: ホク On reading compounds 北緯 【ホクイ】 north latitude 北欧 【ホクオウ】 Northern Europe, Nordic countries, Scandinavia 極北 【キョクホク】 extreme north, North Pole 西北 【セイホク】 north-west Kun reading compounds 北 【きた】 north, the North, northern territories, North Korea, north wind 北アイルランド 【きたアイルランド】 Northern Ireland 西北 【せいほく】 north-west Readings Japanese names: きら、 ほう、 ほっ、 ほつ
  2. https://www.compart.com/en/unicode/U+2F82B Unicode Character “北” (U+2F82B) 北 Name: CJK Compatibility Ideograph-2F82B[1] Unicode Version: 3.1 (March 2001)[2] Block: CJK Compatibility Ideographs Supplement, U+2F800 - U+2FA1F[3] Plane: Supplementary Ideographic Plane, U+20000 - U+2FFFF[3] Script: Han (Hanzi, Kanji, Hanja) (Hani) [4] Category: Other Letter (Lo) [1] Bidirectional Class: Left To Right (L) [1] Combining Class: Not Reordered (0) [1] Character is Mirrored: No [1] HTML Entity: 北 北 UTF-8 Encoding: 0xF0 0xAF 0xA0 0xAB UTF-16 Encoding: 0xD87E 0xDC2B UTF-32 Encoding: 0x0002F82B Decomposition: 北 (U+5317)[1] Based on "北" (U+5317) U+F963 北 CJK Compatibility Ideograph-F963 U+2F82B 北 CJK Compatibility Ideograph-2F82B
  3. 3.0 3.1 https://util.unicode.org/UnicodeJsps/character.jsp?a=2F82B 2F82B CJK COMPATIBILITY IDEOGRAPH-2F82B Han Script id: allowed confuse: ,
  4. https://kotobank.jp/word/方位-131742 .. デジタル大辞泉「方位」の解説 ほう‐い〔ハウヰ〕【方位】 1 ある方向が、基準の方向に対してどのようであるかの関係を表したもの。通常は子午線の方向を北・南、これに直角に交わる方向に東・西を定めた4方位を基準とし、その中間を北東・北西・南東・南西として加え8方位に、さらにその中間に北北東・南南西などをとり16方位に、さらに細分して32方位にして示す。古くは12の方向に分けて十二支を配し、北を子(ね)、北東を丑寅(うしとら)などとよんだ。天文学・測地学では、方位角を用いて表す。 2 各方角に陰陽・五行(ごぎょう)・十二支・八卦(はっけ)などを配し、それぞれに吉凶があるとする民間信仰。恵方(えほう)・金神(こんじん)・鬼門などの俗信を生んだ。「方位を見る」
  5. https://util.unicode.org/UnicodeJsps/character.jsp?a=5317 5317 CJK UNIFIED IDEOGRAPH-5317 Han Script id: restricted confuse: ,
  6. https://util.unicode.org/UnicodeJsps/character.jsp?a=F963 F963 CJK COMPATIBILITY IDEOGRAPH-F963 Han Script id: allowed confuse: ,

Keywords

Jisho, Japanese, Kanji, KanjiConfudal, KanjiLiberal, KanjiRadical, North, Unicode, X65B9 , X2F45 , , , 西, Cetegory:Unicode,,,