Difference between revisions of "XF981"
(Created page with "XF981 is unicode character number 63873, 女 (& # 6 3 8 7 3;). The three characters ⼥,女,女 look very similar. In addition, the M...") |
|||
Line 44: | Line 44: | ||
& # X F 9 8 1 ; <br> |
& # X F 9 8 1 ; <br> |
||
However, for the activation, the spaces should be removed from the formula above. |
However, for the activation, the spaces should be removed from the formula above. |
||
+ | |||
+ | Vairous ways to encode character [[XF981]] are suggested by the unicode table |
||
+ | <ref> |
||
+ | https://unicode-table.com/en/F981/ |
||
+ | Ideograph woman, girl; feminine; rad. 38 CJK was approved as part of Unicode 1.1 in 1993. |
||
+ | Properties |
||
+ | Age 1.1 |
||
+ | Block CJK Compatibility Ideographs |
||
+ | Bidi Paired Bracket Type None |
||
+ | Composition Exclusion No |
||
+ | Case Folding F981 |
||
+ | Simple Case Folding F981 |
||
+ | Block CJK Compatibility Ideographs |
||
+ | Unicode version: 1.1 (1993) |
||
+ | Ideograph meaning |
||
+ | Ideograph woman, girl; feminine; rad. 38 CJK. CJK Compatibility Ideographs. |
||
+ | Ideograph woman, girl; feminine; rad. 38 CJK was approved as part of Unicode 1.1 in 1993. |
||
+ | Properties |
||
+ | Age 1.1 |
||
+ | Block CJK Compatibility Ideographs |
||
+ | Bidi Paired Bracket Type None |
||
+ | Composition Exclusion No |
||
+ | Case Folding F981 |
||
+ | Simple Case Folding F981 |
||
+ | </ref>:<br> |
||
+ | <table> |
||
+ | <tr> |
||
+ | <th class="left">Encoding</th> |
||
+ | <th>hex</th> |
||
+ | <th>dec (bytes)</th> |
||
+ | <th>dec</th> |
||
+ | <th>binary</th> |
||
+ | </tr> |
||
+ | <tr> |
||
+ | <td class="left">UTF-8</td> |
||
+ | <td>EF A6 81</td> |
||
+ | <td>239 166 129</td> |
||
+ | <td>15705729</td> |
||
+ | <td>11101111 10100110 10000001</td> |
||
+ | </tr> |
||
+ | <tr> |
||
+ | <td class="left">UTF-16BE</td> |
||
+ | <td>F9 81</td> |
||
+ | <td>249 129</td> |
||
+ | <td>63873</td> |
||
+ | <td>11111001 10000001</td> |
||
+ | </tr> |
||
+ | <tr> |
||
+ | <td class="left">UTF-16LE</td> |
||
+ | <td>81 F9</td> |
||
+ | <td>129 249</td> |
||
+ | <td>33273</td> |
||
+ | <td>10000001 11111001</td> |
||
+ | </tr> |
||
+ | <tr> |
||
+ | <td class="left">UTF-32BE</td> |
||
+ | <td>00 00 F9 81</td> |
||
+ | <td>0 0 249 129</td> |
||
+ | <td>63873</td> |
||
+ | <td>00000000 00000000 11111001 10000001</td> |
||
+ | </tr> |
||
+ | <tr> |
||
+ | <td class="left">UTF-32LE</td> |
||
+ | <td>81 F9 00 00</td> |
||
+ | <td>129 249 0 0</td> |
||
+ | <td>2180579328</td> |
||
+ | <td>10000001 11111001 00000000 00000000</td> |
||
+ | </tr> |
||
+ | </table> |
||
==Phonetic== |
==Phonetic== |
||
Line 50: | Line 119: | ||
==Semantic== |
==Semantic== |
||
− | <div style="float:left;margin:- |
+ | <div style="float:left;margin:-80px 8px 0px -170px"> |
[[File:OnnaAtField.jpg|130px]]<br><!-- |
[[File:OnnaAtField.jpg|130px]]<br><!-- |
||
[[File:Onna1282725fragment.png|130px]]<br>!--> |
[[File:Onna1282725fragment.png|130px]]<br>!--> |
||
Line 59: | Line 128: | ||
Both in [[Chinese]] and [[Japanese]] languages, character [[XF981]] has meaning "woman", Human female. |
Both in [[Chinese]] and [[Japanese]] languages, character [[XF981]] has meaning "woman", Human female. |
||
+ | |||
==Synonyms: [[⼥]],[[女]],[[女]] == |
==Synonyms: [[⼥]],[[女]],[[女]] == |
||
Characters<br> |
Characters<br> |
||
Line 76: | Line 146: | ||
The confusion of [[⼥]],[[女]],[[女]] |
The confusion of [[⼥]],[[女]],[[女]] |
||
− | can be shown with the program below: |
+ | can be shown with the [[PHP]] program below: |
<pre> |
<pre> |
||
<?php |
<?php |
||
Line 109: | Line 179: | ||
} |
} |
||
− | $a=unichr(0x2f25); |
+ | $a=unichr(0x2f25); echo "$a\n"; |
− | echo "$a\n"; |
+ | $a.=unichr(0x5973);echo "$a\n"; |
− | $a.=unichr( |
+ | $a.=unichr(0xF981);echo "$a\n"; |
− | echo "$a\n"; |
||
− | $a.=unichr(0xF981); |
||
− | echo "$a\n"; |
||
//$a='⼤,大;⼩,小'; # two pairs of different unicode characters separated with "," and ";" |
//$a='⼤,大;⼩,小'; # two pairs of different unicode characters separated with "," and ";" |
||
Line 265: | Line 332: | ||
[[Female]], |
[[Female]], |
||
[[Japanese]], |
[[Japanese]], |
||
⚫ | |||
+ | [[KanjiLiberal]], |
||
+ | [[KanjiRadical]], |
||
[[Onna]], |
[[Onna]], |
||
[[Utf8]], |
[[Utf8]], |
||
Line 271: | Line 341: | ||
[[Woman]] |
[[Woman]] |
||
+ | [[X2F25]], |
||
[[X5973]], |
[[X5973]], |
||
⚫ | |||
[[XF981]], |
[[XF981]], |
||
[[⼥]], |
[[⼥]], |
||
Line 282: | Line 352: | ||
[[Category:Onna]] |
[[Category:Onna]] |
||
[[Category:Japanese]] |
[[Category:Japanese]] |
||
+ | [[Category:Kanji]] |
||
[[Category:Utf8]] |
[[Category:Utf8]] |
||
[[Category:Woman]] |
[[Category:Woman]] |
Latest revision as of 12:24, 5 June 2021
XF981 is unicode character number 63873, 女 (& # 6 3 8 7 3;).
The three characters ⼥,女,女 look very similar. In addition, the Mediawiki engine, at least with the default setting, confuse the last two. For this reason, articles X2F25, X5973, XF981 are loaded; names of these articles are easy to distinguish both by Humans and by the software.
Encoding of 女
In Utf8 encoding, Character XF981 (女) is encoded with 3 bytes. These bytes are:
xEF xA6 x81 in the hexadecimal representation and
239 166 129 in the decimal representation.
In HTML, character 女 can be generated as follows:
& # X F 9 8 1 ;
However, for the activation, the spaces should be removed from the formula above.
Vairous ways to encode character XF981 are suggested by the unicode table
[2]:
Encoding | hex | dec (bytes) | dec | binary |
---|---|---|---|---|
UTF-8 | EF A6 81 | 239 166 129 | 15705729 | 11101111 10100110 10000001 |
UTF-16BE | F9 81 | 249 129 | 63873 | 11111001 10000001 |
UTF-16LE | 81 F9 | 129 249 | 33273 | 10000001 11111001 |
UTF-32BE | 00 00 F9 81 | 0 0 249 129 | 63873 | 00000000 00000000 11111001 10000001 |
UTF-32LE | 81 F9 00 00 | 129 249 0 0 | 2180579328 | 10000001 11111001 00000000 00000000 |
Phonetic
In Japanese, XF981 (女) can be pronounced as おんな (onna).
Semantic
Both in Chinese and Japanese languages, character XF981 has meaning "woman", Human female.
Synonyms: ⼥,女,女
Characters
X2F25 ⼥ (⼥ , & # 1 2 0 6 9 ;)
X5973 女 (女 , & # 2 2 8 9 9 ;)
XF981 女 (女 , & # 6 3 8 7 3 ;)
are synonyms, and they are easy to confuse.
[3]..
Similar confusion takes place with Cyrillic characters; for example,
A (& # 0 0 6 5 ;
) and
А (& # 1 0 4 0 ;
)
at some graphical interfaces look very similar.
The confusion of ⼥,女,女 can be shown with the PHP program below:
<?php function unichr($dec) { if ($dec < 128) { $utf = chr($dec); } else if ($dec < 2048) { $utf = chr(192 + (($dec - ($dec % 64)) / 64)); $utf .= chr(128 + ($dec % 64)); } else { $utf = chr(224 + (($dec - ($dec % 4096)) / 4096)); $utf .= chr(128 + ((($dec % 4096) - ($dec % 64)) / 64)); $utf .= chr(128 + ($dec % 64)); } return $utf; } function mb_str_split($str) { // split multibyte string in characters // at all positions except the start: ^ // and the end: $ $pattern = '/(?<!^)(?!$)/u'; return preg_split($pattern,$str); } function uniord($a) { $M=strlen($a); $p=ord($a[0]); if($M==1) return $p; $p-=194; $p*=64; $p+=ord($a[1]); if($M==2) return $p; $p-=2050; $p*=64; $p+=ord($a[2]); return $p; } $a=unichr(0x2f25); echo "$a\n"; $a.=unichr(0x5973);echo "$a\n"; $a.=unichr(0xF981);echo "$a\n"; //$a='⼤,大;⼩,小'; # two pairs of different unicode characters separated with "," and ";" //$a='⼥,女;AАᎪᗅA'; # different unicode characters separated with "," and ";" //$a='⼥,女'; # pair of different unicode characters separated with coma $N=strlen($a); echo "The array has $N bytes; here is its splitting:\n"; for($n=0;$n<$N;$n++) { printf("%02x ",ord($a[$n]) ); } echo "\n"; $b = mb_str_split($a); var_dump($b); $M=count($b); #mb_internal_encoding("UTF-8"); for($m=0;$m<$M;$m++) { printf("\n"); $c=$b[$m]; $u=uniord($c); printf("Unicode character number %05d id est, x%04X\n",$u,$u); $d=strlen($c); echo "Picture: $c uses $d bytes. These bytes are:\n"; for($n=0;$n<$d;$n++) printf("x%2X ",ord($c[$n])); printf("in the hexadecimal representation and\n"); for($n=0;$n<$d;$n++) printf("%3d ",ord($c[$n])); printf("in the decimal representation\n"); } ?>
This program uses portable PHP functions unichr.t, mb_str_split.t and uniord.t. The output is
⼥ ⼥女 ⼥女女 The array has 9 bytes; here is its splitting: e2 bc a5 e5 a5 b3 ef a6 81 array(3) { [0]=> string(3) "⼥" [1]=> string(3) "女" [2]=> string(3) "女" } Unicode character number 12069 id est, x2F25 Picture: ⼥ uses 3 bytes. These bytes are: xE2 xBC xA5 in the hexadecimal representation and 226 188 165 in the decimal representation Unicode character number 22899 id est, x5973 Picture: 女 uses 3 bytes. These bytes are: xE5 xA5 xB3 in the hexadecimal representation and 229 165 179 in the decimal representation Unicode character number 63873 id est, xF981 Picture: 女 uses 3 bytes. These bytes are: xEF xA6 x81 in the hexadecimal representation and 239 166 129 in the decimal representation
X2F25, X5973, XF981
Not only Humans, but also the default mediawiki software confuse X5973 and XF981, redirecting from one to another. The same refers to various text editors, they confuse these characters. Such a case can be expressed with sentence: "If something is wrong, Cherchez la ⼥" [4][5][6].
Characters
X2F25, (⼥ , & # x 2 f 2 5 ;)
[7],
X5973 (女 , & # x 5 9 7 3 ;)
[8] and
XF981 (女 , & # x F 9 8 1 ;)
[9]
are easy to confuse.
All the three appear with pictures similar to ⼥.
For this reason, the content is repeated at articles
X2F25, X5973, XF981.
These names are easy to distinguish both by Humans and by the software.
Confusions
In order to avoid confusion, the hexadecimal name XF981 should be used instead of 女 in all places, where the the difference between X2F25 (⼥), X5973 (女), XF981 (女) may be important. (For example, the link to article about X5973 may send to article about XF981, or vice versa, or the ink may just fail, as some software confuse X5973 and XF981; the article can be just lost, replaced to "redirect" to itself. In order to avoid such lost (replacement to fake "redirecr", descriptions of the 3 versions of kanji Onna are repeated in articles X2F25, X5973, XF981.
References
- ↑ https://jisho.org/search/%23kanji%20%E5%A5%B3 https://jisho.org/search/%23kanji_女 woman, female Kun: おんな、 め On: ジョ、 ニョ、 ニョウ Jōyō kanji, taught in grade 1 JLPT level N5 151 of 2500 most used kanji in newspapers On reading compounds 女 【ジョ】 woman, girl, daughter, Chinese "Girl" constellation (one of the 28 mansions) 女王 【ジョオウ】 queen, female champion 処女 【ショジョ】 virgin, maiden 一女 【イチジョ】 one daughter, eldest daughter, first-born daughter 女王 【ジョオウ】 queen, female champion 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest) 老若男女 【ロウニャクナンニョ】 men and women of all ages 天女 【テンニョ】 heavenly nymph, celestial maiden, beautiful and kind woman 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest) 女官 【ジョカン】 court lady, lady-in-waiting Kun reading compounds 女 【おんな】 female, woman, female sex, female lover, girlfriend, mistress, (someone's) woman 女形 【おんながた】 onnagata, male actor in female kabuki roles, female partner (in a relationship) 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon 囲い女 【かこいおんな】 mistress 雌 【め】 female, smaller (of the two), weaker, woman, wife 女神 【めがみ】 goddess, female deity 早乙女 【さおとめ】 young female rice planter, young girl 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon
- ↑ https://unicode-table.com/en/F981/ Ideograph woman, girl; feminine; rad. 38 CJK was approved as part of Unicode 1.1 in 1993. Properties Age 1.1 Block CJK Compatibility Ideographs Bidi Paired Bracket Type None Composition Exclusion No Case Folding F981 Simple Case Folding F981 Block CJK Compatibility Ideographs Unicode version: 1.1 (1993) Ideograph meaning Ideograph woman, girl; feminine; rad. 38 CJK. CJK Compatibility Ideographs. Ideograph woman, girl; feminine; rad. 38 CJK was approved as part of Unicode 1.1 in 1993. Properties Age 1.1 Block CJK Compatibility Ideographs Bidi Paired Bracket Type None Composition Exclusion No Case Folding F981 Simple Case Folding F981
- ↑ https://en.wiktionary.org/wiki/%E5%A5%B3 https://en.wiktionary.org/wiki/女
- ↑ https://archive.org/details/lesmohicansdepa02dumagoog/page/n243/mode/2up?view=theater Alexandre Dumas. Les Mohicans de Paris. 1874, p.332. .. L'huissier disparut par une porte, et revint presque aua* sitôt. — Dans deux minutes, M. Jackal est à vous. Effectivement, un instant après, la porté se rouvrit, et, avant que Ton vit encore personne, on entendit une voix qui criait : — Cherchez la femme, pardieu! cherchez la femme! Puis parut Thomme dont on venait d'entendre la voix. Essayons de tracer le portrait de M. Jackal...
- ↑ https://fr.wikipedia.org/wiki/Cherchez_la_femme « Cherchez la femme » est une expression connue sous sa forme française dans des ouvrages écrits en anglais, en italien et dans plusieurs autres langues. ..
- ↑ https://en.wikipedia.org/wiki/Cherchez_la_femme Cherchez la femme (French: [ʃɛʁʃe la fam]) is a French phrase which literally means 'look for the woman'. .. Il y a une femme dans toutes les affaires; aussitôt qu'on me fait un rapport, je dis: « Cherchez la femme ! »
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=2F25 ⼥ 2F25 KANGXI RADICAL WOMAN Han Script id: allowed confuse: 女 , 女
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=5973 女 5973 CJK UNIFIED IDEOGRAPH-5973 Han Script id: restricted confuse: 女 , ⼥
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=F981 女 F981 CJK COMPATIBILITY IDEOGRAPH-F981 Han Script id: allowed confuse: 女 , ⼥