XF981

From TORI
Jump to: navigation, search

XF981 is unicode character number 63873, (& # 6 3 8 7 3;).

The three characters ,, look very similar. In addition, the Mediawiki engine, at least with the default setting, confuse the last two. For this reason, articles X2F25, X5973, XF981 are loaded; names of these articles are easy to distinguish both by Humans and by the software.

OnnaDeaw.png
Drawing of XF981 [1]

Encoding of

In Utf8 encoding, Character XF981 () is encoded with 3 bytes. These bytes are:
xEF xA6 x81 in the hexadecimal representation and
239 166 129 in the decimal representation.

In HTML, character can be generated as follows:
& # X F 9 8 1 ;
However, for the activation, the spaces should be removed from the formula above.

Vairous ways to encode character XF981 are suggested by the unicode table [2]:

Encoding hex dec (bytes) dec binary
UTF-8 EF A6 81 239 166 129 15705729 11101111 10100110 10000001
UTF-16BE F9 81 249 129 63873 11111001 10000001
UTF-16LE 81 F9 129 249 33273 10000001 11111001
UTF-32BE 00 00 F9 81 0 0 249 129 63873 00000000 00000000 11111001 10000001
UTF-32LE 81 F9 00 00 129 249 0 0 2180579328 10000001 11111001 00000000 00000000

Phonetic

In Japanese, XF981 () can be pronounced as おんな (onna).

Semantic

OnnaAtField.jpg

or or

Both in Chinese and Japanese languages, character XF981 has meaning "woman", Human female.

Synonyms: ,,

Characters
X2F25 ( , & # 1 2 0 6 9 ;)
X5973 ( , & # 2 2 8 9 9 ;)
XF981 ( , & # 6 3 8 7 3 ;)
are synonyms, and they are easy to confuse. [3]..

Similar confusion takes place with Cyrillic characters; for example,
A (& # 0 0 6 5 ;) and
А (& # 1 0 4 0 ;)
at some graphical interfaces look very similar.

The confusion of ,, can be shown with the PHP program below:

<?php 
function unichr($dec) {
  if ($dec < 128) {
    $utf = chr($dec);
  } else if ($dec < 2048) {
    $utf = chr(192 + (($dec - ($dec % 64)) / 64));
    $utf .= chr(128 + ($dec % 64));
  } else {
    $utf = chr(224 + (($dec - ($dec % 4096)) / 4096));
    $utf .= chr(128 + ((($dec % 4096) - ($dec % 64)) / 64));
    $utf .= chr(128 + ($dec % 64));
  }
  return $utf;
} 

function mb_str_split($str) {
   // split multibyte string in characters
   // at all positions except the start: ^
   // and the end: $
   $pattern = '/(?<!^)(?!$)/u';
   return preg_split($pattern,$str);
}

function uniord($a) 
{
  $M=strlen($a);
  $p=ord($a[0]);                    if($M==1) return $p;
  $p-=194;  $p*=64; $p+=ord($a[1]); if($M==2) return $p;
  $p-=2050; $p*=64; $p+=ord($a[2]);           return $p;
}

$a=unichr(0x2f25); echo "$a\n";
$a.=unichr(0x5973);echo "$a\n";
$a.=unichr(0xF981);echo "$a\n";

//$a='⼤,大;⼩,小'; # two pairs of different unicode characters separated with "," and ";"
//$a='⼥,女;AАᎪᗅA'; # different unicode characters separated with "," and ";"
//$a='⼥,女'; # pair of different unicode characters separated with coma

$N=strlen($a);
echo "The array has $N bytes; here is its splitting:\n";

for($n=0;$n<$N;$n++)
{
printf("%02x ",ord($a[$n]) );
}
echo "\n";

$b = mb_str_split($a);

var_dump($b);
$M=count($b);

#mb_internal_encoding("UTF-8");

for($m=0;$m<$M;$m++)
{
printf("\n");
$c=$b[$m];
$u=uniord($c);
printf("Unicode character number %05d id est, x%04X\n",$u,$u);
$d=strlen($c);
echo "Picture: $c uses $d bytes. These bytes are:\n";
for($n=0;$n<$d;$n++) printf("x%2X ",ord($c[$n]));
printf("in the hexadecimal representation and\n");
for($n=0;$n<$d;$n++) printf("%3d ",ord($c[$n]));
printf("in the decimal representation\n");
}
?>

This program uses portable PHP functions unichr.t, mb_str_split.t and uniord.t. The output is

⼥
⼥女
⼥女女
The array has 9 bytes; here is its splitting:
e2 bc a5 e5 a5 b3 ef a6 81 
array(3) {
  [0]=>
  string(3) "⼥"
  [1]=>
  string(3) "女"
  [2]=>
  string(3) "女"
}

Unicode character number 12069 id est, x2F25
Picture: ⼥ uses 3 bytes. These bytes are:
xE2 xBC xA5 in the hexadecimal representation and
226 188 165 in the decimal representation

Unicode character number 22899 id est, x5973
Picture: 女 uses 3 bytes. These bytes are:
xE5 xA5 xB3 in the hexadecimal representation and
229 165 179 in the decimal representation

Unicode character number 63873 id est, xF981
Picture: 女 uses 3 bytes. These bytes are:
xEF xA6 x81 in the hexadecimal representation and
239 166 129 in the decimal representation

X2F25, X5973, XF981

Not only Humans, but also the default mediawiki software confuse X5973 and XF981, redirecting from one to another. The same refers to various text editors, they confuse these characters. Such a case can be expressed with sentence: "If something is wrong, Cherchez la " [4][5][6].

Characters
X2F25, ( , & # x 2 f 2 5 ;) [7],
X5973 ( , & # x 5 9 7 3 ;) [8] and
XF981 ( , & # x F 9 8 1 ;) [9]
are easy to confuse. All the three appear with pictures similar to . For this reason, the content is repeated at articles X2F25, X5973, XF981. These names are easy to distinguish both by Humans and by the software.

Confusions

In order to avoid confusion, the hexadecimal name XF981 should be used instead of in all places, where the the difference between X2F25 (), X5973 (), XF981 () may be important. (For example, the link to article about X5973 may send to article about XF981, or vice versa, or the ink may just fail, as some software confuse X5973 and XF981; the article can be just lost, replaced to "redirect" to itself. In order to avoid such lost (replacement to fake "redirecr", descriptions of the 3 versions of kanji Onna are repeated in articles X2F25, X5973, XF981.

References

  1. https://jisho.org/search/%23kanji%20%E5%A5%B3 https://jisho.org/search/%23kanji_女 woman, female Kun: おんな、 め On: ジョ、 ニョ、 ニョウ Jōyō kanji, taught in grade 1 JLPT level N5 151 of 2500 most used kanji in newspapers On reading compounds 女 【ジョ】 woman, girl, daughter, Chinese "Girl" constellation (one of the 28 mansions) 女王 【ジョオウ】 queen, female champion 処女 【ショジョ】 virgin, maiden 一女 【イチジョ】 one daughter, eldest daughter, first-born daughter 女王 【ジョオウ】 queen, female champion 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest) 老若男女 【ロウニャクナンニョ】 men and women of all ages 天女 【テンニョ】 heavenly nymph, celestial maiden, beautiful and kind woman 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest) 女官 【ジョカン】 court lady, lady-in-waiting Kun reading compounds 女 【おんな】 female, woman, female sex, female lover, girlfriend, mistress, (someone's) woman 女形 【おんながた】 onnagata, male actor in female kabuki roles, female partner (in a relationship) 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon 囲い女 【かこいおんな】 mistress 雌 【め】 female, smaller (of the two), weaker, woman, wife 女神 【めがみ】 goddess, female deity 早乙女 【さおとめ】 young female rice planter, young girl 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon
  2. https://unicode-table.com/en/F981/ Ideograph woman, girl; feminine; rad. 38 CJK was approved as part of Unicode 1.1 in 1993. Properties Age 1.1 Block CJK Compatibility Ideographs Bidi Paired Bracket Type None Composition Exclusion No Case Folding F981 Simple Case Folding F981 Block CJK Compatibility Ideographs Unicode version: 1.1 (1993) Ideograph meaning Ideograph woman, girl; feminine; rad. 38 CJK. CJK Compatibility Ideographs. Ideograph woman, girl; feminine; rad. 38 CJK was approved as part of Unicode 1.1 in 1993. Properties Age 1.1 Block CJK Compatibility Ideographs Bidi Paired Bracket Type None Composition Exclusion No Case Folding F981 Simple Case Folding F981
  3. https://en.wiktionary.org/wiki/%E5%A5%B3 https://en.wiktionary.org/wiki/女
  4. https://archive.org/details/lesmohicansdepa02dumagoog/page/n243/mode/2up?view=theater Alexandre Dumas. Les Mohicans de Paris. 1874, p.332. .. L'huissier disparut par une porte, et revint presque aua* sitôt. — Dans deux minutes, M. Jackal est à vous. Effectivement, un instant après, la porté se rouvrit, et, avant que Ton vit encore personne, on entendit une voix qui criait : — Cherchez la femme, pardieu! cherchez la femme! Puis parut Thomme dont on venait d'entendre la voix. Essayons de tracer le portrait de M. Jackal...
  5. https://fr.wikipedia.org/wiki/Cherchez_la_femme « Cherchez la femme » est une expression connue sous sa forme française dans des ouvrages écrits en anglais, en italien et dans plusieurs autres langues. ..
  6. https://en.wikipedia.org/wiki/Cherchez_la_femme Cherchez la femme (French: [ʃɛʁʃe la fam]) is a French phrase which literally means 'look for the woman'. .. Il y a une femme dans toutes les affaires; aussitôt qu'on me fait un rapport, je dis: « Cherchez la femme ! »
  7. https://util.unicode.org/UnicodeJsps/character.jsp?a=2F25 2F25 KANGXI RADICAL WOMAN Han Script id: allowed confuse: 女 , 女
  8. https://util.unicode.org/UnicodeJsps/character.jsp?a=5973 女 5973 CJK UNIFIED IDEOGRAPH-5973 Han Script id: restricted confuse: 女 , ⼥
  9. https://util.unicode.org/UnicodeJsps/character.jsp?a=F981 F981 CJK COMPATIBILITY IDEOGRAPH-F981 Han Script id: allowed confuse: 女 , ⼥

Keywords

Chinese, Confusion, Female, Japanese, Kanji, KanjiLiberal, KanjiRadical, Onna, Utf8, Utf8table, UtfH, Woman

X2F25, X5973, XF981, ,