XF981

From TORI
Revision as of 21:40, 25 May 2021 by T

XF981 is Unicode character number 63873 (& # 6 3 8 7 3 ;).

The three characters X2F25, X5973 and XF981 look very similar. In addition, the MediaWiki engine, at least with the default settings, confuses the last two. For this reason, the articles X2F25, X5973 and XF981 are loaded; the names of these articles are easy to distinguish for both humans and software.

OnnaDeaw.png
Drawing of XF981 [1]

Encoding of XF981

In UTF-8 encoding, character XF981 is encoded with 3 bytes. These bytes are:
xEF xA6 x81 in the hexadecimal representation and
239 166 129 in the decimal representation.
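The three bytes above follow from the general UTF-8 rule for code points between 0x0800 and 0xFFFF: the 16 bits are split into groups of 4, 6 and 6 bits, which are placed into the patterns 1110xxxx, 10xxxxxx and 10xxxxxx. A minimal PHP sketch of this arithmetic is given below; the helper name utf8_bytes_3 is hypothetical and is not part of this article's program.

```php
<?php
// Hypothetical helper (an illustration, not the article's code):
// encode one code point in the range 0x0800..0xFFFF as three UTF-8 bytes.
function utf8_bytes_3(int $cp): array {
    return [
        0xE0 | ($cp >> 12),          // 1110xxxx: top 4 bits
        0x80 | (($cp >> 6) & 0x3F),  // 10xxxxxx: middle 6 bits
        0x80 | ($cp & 0x3F),         // 10xxxxxx: low 6 bits
    ];
}

$bytes = utf8_bytes_3(0xF981);
printf("x%02X x%02X x%02X\n", ...$bytes);  // prints: xEF xA6 x81
printf("%d %d %d\n", ...$bytes);           // prints: 239 166 129
```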

In HTML, the character can be generated with the following character reference:
& # X F 9 8 1 ;
However, for it to take effect, the spaces must be removed from the expression above.
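The effect of such a character reference can also be checked outside a browser. The sketch below assumes PHP with its standard html_entity_decode function and decodes the reference (with the spaces removed) into the UTF-8 bytes of the character; the variable names are illustrative only.

```php
<?php
// The numeric character reference with the spaces removed.
$entity = '&#xF981;';

// html_entity_decode() turns the reference into a UTF-8 string.
$char = html_entity_decode($entity, ENT_QUOTES | ENT_HTML5, 'UTF-8');

// Show the resulting bytes in hexadecimal.
echo bin2hex($char), "\n";

// The decimal form of the reference decodes to the same string.
$same = ($char === html_entity_decode('&#63873;', ENT_QUOTES | ENT_HTML5, 'UTF-8'));
var_dump($same);
```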

Phonetic

In Japanese, XF981 can be pronounced as おんな (onna).

Semantic

OnnaAtField.jpg

X2F25 or X5973 or XF981

In both Chinese and Japanese, character XF981 means "woman", a human female.

Synonyms: X2F25, X5973, XF981

Characters
X2F25 (& # 1 2 0 6 9 ;)
X5973 (& # 2 2 8 9 9 ;)
XF981 (& # 6 3 8 7 3 ;)
are synonyms, and they are easy to confuse [2].

A similar confusion arises with Cyrillic characters; for example, Latin
A (& # 0 0 6 5 ;) and Cyrillic
А (& # 1 0 4 0 ;)
look very similar in some graphical interfaces.
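The Latin and Cyrillic letters above can be told apart by their bytes even when their pictures coincide. A short PHP sketch, assuming PHP 7 or later for the "\u{...}" escape syntax:

```php
<?php
$latin    = "\u{0041}";  // LATIN CAPITAL LETTER A, number 65
$cyrillic = "\u{0410}";  // CYRILLIC CAPITAL LETTER A, number 1040

var_dump($latin === $cyrillic);  // bool(false): the strings differ
echo bin2hex($latin), "\n";      // 41   (one byte)
echo bin2hex($cyrillic), "\n";   // d090 (two bytes)
```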

The confusion of X2F25, X5973 and XF981 can be shown with the program below:

<?php 
// Encode a code point below 65536 as a UTF-8 string of 1 to 3 bytes.
function unichr($dec) {
  if ($dec < 128) {
    $utf = chr($dec);
  } else if ($dec < 2048) {
    $utf = chr(192 + (($dec - ($dec % 64)) / 64));
    $utf .= chr(128 + ($dec % 64));
  } else {
    $utf = chr(224 + (($dec - ($dec % 4096)) / 4096));
    $utf .= chr(128 + ((($dec % 4096) - ($dec % 64)) / 64));
    $utf .= chr(128 + ($dec % 64));
  }
  return $utf;
} 

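// mb_str_split is built in since PHP 7.4 (mbstring extension);
// define the portable version only if it is missing, to avoid a redeclaration error.
if (!function_exists('mb_str_split')) {
  function mb_str_split($str) {
     // split multibyte string in characters
     // at all positions except the start: ^
     // and the end: $
     $pattern = '/(?<!^)(?!$)/u';
     return preg_split($pattern,$str);
  }
}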

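// Code point of one UTF-8 character of 1, 2 or 3 bytes (b0, b1, b2).
// 2 bytes: (b0-192)*64 + (b1-128) = (b0-194)*64 + b1
// 3 bytes: ((b0-224)*64 + (b1-128))*64 + (b2-128) = (p-2050)*64 + b2,
//          where p is the value after the 2-byte step.
function uniord($a) 
{
  $M=strlen($a);
  $p=ord($a[0]);                    if($M==1) return $p;
  $p-=194;  $p*=64; $p+=ord($a[1]); if($M==2) return $p;
  $p-=2050; $p*=64; $p+=ord($a[2]);           return $p;
}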

$a=unichr(0x2f25);
echo "$a\n";
$a.=unichr(0x5973);
echo "$a\n";
$a.=unichr(0xF981);
echo "$a\n";

//$a='⼤,大;⼩,小'; # two pairs of different unicode characters separated with "," and ";"
//$a='⼥,女;AАᎪᗅA'; # different unicode characters separated with "," and ";"
//$a='⼥,女'; # pair of different unicode characters separated with comma

$N=strlen($a);
echo "The array has $N bytes; here is its splitting:\n";

for($n=0;$n<$N;$n++)
{
  printf("%02x ",ord($a[$n]) );
}
echo "\n";

$b = mb_str_split($a);

var_dump($b);
$M=count($b);

#mb_internal_encoding("UTF-8");

for($m=0;$m<$M;$m++)
{
  printf("\n");
  $c=$b[$m];
  $u=uniord($c);
  printf("Unicode character number %05d id est, x%04X\n",$u,$u);
  $d=strlen($c);
  echo "Picture: $c uses $d bytes. These bytes are:\n";
  for($n=0;$n<$d;$n++) printf("x%2X ",ord($c[$n]));
  printf("in the hexadecimal representation and\n");
  for($n=0;$n<$d;$n++) printf("%3d ",ord($c[$n]));
  printf("in the decimal representation\n");
}
?>

This program uses the portable PHP functions unichr.t, mb_str_split.t and uniord.t. Its output is:

⼥
⼥女
⼥女女
The array has 9 bytes; here is its splitting:
e2 bc a5 e5 a5 b3 ef a6 81 
array(3) {
  [0]=>
  string(3) "⼥"
  [1]=>
  string(3) "女"
  [2]=>
  string(3) "女"
}

Unicode character number 12069 id est, x2F25
Picture: ⼥ uses 3 bytes. These bytes are:
xE2 xBC xA5 in the hexadecimal representation and
226 188 165 in the decimal representation

Unicode character number 22899 id est, x5973
Picture: 女 uses 3 bytes. These bytes are:
xE5 xA5 xB3 in the hexadecimal representation and
229 165 179 in the decimal representation

Unicode character number 63873 id est, xF981
Picture: 女 uses 3 bytes. These bytes are:
xEF xA6 x81 in the hexadecimal representation and
239 166 129 in the decimal representation
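The conversions done by unichr and uniord in the program above can also be cross-checked with PHP's built-in mb_chr and mb_ord. The sketch below assumes PHP 7.2 or later with the mbstring extension loaded; it is an independent check, not a replacement for the portable functions.

```php
<?php
// Assumes PHP 7.2+ with the mbstring extension.
foreach ([0x2F25, 0x5973, 0xF981] as $code) {
    $char = mb_chr($code, 'UTF-8');   // code point -> UTF-8 string
    $back = mb_ord($char, 'UTF-8');   // UTF-8 string -> code point
    printf("x%04X: %d bytes, %s, round trip %s\n",
        $code, strlen($char), bin2hex($char),
        $back === $code ? 'ok' : 'failed');
}
```

Each of the three code points should be reported with 3 bytes, matching the byte listings in the output above.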

X2F25, X5973, XF981

Not only humans but also the default MediaWiki software confuses X5973 and XF981, redirecting from one to the other. The same applies to various text editors, which also confuse these characters. Such a case can be expressed with the sentence: "If something is wrong, cherchez la XF981" [3][4][5].

Characters
X2F25 (& # x 2 f 2 5 ;) [6],
X5973 (& # x 5 9 7 3 ;) [7] and
XF981 (& # x F 9 8 1 ;) [8]
are easy to confuse. All three are displayed with very similar pictures. For this reason, the content is repeated in the articles X2F25, X5973 and XF981. These names are easy to distinguish for both humans and software.

Confusions

In order to avoid confusion, the hexadecimal name XF981 should be used instead of the character itself in all places where the difference between X2F25, X5973 and XF981 may be important. For example, a link to the article about X5973 may lead to the article about XF981, or vice versa, or the link may simply fail, because some software confuses X5973 and XF981; the article can even be lost, replaced with a "redirect" to itself. In order to avoid such a loss (replacement with a fake "redirect"), descriptions of the 3 versions of the kanji Onna are repeated in the articles X2F25, X5973 and XF981.
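One plausible cause of this confusion, stated here as an assumption rather than something this article confirms, is Unicode normalization. In the Unicode character data, XF981 has a canonical decomposition to X5973, so software that normalizes text to form NFC replaces XF981 with X5973, while X2F25 is changed only by the compatibility forms such as NFKC. The sketch below assumes PHP with the intl extension, which provides the Normalizer class.

```php
<?php
// Assumes the intl extension (class Normalizer) is installed.
$names = [
    'X2F25' => "\u{2F25}",
    'X5973' => "\u{5973}",
    'XF981' => "\u{F981}",
];

foreach ($names as $name => $char) {
    $nfc  = Normalizer::normalize($char, Normalizer::FORM_C);   // canonical
    $nfkc = Normalizer::normalize($char, Normalizer::FORM_KC);  // compatibility
    printf("%s: raw %s, NFC %s, NFKC %s\n",
        $name, bin2hex($char), bin2hex($nfc), bin2hex($nfkc));
}
```

Under this assumption, NFC keeps X2F25 distinct but merges XF981 into X5973, which would agree with the observation that the software confuses only the last two.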

References

  1. https://jisho.org/search/%23kanji%20%E5%A5%B3 https://jisho.org/search/%23kanji_女 woman, female. Kun: おんな、 め. On: ジョ、 ニョ、 ニョウ. Jōyō kanji, taught in grade 1; JLPT level N5; 151 of 2500 most used kanji in newspapers. On reading compounds: 女 【ジョ】 woman, girl, daughter, Chinese "Girl" constellation (one of the 28 mansions); 女王 【ジョオウ】 queen, female champion; 処女 【ショジョ】 virgin, maiden; 一女 【イチジョ】 one daughter, eldest daughter, first-born daughter; 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest); 老若男女 【ロウニャクナンニョ】 men and women of all ages; 天女 【テンニョ】 heavenly nymph, celestial maiden, beautiful and kind woman; 女官 【ジョカン】 court lady, lady-in-waiting. Kun reading compounds: 女 【おんな】 female, woman, female sex, female lover, girlfriend, mistress, (someone's) woman; 女形 【おんながた】 onnagata, male actor in female kabuki roles, female partner (in a relationship); 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon; 囲い女 【かこいおんな】 mistress; 雌 【め】 female, smaller (of the two), weaker, woman, wife; 女神 【めがみ】 goddess, female deity; 早乙女 【さおとめ】 young female rice planter, young girl.
  2. https://en.wiktionary.org/wiki/%E5%A5%B3 https://en.wiktionary.org/wiki/女
  3. https://archive.org/details/lesmohicansdepa02dumagoog/page/n243/mode/2up?view=theater Alexandre Dumas. Les Mohicans de Paris. 1874, p.332. .. L'huissier disparut par une porte, et revint presque aussitôt. — Dans deux minutes, M. Jackal est à vous. Effectivement, un instant après, la porte se rouvrit, et, avant que l'on vit encore personne, on entendit une voix qui criait : — Cherchez la femme, pardieu! cherchez la femme! Puis parut l'homme dont on venait d'entendre la voix. Essayons de tracer le portrait de M. Jackal...
  4. https://fr.wikipedia.org/wiki/Cherchez_la_femme « Cherchez la femme » est une expression connue sous sa forme française dans des ouvrages écrits en anglais, en italien et dans plusieurs autres langues. ..
  5. https://en.wikipedia.org/wiki/Cherchez_la_femme Cherchez la femme (French: [ʃɛʁʃe la fam]) is a French phrase which literally means 'look for the woman'. .. Il y a une femme dans toutes les affaires; aussitôt qu'on me fait un rapport, je dis: « Cherchez la femme ! »
  6. https://util.unicode.org/UnicodeJsps/character.jsp?a=2F25 2F25 KANGXI RADICAL WOMAN Han Script id: allowed confuse: 女 , 女
  7. https://util.unicode.org/UnicodeJsps/character.jsp?a=5973 女 5973 CJK UNIFIED IDEOGRAPH-5973 Han Script id: restricted confuse: 女 , ⼥
  8. https://util.unicode.org/UnicodeJsps/character.jsp?a=F981 F981 CJK COMPATIBILITY IDEOGRAPH-F981 Han Script id: allowed confuse: 女 , ⼥

Keywords

Chinese, Confusion, Female, Japanese, Onna, Utf8, Utf8table, UtfH, Woman

X5973, XF981, XF981, ,