Difference between revisions of "Onna"
| (2 intermediate revisions by the same user not shown) | |||
| Line 1: | Line 1: | ||
{{top}} |
{{top}} |
||
| + | <div style="float:right;margin:-70px 0px 0px 0px;"> |
||
| − | Thank you. I made some corrections you suggest. |
||
| + | {{pic|OnnaDeaw.png|300px}}<br> |
||
| − | if you see more mistakes, let me know. |
||
| + | <center>Drawing of [[X2F25]], [[X5973]] or [[XF981]] <ref name="jisho"> |
||
| + | https://jisho.org/search/%23kanji%20%E5%A5%B3 |
||
| + | https://jisho.org/search/%23kanji_女 |
||
| + | woman, female |
||
| + | Kun: |
||
| + | [[おんな]]、 め |
||
| + | On: |
||
| + | ジョ、 ニョ、 ニョウ |
||
| + | Jōyō kanji, taught in grade 1 |
||
| + | JLPT level N5 |
||
| + | 151 of 2500 most used kanji in newspapers |
||
| + | On reading compounds |
||
| + | [[女]] 【ジョ】 woman, girl, daughter, Chinese "Girl" constellation (one of the 28 mansions) |
||
| + | 女王 【ジョオウ】 queen, female champion |
||
| + | 処女 【ショジョ】 virgin, maiden |
||
| + | 一女 【イチジョ】 one daughter, eldest daughter, first-born daughter |
||
| + | 女王 【ジョオウ】 queen, female champion |
||
| + | 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest) |
||
| + | 老若男女 【ロウニャクナンニョ】 men and women of all ages |
||
| + | 天女 【テンニョ】 heavenly nymph, celestial maiden, beautiful and kind woman |
||
| + | 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest) |
||
| + | 女官 【ジョカン】 court lady, lady-in-waiting |
||
| + | Kun reading compounds |
||
| + | 女 【おんな】 female, woman, female sex, female lover, girlfriend, mistress, (someone's) woman |
||
| + | 女形 【おんながた】 onnagata, male actor in female kabuki roles, female partner (in a relationship) |
||
| + | 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon |
||
| + | 囲い女 【かこいおんな】 mistress |
||
| + | 雌 【め】 female, smaller (of the two), weaker, woman, wife |
||
| + | 女神 【めがみ】 goddess, female deity |
||
| + | 早乙女 【さおとめ】 young female rice planter, young girl |
||
| + | 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon |
||
| + | </ref></center> |
||
| + | </div> |
||
| + | [[Onna]] is a set of three visually identical Unicode characters representing the kanji [[女]] (“woman”) that originate from different parts of the Unicode standard. |
||
| + | Term [[Onna]] ([[おんな]]) or "onna" may refer to one of the following three [[Unicode]] characters:<br> |
||
| − | Following your sample, I have updated article |
||
| + | 12069 ([[X2F25]], [[⼥]])<ref name="x2f25"/>, [[KanjiRadical]] <br> |
||
| − | https://mizugadro.mydns.jp/t/index.php/Onna |
||
| + | 22899 ([[X5973]], [[女]])<ref name="x5973"/>, [[KanjiLiberal]] <br> |
||
| + | 63873 ([[XF981]], [[女]])<ref name="xf981"/>, [[KanjiConfudal]] |
||
| + | [[Onna]] may refer also to the set of these 3 characters. |
||
| − | Can you criticize it? |
||
| − | I copypast the source below. |
||
| + | Often, these characters are pronounced as "[[Onna]]" and refer |
||
| − | =r= |
||
| + | to a [[woman]], Human female |
||
| − | Your **[[Onna]]** article is interesting and mostly technically correct. The structure is good and the examples help. I will give **constructive criticism in four categories**: |
||
| + | <ref name="jisho"/>, see picture below. |
||
| + | {{fig|HumanFemale276.jpg|96|-12|0|-20|[[Onna]], [[おんな]] <ref>https://commons.wikimedia.org/wiki/File:Human_female.jpg English: Naked [[female]] human body. Русский: Обнаженная женщина. English: Model name: (preferred not to be stated) At time of photograph: Age: 40 Height: 166 cm Weight: 47 kg BMI: 17.1 Ornaments: Ear piercing, ring on left ring finger (not in retouched images), nail polish on toe nails. There is some tilting of the upper trunk towards the left of the body, which may be positional or anatomical. Date 29 September 2011 Source Own work Author Taken at City Studios in Stockholm (www.stockholmsfotografen.se), September 29, 2011, with assistance from KYO (The organisation of life models) in Stockholm. ..</ref>}} |
||
| + | [[Character]]s of set [[Onna]] cause [[confusion]]s. |
||
| − | 1. **technical accuracy (Unicode facts)** |
||
| + | ==Confusion== |
||
| + | As of 2026, no unique Unicode number is assigned to the glyph [[⼥]]. |
||
| + | Unicode encodes abstract characters rather than graphical glyphs. |
||
| + | Therefore different characters may share identical visual representations in many fonts. This causes a confusion: different characters look the same. |
||
| + | Both teachers of [[Japanese]] and Japanese textbooks ignore this [[confusion]]. |
||
| − | 2. **language/grammar improvements** |
||
| + | A newcomer encounters the problem and first attributes it to his or her own mistake. |
||
| − | 3. **logic and tone** |
||
| + | Then novice notices that the error appears repeatedly, again and again, and, en fin, recognizes, that it is not his/her mistake, but the [[bug]] of the software. <!-- |
||
| + | the dirty trick on side of the font designers, committed in order to [[trump]] the newcomers, and both the teachers and the manuals make this difficulty harder, keeping it in secret from the pupils.!--> |
||
| + | The problem is related not only to learning [[Japanese]]. |
||
| − | 4. **structural suggestions** |
||
| + | Even specialists, even native Japanese speakers looking at characters |
||
| − | I will be direct so the article becomes stronger. |
||
| + | [[⼥]] |
||
| + | [[女]], |
||
| + | [[女]] are unlikely to guess:<br> |
||
| + | Which of them is character [[X2F25]] <ref name="x2f25"/>?<br> |
||
| + | Which of them is character [[X5973]] <ref name="x5973"/>?<br> |
||
| + | Which of them is character [[XF981]] <ref name="xf981"/>? |
||
| + | Term [[Onna]] is synonym of construction |
||
| − | == 1. Important technical clarification == |
||
| + | «[[⼥]] or [[女]] or [[女]]" |
||
| + | for cases, when the only view of the Kanji is available, and it is difficult to identify it. |
||
| + | Not only Humans, but also some software confuse the characters with similar or the same [[glyph]](s). |
||
| − | Your core statement is almost correct, but needs **one correction**. |
||
| + | The default [[Mediawiki]] software replaces character [[XF981]] to character [[X5973]] without any warning. <br> |
||
| − | You wrote that Unicode has **three characters for “onna”**: |
||
| + | This causes problems at automatic processing of data: in some cases the two objects are the same, and sometimes they are not. (Similar confusion takes place at the careless use of term «[[equality]]» applied to triangles in [[geometry]]). |
||
| − | * X2F25 ⼥ |
||
| − | * X5973 女 |
||
| − | * XF981 女 |
||
| + | In a text, that assumes any kind of citing, for example, copy-pasting |
||
| − | This is true **as code points**, but their roles are different. |
||
| + | into the address bar of a brouser or any search engine, |
||
| − | <pre> |
||
| + | the characters corresponding to the sound "[[onna]]" |
||
| − | | Code | Character | Role | |
||
| + | should be specified as |
||
| − | | ----- | --------- | ----------------------------- | |
||
| + | [[X2F25]], |
||
| − | | X5973 | 女 | real ideograph (normal kanji) | |
||
| + | [[X5973]], |
||
| − | | X2F25 | ⼥ | Kangxi radical symbol | |
||
| + | [[XF981]] rather than |
||
| − | | XF981 | 女 | compatibility ideograph | |
||
| + | [[⼥]], |
||
| − | </pre> |
||
| + | [[女]], |
||
| − | So the **primary character** is: |
||
| + | [[女]]: the software confuses the last two characters. |
||
| + | At least since year 2021, this confusion is recognized and described |
||
| − | X5973 女 |
||
| + | <ref name="x2f25"> |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=2F25 [[⼥]] 2F25 KANGXI RADICAL WOMAN Han Script id: allowed confuse: 女 , 女 |
||
| + | </ref><ref name="x5973"> |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=5973 [[女]] 5973 CJK UNIFIED IDEOGRAPH-5973 Han Script id: restricted confuse: 女 , ⼥ |
||
| + | </ref><ref name="xf981"> |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=F981 女 F981 CJK COMPATIBILITY IDEOGRAPH-F981 Han Script id: allowed confuse: [[女]] , [[⼥]] |
||
| + | </ref>. |
||
| + | [[Confusion]]s related to the apparent bugs with graphical representations of the [[Unicode]] characters are described in articles |
||
| − | The others exist for **compatibility with historical encodings**. |
||
| + | «[[Chikara]]», |
||
| + | «[[Miru]]», |
||
| + | «[[Onna]]», |
||
| + | «[[Sakana]]», |
||
| + | «[[StickPi]]», |
||
| + | «[[TsukiGatsu]]». |
||
| + | ==Unicode== |
||
| − | Unicode therefore does **not consider them separate characters in meaning**, only in encoding history. |
||
| + | At least three Unicode characters are qualified as «[[Onna]]».<br> |
||
| + | These characters are<!-- |
||
| + | [[X2F25]], |
||
| + | [[X5973]], |
||
| + | [[XF981]]: |
||
| + | !--> |
||
| + | <big><big>[[⼥]]</big></big> [[X2F25]], |
||
| + | <big><big>[[女]]</big></big> [[X5973]], |
||
| + | <big><big>[[女]]</big></big> [[XF981]]. |
||
| + | The [[Utf8]] encoding can be revealed with the [[PHP]] program [[onna.t]];<br> |
||
| − | You may want to add one sentence like: |
||
| + | it is copypasted below. File [[uni.t]] also may have need to be loaded. |
||
| + | <pre> |
||
| + | <?php |
||
| + | include "uni.t"; |
||
| + | $a=unichr(0x2f25); |
||
| + | $a.=unichr(0x5973); |
||
| + | $a.=unichr(0xF981); |
||
| + | echo "$a\n"; |
||
| + | $N=strlen($a); |
||
| + | echo "The array has $N bytes; here is its splitting:\n"; |
||
| + | for($n=0;$n<$N;$n++){ printf("%02x ",ord($a[$n]) ); } |
||
| + | echo "\n"; |
||
| + | $b = mb_str_split($a); |
||
| − | > Character [[X5973]] is the standard ideograph. |
||
| + | var_dump($b); |
||
| − | > Characters [[X2F25]] and [[XF981]] are compatibility forms used in radical lists or legacy encodings. |
||
| + | $M=count($b); |
||
| + | for($m=0;$m<$M;$m++) { |
||
| + | printf("\n"); |
||
| + | $c=$b[$m]; |
||
| + | $u=uniord($c); |
||
| + | printf("Unicode character number %05d id est, x%04X\n",$u,$u); |
||
| + | $d=strlen($c); |
||
| + | echo "Picture: $c uses $d bytes. These bytes are:\n"; |
||
| + | for($n=0;$n<$d;$n++) printf("x%2X ",ord($c[$n])); |
||
| + | printf("in the hexadecimal representation and\n"); |
||
| + | for($n=0;$n<$d;$n++) printf("%3d ",ord($c[$n])); |
||
| + | printf("in the decimal representation\n"); |
||
| + | } |
||
| + | ?> |
||
| + | </pre> |
||
| + | File [[uni.t]] also would be loaded for the execution; then, command |
||
| + | php onna.t |
||
| − | That will make the article **more academically accurate**. |
||
| + | produces the output below: |
||
| − | == 2. Grammar corrections == |
||
| + | <poem> |
||
| + | ⼥女女 |
||
| + | The array has 9 bytes; here is its splitting: |
||
| + | e2 bc a5 e5 a5 b3 ef a6 81 |
||
| + | array(3) { |
||
| + | [0]=> |
||
| + | string(3) "⼥" |
||
| + | [1]=> |
||
| + | string(3) "女" |
||
| + | [2]=> |
||
| + | string(3) "女" |
||
| + | } |
||
| + | Unicode character number 12069 id est, x2F25 |
||
| − | Several sentences can be improved slightly. |
||
| + | Picture: ⼥ uses 3 bytes. These bytes are: |
||
| + | xE2 xBC xA5 in the hexadecimal representation and |
||
| + | 226 188 165 in the decimal representation |
||
| + | Unicode character number 22899 id est, x5973 |
||
| − | ### A |
||
| + | Picture: 女 uses 3 bytes. These bytes are: |
||
| + | xE5 xA5 xB3 in the hexadecimal representation and |
||
| + | 229 165 179 in the decimal representation |
||
| + | Unicode character number 63873 id est, xF981 |
||
| − | You wrote: |
||
| + | Picture: 女 uses 3 bytes. These bytes are: |
||
| − | |||
| + | xEF xA6 x81 in the hexadecimal representation and |
||
| − | > To year 2026, yet, no unique Unicode number is assigned to glyph |
||
| + | 239 166 129 in the decimal representation |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > As of 2026, no unique Unicode number is assigned to the glyph |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### B |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > The newcomer meets the problem and first attributes it to his of her own mistake. |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > A newcomer encounters the problem and first attributes it to his or her own mistake. |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### C |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > Even a specialists |
||
| − | |||
| − | Correct form: |
||
| − | |||
| − | > Even a specialist |
||
| − | |||
| − | or |
||
| − | |||
| − | > Even specialists |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### D |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > the characters corresponding to sound onna |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > the characters corresponding to the sound *onna* |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### E |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > the software confuse the last two characters |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > the software confuses the last two characters |
||
| − | |||
| − | == 3. Tone (important) == |
||
| − | |||
| − | Your article occasionally becomes **very emotional**. For example: |
||
| − | |||
| − | > dirty trick on side of the font designers |
||
| − | |||
| − | > sabotage, silent conspiracy |
||
| − | |||
| − | For a **technical article**, this weakens credibility. |
||
| − | |||
| − | A neutral version would be stronger: |
||
| − | |||
| − | Instead of: |
||
| − | |||
| − | > dirty trick on side of the font designers |
||
| − | |||
| − | Write: |
||
| − | |||
| − | > a consequence of historical encoding decisions in Unicode and legacy character sets. |
||
| − | |||
| − | Instead of: |
||
| − | |||
| − | > sabotage |
||
| − | |||
| − | Write: |
||
| − | |||
| − | > an unfortunate compatibility artifact. |
||
| − | |||
| − | Readers will trust the article **much more**. |
||
| − | |||
| − | == 4. One conceptual point == |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > no unique Unicode number is assigned to glyph |
||
| − | |||
| − | Strictly speaking this is **normal in Unicode**. |
||
| − | |||
| − | Unicode encodes **characters**, not **glyphs**. |
||
| − | |||
| − | So multiple code points may intentionally share one glyph. |
||
| − | |||
| − | You might want to add a short explanation: |
||
| − | |||
| − | > Unicode encodes abstract characters rather than graphical glyphs. |
||
| − | > Therefore different characters may share identical visual representations in many fonts. |
||
| − | |||
| − | That will clarify the situation. |
||
| − | |||
| − | == 5. The UTF-8 section == |
||
| − | |||
| − | Your **PHP demonstration is excellent**. |
||
| − | It clearly shows that the characters have different byte sequences. |
||
| − | |||
| − | Example: |
||
| − | <poem> |
||
| − | ⼥ e2 bc a5 |
||
| − | 女 e5 a5 b3 |
||
| − | 女 ef a6 81 |
||
</poem> |
</poem> |
||
| + | The similar analysis can be performed with more universal dumping routine [[du.t]]; |
||
| − | This is actually a **very good technical illustration**. |
||
| + | here is the example of the use: |
||
| + | php du.t "⼥女女" |
||
| − | == 6. The censorship section == |
||
| + | In such a way, |
||
| − | This section is interesting socially, but it is **not strongly connected to the Unicode problem**. |
||
| + | a consequence of historical encoding decisions in Unicode and legacy character sets, together with ignorance of teachers and authors of manuals force the pupils to learn [[Unicode]], UTF8 and the programming in order to distinguish characters used in [[Japanese]] language. |
||
| + | ==[[Uniglif]] and [[Tarja]]== |
||
| − | You might consider either: |
||
| − | * shortening it, or |
||
| − | * creating a separate article about **Vestism and Onna**. |
||
| + | In the sci-fi utopia «[[Tartaria]]», the advanced font [[Uniglif]] is mentioned. |
||
| − | Otherwise the article changes from **typography → sociology** abruptly. |
||
| + | Each [[glyph]] in that font is assigned the unique [[Unicode]] number, providing the |
||
| + | bijective relation between the set of [[glyph]]s and the set of [[character]]s, at least for characters with number not exceeding XFFFF ([[ascii]], [[TwoByteCharacter]]s, [[ThreeByteCharacter]]s). With font [[Uniglif]], no confusions similar to that above appear. |
||
| + | In the real life, while no analogy of [[Uniglif]] is available, the technical language [[Tarja]] can be used to avoid confusions. |
||
| − | == 7. Small structural suggestion == |
||
| + | This is [[japanese]]-based technical [[slang]] that avoids ambiguous [[glyphs]], |
||
| + | avoid characters that are not yet supplied with default exclusive [[glyph]]s. |
||
| + | At the translation form [[Japanese]] to [[Tarja]], for characters |
||
| − | At the beginning you might add a **one-sentence definition**, similar to what you did in TsukiGatsu. |
||
| + | [[⼥]], |
||
| + | [[女]] and |
||
| + | [[女]], |
||
| + | their explicit numbers |
||
| + | [[X2F25]], |
||
| + | [[X5973]] and |
||
| + | [[XF981]] can be used. |
||
| + | Alternatively, the transliterations «[[おんな]]» or «[[Onna]]» of «[[onna]]» can be used. |
||
| − | Example: |
||
| + | In addition, words borrowed from other languages («[[Female]]», «[[Mujer]]», ..) |
||
| − | > **Onna** is a set of three visually identical Unicode characters representing the kanji 女 (“woman”) that originate from different parts of the Unicode standard. |
||
| + | can be used in [[Tarja]] when it causes no confusion. |
||
| + | ==Examples== |
||
| − | This helps readers immediately understand the page. |
||
| + | Dictionary [[Jisho]] suggest examples with sound [[onna]] |
||
| + | <ref> |
||
| + | https://jisho.org/search/%E5%A5%B3%20%E3%81%8A%E3%82%93%E3%81%AA%20%23words?page=2 |
||
| + | 女 おんな #words |
||
| + | .. |
||
| + | Words — 107 found |
||
| + | おんなおや 女親 |
||
| + | Links |
||
| + | Noun |
||
| + | 1. mother; female parent |
||
| + | Details ▸ |
||
| + | おんなざか 女坂 |
||
| + | Links |
||
| + | Noun |
||
| + | 1. the easier of two slopes |
||
| + | Details ▸ |
||
| + | おんなきょうだい 女兄弟 |
||
| + | Links |
||
| + | Noun |
||
| + | 1. sisters; female siblings |
||
| + | Other forms |
||
| + | 女姉妹 【おんなきょうだい】 |
||
| + | Details ▸ |
||
| + | おんなかぶき 女歌舞伎 |
||
| + | Links |
||
| + | Noun |
||
| + | 1. girls' kabuki |
||
| + | Details ▸ |
||
| + | .. |
||
| + | </ref>: |
||
| + | おんなざか 女坂 |
||
| − | == 8. Overall evaluation == |
||
| + | mother; female parent |
||
| + | おんなきょうだい 女兄弟 |
||
| − | Strengths: |
||
| + | the easier of two slopes |
||
| + | 女姉妹 【おんなきょうだい】 |
||
| − | ✔ clear concept |
||
| + | sisters; female siblings |
||
| − | ✔ good technical demonstration |
||
| − | ✔ interesting taxonomy of confusion sets |
||
| − | ✔ good references |
||
| + | おんなかぶき 女歌舞伎 |
||
| − | Things to improve: |
||
| + | girls' kabuki |
||
| − | • slightly neutralize tone |
||
| − | • add Unicode explanation about characters vs glyphs |
||
| − | • small grammar fixes |
||
| − | • possibly shorten the sociological section |
||
| + | ==[[Censorship]] and [[Vestism]]== |
||
| − | Your project is gradually becoming something like a **“taxonomy of Unicode glyph confusions.”** |
||
| + | <div class="thumb tright" style="float:right; background-color:#fff; margin:-20px 0px -4px 0px"> |
||
| − | That is actually a **very original idea**. |
||
| + | {{picx|Svetlana2.091.jpg|144px}} |
||
| + | {{picx|Svetlana4.101a.jpg|144px}} |
||
| + | {{picx|Svetlana6.110.jpg|144px}} |
||
| + | {{picx|Svetlana8.120a.jpg|144px}} |
||
| + | {{picx|Svetlana9.130b.jpg|144px}} |
||
| + | </div> |
||
| + | Objects and subjects, denoted with term [[onna]], often become targets of aggression and/or [[censorship]].<br> |
||
| + | [[vestism|Vestists]] insist, that the body should be hidden, and punish those who do not obey <ref> |
||
| + | https://edition.cnn.com/2023/09/21/middleeast/iran-hijab-law-parliament-jail-intl-hnk |
||
| + | Iranian women face 10 years in jail for inappropriate dress after ‘hijab bill’ approved |
||
| + | By [[Tara Subramaniam]], [[Adam Pourahmadi]] and [[Mostafa Salem]], [[CNN]]. |
||
| + | Published 12:34 PM EDT, Thu September 21, 2023</ref>. |
||
| + | Such a practice is described also in the sci-fi novel «[[Meganesia.Deportation]]». |
||
| + | The 5 pictures at right are designed to measure the hatred/tolerance of a [[religion]]. <br> |
||
| − | If you want, I can also show you **one extremely strange Unicode case related to 女** where **four different code points render exactly the same glyph in many fonts**. |
||
| + | Counting, how many of shown dressing styles are allowed by a [[religion]], gives the qualification of its tolerance with respect to [[onna]] in the 5 grade scale. |
||
| + | ==Reuse the [[glyph]]== |
||
| − | It is even more confusing than the three you already documented. |
||
| + | [[女]] appears as a component in various [[glyph]]s. |
||
| + | Few examples are suggested in this section. |
||
| + | [[36B2]] [[㚲]] <ref> |
||
| − | =r= |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=36B2 |
||
| − | Your revised article is **much stronger than the previous version**. The structure is clearer, the technical explanation about Unicode vs glyphs is good, and the UTF-8 example is convincing. I will give **precise criticism** again, but now it is mostly **fine-tuning**. |
||
| + | [[㚲]] |
||
| + | 36B2 |
||
| + | [[CJK]] UNIFIED IDEOGRAPH-36B2 |
||
| + | Han Script |
||
| + | confuse: none .. |
||
| + | (kDefinition) small and weak, used in girl's name, a woman's feature; lady's face .. |
||
| + | (kJapanese) セン|ショウ|テン .. |
||
| + | </ref> [[セン]], [[ショウ]], [[テン]], yan, ten, small, weak |
||
| + | [[597B]] [[奻]] <ref> |
||
| − | I will separate it into **(1) technical correctness, (2) language corrections, (3) style/logic improvements, (4) small factual notes**. |
||
| + | [[奻]] |
||
| − | |||
| + | 597B |
||
| − | ==1. Technical accuracy (very good overall)== |
||
| + | CJK UNIFIED IDEOGRAPH-597B |
||
| − | |||
| + | Han Script |
||
| − | Your definition at the beginning is now excellent: |
||
| + | confuse: none .. |
||
| − | |||
| + | (kJapanese) ダン|ナン .. |
||
| − | > Onna is a set of three visually identical Unicode characters representing the kanji 女 (“woman”) that originate from different parts of the Unicode standard. |
||
| + | </ref> [[ダン]], [[ナン]], dan, nan, quarrel, dispute |
||
| − | |||
| − | This is **accurate and clear**. |
||
| − | |||
| − | Your table: |
||
| − | * X2F25 ⼥ — Kangxi radical |
||
| − | * X5973 女 — unified ideograph |
||
| − | * XF981 女 — compatibility ideograph |
||
| − | |||
| − | is correct. |
||
| − | |||
| − | Your UTF-8 demonstration is also technically correct: |
||
| − | <pre> |
||
| − | | character | UTF-8 | |
||
| − | | --------- | -------- | |
||
| − | | ⼥ | E2 BC A5 | |
||
| − | | 女 | E5 A5 B3 | |
||
| − | | 女 | EF A6 81 | |
||
| − | </pre> |
||
| − | This is a **very good example** showing the difference between codepoints. |
||
| − | |||
| − | == 2. Language corrections== |
||
| − | |||
| − | A few sentences still sound slightly unnatural in English. |
||
| − | |||
| − | ### A |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > both the teachers of Japanese and the manuals of Japanese just ignore the confusion. |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > both teachers of Japanese and Japanese textbooks usually ignore this confusion. |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### B |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > the dirty trick on side of the font designers |
||
| − | |||
| − | Better neutral wording: |
||
| − | |||
| − | > an unfortunate consequence of font design and historical encoding decisions. |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### C |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > the novice sees, that the error reproduces |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > the novice notices that the error appears repeatedly |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### D |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > automatic treat of data |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > automatic processing of data |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### E |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > copypasting to frame of a brouser |
||
| − | |||
| − | Two corrections: |
||
| − | |||
| − | > copy-pasting into the address bar of a browser |
||
| − | |||
| − | --- |
||
| − | |||
| − | ### F |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > ChatGPT indicates, that there was not bad will |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > ChatGPT indicates that there was no bad intent |
||
| − | |||
| − | == 3. Tone (still slightly emotional)== |
||
| − | |||
| − | You improved it a lot, but a few phrases still sound **too accusatory**. |
||
| − | |||
| − | For example: |
||
| − | |||
| − | > ignorance of teachers and authors of manuals |
||
| − | |||
| − | That may provoke disagreement. |
||
| − | |||
| − | Safer wording: |
||
| − | |||
| − | > the issue is rarely discussed in teaching materials. |
||
| − | |||
| − | == 4. Minor factual note == |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | > At least three Unicode characters are related to sound Onna. |
||
| − | |||
| − | Strictly speaking: |
||
| − | |||
| − | Unicode characters are **not related to sounds**; they represent **characters**, not pronunciations. |
||
| − | |||
| − | More accurate wording: |
||
| − | |||
| − | > At least three Unicode characters correspond to the kanji 女, which can be pronounced *onna*. |
||
| − | |||
| − | == 5. Small typo == |
||
| − | |||
| − | You wrote: |
||
| − | |||
| − | ``` |
||
| − | ThrteeByteCharacter |
||
| − | ``` |
||
| − | |||
| − | should be |
||
| − | |||
| − | ``` |
||
| − | ThreeByteCharacter |
||
| − | ``` |
||
| − | |||
| − | == 6. Very interesting section: “Reuse the glyph” == |
||
| − | |||
| − | This is actually **one of the best parts** of the article. |
||
| − | |||
| − | Your examples: |
||
| − | |||
| − | * 妙 |
||
| − | * 妹 |
||
| − | * 姉 |
||
| − | * 好 |
||
| − | * 姦 |
||
| − | * 安 |
||
| − | |||
| − | show how 女 appears as a **component** in many characters. |
||
| − | |||
| − | That section demonstrates the **graphical importance of the glyph** well. |
||
| − | |||
| − | == 7. Historical section == |
||
| − | |||
| − | Your historical explanation is mostly correct, but one sentence should be softened: |
||
| − | |||
| − | > Unicode and many default fonts had been designed in century 20, while computation had been underdeveloped. |
||
| − | |||
| − | Better: |
||
| − | |||
| − | > Unicode was designed in the late 20th century when compatibility with existing character encodings was a major concern. |
||
| − | |||
| − | Because the goal was **compatibility**, not lack of computing power. |
||
| − | |||
| − | == 8. The censorship section == |
||
| − | |||
| − | It is interesting but **philosophical rather than technical**. |
||
| − | |||
| − | You might consider adding one sentence to connect it: |
||
| − | |||
| − | Example: |
||
| − | |||
| − | > Because the character 女 refers to women, discussions about this character sometimes intersect with broader cultural debates about gender and social norms. |
||
| − | |||
| − | That makes the transition less abrupt. |
||
| − | |||
| − | == 9. Overall evaluation == |
||
| − | |||
| − | Your **Unicode-confusion taxonomy project** is becoming clearer. |
||
| − | |||
| − | You now have several families: |
||
| − | <pre> |
||
| − | | page | confusion | |
||
| − | | ---------- | ------------- | |
||
| − | | TsukiGatsu | 月 radicals | |
||
| − | | Onna | 女 radicals | |
||
| − | | Sakana | fish radical | |
||
| − | | Chikara | power radical | |
||
| − | </pre> |
||
| − | This is actually a **very interesting catalog of Unicode visual equivalence sets**. |
||
| + | [[597D]] [[好]] <ref> |
||
| − | Very few resources document them systematically. |
||
| + | [[好]] |
||
| + | 597D |
||
| + | [[CJK]] UNIFIED IDEOGRAPH-597D |
||
| + | Han Script |
||
| + | confuse: none .. |
||
| + | (kDefinition) good, excellent, fine; well .. |
||
| + | (kJapanese) コウ|このむ|すく|よい|よし .. |
||
| + | </ref> [[コウ]], [[このむ]], [[すく]], [[よい]], [[よし]], yoi, good |
||
| + | [[5999]] [[妙]] <ref> |
||
| − | ==✅ **Overall assessment**== |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=5999 |
||
| + | [[妙]] |
||
| + | 5999 |
||
| + | [[CJK]] UNIFIED IDEOGRAPH-5999 |
||
| + | Han Script |
||
| + | confuse: none .. |
||
| + | (kDefinition) mysterious, subtle; exquisite |
||
| + | (kJapanese) ミョウ|ビョウ|たえ |
||
| + | </ref> [[ミョウ]], [[ビョウ]], [[たえ]], miyou, mysterious, strange |
||
| + | [[59B9]] [[妹]] <ref> |
||
| − | Strengths: |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=59B9 |
||
| − | * clear definition |
||
| + | [[妹]] |
||
| − | * correct Unicode analysis |
||
| + | 59B9 |
||
| − | * strong UTF-8 example |
||
| + | [[CJK]] UNIFIED IDEOGRAPH-59B9 |
||
| − | * good references |
||
| + | Han Script |
||
| − | * consistent wiki structure |
||
| + | confuse: none .. |
||
| + | (kDefinition) younger sister .. |
||
| + | (kJapanese) マイ|バイ|メ|いもうと|いも .. |
||
| + | </ref> |
||
| + | [[マイ]], [[バイ]], [[メ]], [[いもうと]], [[いも]], [[imouto]], younger sister |
||
| + | [[59C9]] [[姉]] |
||
| − | Remaining improvements: |
||
| + | <ref> |
||
| − | * slightly soften tone |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=59C9 |
||
| − | * fix a few English phrases |
||
| + | [[姉]] |
||
| − | * small technical wording adjustments |
||
| + | 59C9 |
||
| + | [[CJK]] UNIFIED IDEOGRAPH-59C9 |
||
| + | Han Script |
||
| + | confuse: none .. |
||
| + | (kDefinition) elder sister .. |
||
| + | (kJapanese) シ|あね|ねえさん .. |
||
| + | </ref> シ, [[あね]], [[ねえさん]], vasan, elder sister |
||
| + | [[59E6]] [[姦]] <ref> |
||
| − | If you want, I can also show you something **extremely curious about 女 in Chinese character evolution**: |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=59E6 |
||
| + | [[姦]] |
||
| + | 59E6 |
||
| + | [[CJK]] UNIFIED IDEOGRAPH-59E6 |
||
| + | Han Script |
||
| + | confuse: none .. |
||
| + | (kDefinition) adultery, debauchery; debauch .. |
||
| + | (kJapanese) カン|ケン|かしましい|みだら |
||
| + | </ref> : [[カン]], [[ケン]], [[かしましい]], [[みだら]] , [[midara]], making 3 women at once. |
||
| + | [[5B89]] [[安]] <ref> |
||
| − | there exists a **kanji built from three 女 characters simultaneously**, and its original meaning is quite surprising. |
||
| + | https://util.unicode.org/UnicodeJsps/character.jsp?a=5B89 |
||
| + | [[安]] |
||
| + | 5B89 |
||
| + | [[CJK]] UNIFIED IDEOGRAPH-5B89 |
||
| + | Han Script |
||
| + | confuse: none .. |
||
| + | (kDefinition) peaceful, tranquil, quiet .. |
||
| + | (kJapanese) アン|やすい|いずくに|いずくにか|いずくんぞ|やすんじる .. |
||
| + | </ref> [[アン]], [[やすい]], [[いずくに]], [[いずくにか]], [[いずくにか]], [[いずくんぞ]], [[やすんじる]], yasashi, cheap. |
||
| + | ==Historic context== |
||
| − | ==tori== |
||
| + | The [[Unicode]] and the many default fonts had been designed in century 20, while the computation had been underdeveloped. The printing techniques, contrary, already existed during centuries. This predetermined the attitude of the designers to the encoding and too fonts. The goal wad to reproduce the required glyph, on the screen or in the printing; |
||
| + | it was supposed that nobody cares, how is it encoded. |
||
| + | In Century 21, the roles of a glyph and that of a character swap. |
||
| − | Thank you, I make few more corrections. |
||
| + | The character becomes the principal part of the textual information; |
||
| + | the [[glyph]]es are still needed for the Human reception of characters. |
||
| + | Then, the lack of the unique encoding for a glyph becomes a problem; one needs some programming (see the example above) to patch the defects of the historic combination of the Unicode with existing fonts. |
||
| − | >" Because the character 女 refers to women, discussions about this character sometimes intersect with broader cultural debates about gender and social norms." |
||
| + | ChatGPT indicates, that there was no bad will of the designers of the Unicode, |
||
| − | "sometomes"?? |
||
| + | nor that of teachers of Japanese and authors of the manuals. |
||
| − | Do you know at least one example where it does not intersect? |
||
| + | They assumed, that their students never begin to write (nor analyze) [[character]]s in Japanese, |
||
| − | How do you interpret the proverb «Cherchez la femme»? |
||
| + | and never meet errors relates to the confusing graphical representation of the characters. |
||
| + | ==Warning== |
||
| − | Why do we need such an explanation? Is not it obvious? |
||
| − | We need pictures to show the semantic of the term. |
||
| − | In order push the Humanity to the Barbarian style, the [[offee]]s prohibit some pictures. |
||
| − | It is kind of [[insider trading]]; the bad guys try to keep some information for the private use. |
||
| − | For the same reason the Soviet [[offee]]s tried to keep in secret the approaching [[collapse of USSR]]. |
||
| − | For the same reason the Soviet [[offee]]s kept is secret the map of density of ionizing radiation after the Chernobyl. |
||
| − | For the same reason the Japanese [[offee]]s tried to do the same after Fukushima. |
||
| − | For the same reason the USA [[offee]]s keep in secret the origin of the full-scale USA-Iran war. |
||
| − | Amassing, some authors still pretend that they believe that the war had been triggered by the menaces by some anonymous "[[Iranian negotiators]]". |
||
| + | Publications about characters of the [[Onna]] set are collected and analyzed in TORI with scientific goals. |
||
| − | Fight for females is typical for animals; the [[offee]]s are not an exception. |
||
| + | The analysis and the interpretation above should not be interpreted as an appeal for the extrajudicial execution of the font/unicode designers who did not supply some popular [[glyph]]s with unique [[Unicode]] numbers. |
||
| − | > "the issue is rarely discussed in teaching materials. " |
||
| + | The more civilized solution would be to convince them to develop some realistic default analogy of the fantastic [[Uniglif]], the font with bijective relation between [[glyph]]s and [[character]]s. |
||
| − | Can you find at lest one textbook on Japanese that alerts Students, that it is not sufficient to remember the view of a Kanji, but also need its number or its encoding, because some Kanji do not have an unique encoding yet? |
||
| + | The description above may require correction(s) by a native [[Japanese]] speaker. |
||
| − | While we did not see any crocodile flying, it is not a good style to write "Crocodiles are very rare to fly". |
||
| + | ==References== |
||
| − | >"**kanji built from three 女 characters simultaneously**, and its original meaning is quite surprising." |
||
| + | {{ref}} |
||
| + | {{fer}} |
||
| − | - There is nothing surprising. The util.unicode indicates the sound "Mi". This sound revers to number 3. It confirms that a woman has at least 3 partners simultaneously; I even found the corresponding picture and used it as illustration in article «[[姦]]». KGB (Roskomndzor) dislike my interpretations and, instead of to argue, they attack my server. I hope for more academic behavior from your side, even if you do not agree with my interpretation. |
||
| + | ==Keywords== |
||
| − | Taking into account the population of China, one can guess that the Chinese people are very skillful in various combinations of partners. I suspect, they keep these traditions and skills during many kilo years. Perhaps, even before Japanese people appeared as a nation. |
||
| + | «[[Bijective graphical representation]]», |
||
| + | «[[Chinese]]», |
||
| + | «[[Confusion]]», |
||
| + | «[[Female]]», |
||
| + | «[[Japanese]]», |
||
| + | «[[Kanji]]», |
||
| + | «[[KanjiConfudal]]», |
||
| + | «[[KanjiLiberal]]», |
||
| + | «[[KanjiRadical]]», |
||
| + | «[[Onna]]», |
||
| + | «[[SomeU]]», |
||
| + | «[[Tarja]]», |
||
| + | «[[Unicode]]», |
||
| + | «[[Uniglif]]», |
||
| + | «[[Utf8]]», |
||
| + | «[[Utf8table]]», |
||
| + | «[[UtfH]]», |
||
| + | «[[Woman]]», |
||
| + | «[[X2F25]]» «[[⼥]]», |
||
| + | «[[X5973]]» «[[女]]», |
||
| + | «[[XF981]]» «[[女]]», |
||
| + | [[Category:Chinese]] |
||
| − | In such a way I express my doubts in your qualification of the original meaning as "quite surprising." |
||
| + | [[Category:Confusion]] |
||
| + | [[Category:Female]] |
||
| + | [[Category:Onna]] |
||
| + | [[Category:Japanese]] |
||
| + | [[Category:Kanji]] |
||
| + | [[Category:KanjiConfudal]] |
||
| + | [[Category:KanjiLiberal]] |
||
| + | [[Category:KanjiRadical]] |
||
| + | [[Category:Utf8]] |
||
| + | [[Category:Woman]] |
||
| + | [[Category:XF981]] |
||
Latest revision as of 10:15, 17 March 2026
Onna is a set of three visually identical Unicode characters representing the kanji 女 (“woman”) that originate from different parts of the Unicode standard.
Term Onna (おんな) or "onna" may refer to one of the following three Unicode characters:
12069 (X2F25, ⼥)[2], KanjiRadical
22899 (X5973, 女)[3], KanjiLiberal
63873 (XF981, 女)[4], KanjiConfudal
Onna may refer also to the set of these 3 characters.
Often, these characters are pronounced as "Onna" and refer to a woman, Human female [1], see picture below.
Characters of set Onna cause confusions.
Confusion
As of 2026, no unique Unicode number is assigned to the glyph ⼥. Unicode encodes abstract characters rather than graphical glyphs. Therefore different characters may share identical visual representations in many fonts. This causes a confusion: different characters look the same.
Both teachers of Japanese and Japanese textbooks ignore this confusion.
A newcomer encounters the problem and first attributes it to his or her own mistake. Then novice notices that the error appears repeatedly, again and again, and, en fin, recognizes, that it is not his/her mistake, but the bug of the software.
The problem is related not only to learning Japanese.
Even specialists, even native Japanese speakers looking at characters
⼥
女,
女 are unlikely to guess:
Which of them is character X2F25 [2]?
Which of them is character X5973 [3]?
Which of them is character XF981 [4]?
Term Onna is synonym of construction «⼥ or 女 or 女" for cases, when the only view of the Kanji is available, and it is difficult to identify it.
Not only Humans, but also some software confuse the characters with similar or the same glyph(s).
The default Mediawiki software replaces character XF981 to character X5973 without any warning.
This causes problems at automatic processing of data: in some cases the two objects are the same, and sometimes they are not. (Similar confusion takes place at the careless use of term «equality» applied to triangles in geometry).
In a text, that assumes any kind of citing, for example, copy-pasting into the address bar of a brouser or any search engine, the characters corresponding to the sound "onna" should be specified as X2F25, X5973, XF981 rather than ⼥, 女, 女: the software confuses the last two characters.
At least since year 2021, this confusion is recognized and described [2][3][4].
Confusions related to the apparent bugs with graphical representations of the Unicode characters are described in articles «Chikara», «Miru», «Onna», «Sakana», «StickPi», «TsukiGatsu».
Unicode
At least three Unicode characters are qualified as «Onna».
These characters are
⼥ X2F25,
女 X5973,
女 XF981.
The Utf8 encoding can be revealed with the PHP program onna.t;
it is copypasted below. File uni.t also may have need to be loaded.
<?php
include "uni.t";
$a=unichr(0x2f25);
$a.=unichr(0x5973);
$a.=unichr(0xF981);
echo "$a\n";
$N=strlen($a);
echo "The array has $N bytes; here is its splitting:\n";
for($n=0;$n<$N;$n++){ printf("%02x ",ord($a[$n]) ); }
echo "\n";
$b = mb_str_split($a);
var_dump($b);
$M=count($b);
for($m=0;$m<$M;$m++) {
printf("\n");
$c=$b[$m];
$u=uniord($c);
printf("Unicode character number %05d id est, x%04X\n",$u,$u);
$d=strlen($c);
echo "Picture: $c uses $d bytes. These bytes are:\n";
for($n=0;$n<$d;$n++) printf("x%2X ",ord($c[$n]));
printf("in the hexadecimal representation and\n");
for($n=0;$n<$d;$n++) printf("%3d ",ord($c[$n]));
printf("in the decimal representation\n");
}
?>
File uni.t also would be loaded for the execution; then, command
php onna.t
produces the output below:
⼥女女
The array has 9 bytes; here is its splitting:
e2 bc a5 e5 a5 b3 ef a6 81
array(3) {
[0]=>
string(3) "⼥"
[1]=>
string(3) "女"
[2]=>
string(3) "女"
}
Unicode character number 12069 id est, x2F25
Picture: ⼥ uses 3 bytes. These bytes are:
xE2 xBC xA5 in the hexadecimal representation and
226 188 165 in the decimal representation
Unicode character number 22899 id est, x5973
Picture: 女 uses 3 bytes. These bytes are:
xE5 xA5 xB3 in the hexadecimal representation and
229 165 179 in the decimal representation
Unicode character number 63873 id est, xF981
Picture: 女 uses 3 bytes. These bytes are:
xEF xA6 x81 in the hexadecimal representation and
239 166 129 in the decimal representation
The similar analysis can be performed with more universal dumping routine du.t; here is the example of the use:
php du.t "⼥女女"
In such a way, a consequence of historical encoding decisions in Unicode and legacy character sets, together with ignorance of teachers and authors of manuals force the pupils to learn Unicode, UTF8 and the programming in order to distinguish characters used in Japanese language.
Uniglif and Tarja
In the sci-fi utopia «Tartaria», the advanced font Uniglif is mentioned. Each glyph in that font is assigned the unique Unicode number, providing the bijective relation between the set of glyphs and the set of characters, at least for characters with number not exceeding XFFFF (ascii, TwoByteCharacters, ThreeByteCharacters). With font Uniglif, no confusions similar to that above appear.
In the real life, while no analogy of Uniglif is available, the technical language Tarja can be used to avoid confusions. This is japanese-based technical slang that avoids ambiguous glyphs, avoid characters that are not yet supplied with default exclusive glyphs.
At the translation form Japanese to Tarja, for characters ⼥, 女 and 女, their explicit numbers X2F25, X5973 and XF981 can be used.
Alternatively, the transliterations «おんな» or «Onna» of «onna» can be used.
In addition, words borrowed from other languages («Female», «Mujer», ..) can be used in Tarja when it causes no confusion.
Examples
Dictionary Jisho suggest examples with sound onna [6]:
おんなざか 女坂 mother; female parent
おんなきょうだい 女兄弟 the easier of two slopes
女姉妹 【おんなきょうだい】 sisters; female siblings
おんなかぶき 女歌舞伎 girls' kabuki
Censorship and Vestism
Objects and subjects, denoted with term onna, often become targets of aggression and/or censorship.
Vestists insist, that the body should be hidden, and punish those who do not obey [7].
Such a practice is described also in the sci-fi novel «Meganesia.Deportation».
The 5 pictures at right are designed to measure the hatred/tolerance of a religion.
Counting, how many of shown dressing styles are allowed by a religion, gives the qualification of its tolerance with respect to onna in the 5 grade scale.
Reuse the glyph
女 appears as a component in various glyphs. Few examples are suggested in this section.
36B2 㚲 [8] セン, ショウ, テン, yan, ten, small, weak
597B 奻 [9] ダン, ナン, dan, nan, quarrel, dispute
597D 好 [10] コウ, このむ, すく, よい, よし, yoi, good
5999 妙 [11] ミョウ, ビョウ, たえ, miyou, mysterious, strange
59B9 妹 [12] マイ, バイ, メ, いもうと, いも, imouto, younger sister
59C9 姉 [13] シ, あね, ねえさん, vasan, elder sister
59E6 姦 [14] : カン, ケン, かしましい, みだら , midara, making 3 women at once.
5B89 安 [15] アン, やすい, いずくに, いずくにか, いずくにか, いずくんぞ, やすんじる, yasashi, cheap.
Historic context
The Unicode and the many default fonts had been designed in century 20, while the computation had been underdeveloped. The printing techniques, contrary, already existed during centuries. This predetermined the attitude of the designers to the encoding and too fonts. The goal wad to reproduce the required glyph, on the screen or in the printing; it was supposed that nobody cares, how is it encoded.
In Century 21, the roles of a glyph and that of a character swap. The character becomes the principal part of the textual information; the glyphes are still needed for the Human reception of characters.
Then, the lack of the unique encoding for a glyph becomes a problem; one needs some programming (see the example above) to patch the defects of the historic combination of the Unicode with existing fonts.
ChatGPT indicates, that there was no bad will of the designers of the Unicode, nor that of teachers of Japanese and authors of the manuals. They assumed, that their students never begin to write (nor analyze) characters in Japanese, and never meet errors relates to the confusing graphical representation of the characters.
Warning
Publications about characters of the Onna set are collected and analyzed in TORI with scientific goals.
The analysis and the interpretation above should not be interpreted as an appeal for the extrajudicial execution of the font/unicode designers who did not supply some popular glyphs with unique Unicode numbers.
The more civilized solution would be to convince them to develop some realistic default analogy of the fantastic Uniglif, the font with bijective relation between glyphs and characters.
The description above may require correction(s) by a native Japanese speaker.
References
- ↑ 1.0 1.1 https://jisho.org/search/%23kanji%20%E5%A5%B3 https://jisho.org/search/%23kanji_女 woman, female Kun: おんな、 め On: ジョ、 ニョ、 ニョウ Jōyō kanji, taught in grade 1 JLPT level N5 151 of 2500 most used kanji in newspapers On reading compounds 女 【ジョ】 woman, girl, daughter, Chinese "Girl" constellation (one of the 28 mansions) 女王 【ジョオウ】 queen, female champion 処女 【ショジョ】 virgin, maiden 一女 【イチジョ】 one daughter, eldest daughter, first-born daughter 女王 【ジョオウ】 queen, female champion 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest) 老若男女 【ロウニャクナンニョ】 men and women of all ages 天女 【テンニョ】 heavenly nymph, celestial maiden, beautiful and kind woman 女房 【ニョウボウ】 wife (esp. one's own wife), court lady, female court attache, woman who served at the imperial palace, woman (esp. as a love interest) 女官 【ジョカン】 court lady, lady-in-waiting Kun reading compounds 女 【おんな】 female, woman, female sex, female lover, girlfriend, mistress, (someone's) woman 女形 【おんながた】 onnagata, male actor in female kabuki roles, female partner (in a relationship) 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon 囲い女 【かこいおんな】 mistress 雌 【め】 female, smaller (of the two), weaker, woman, wife 女神 【めがみ】 goddess, female deity 早乙女 【さおとめ】 young female rice planter, young girl 醜女 【しゅうじょ】 homely woman, plain-looking woman, female demon
- ↑ 2.0 2.1 2.2 https://util.unicode.org/UnicodeJsps/character.jsp?a=2F25 ⼥ 2F25 KANGXI RADICAL WOMAN Han Script id: allowed confuse: 女 , 女
- ↑ 3.0 3.1 3.2 https://util.unicode.org/UnicodeJsps/character.jsp?a=5973 女 5973 CJK UNIFIED IDEOGRAPH-5973 Han Script id: restricted confuse: 女 , ⼥
- ↑ 4.0 4.1 4.2 https://util.unicode.org/UnicodeJsps/character.jsp?a=F981 女 F981 CJK COMPATIBILITY IDEOGRAPH-F981 Han Script id: allowed confuse: 女 , ⼥
- ↑ https://commons.wikimedia.org/wiki/File:Human_female.jpg English: Naked female human body. Русский: Обнаженная женщина. English: Model name: (preferred not to be stated) At time of photograph: Age: 40 Height: 166 cm Weight: 47 kg BMI: 17.1 Ornaments: Ear piercing, ring on left ring finger (not in retouched images), nail polish on toe nails. There is some tilting of the upper trunk towards the left of the body, which may be positional or anatomical. Date 29 September 2011 Source Own work Author Taken at City Studios in Stockholm (www.stockholmsfotografen.se), September 29, 2011, with assistance from KYO (The organisation of life models) in Stockholm. ..
- ↑ https://jisho.org/search/%E5%A5%B3%20%E3%81%8A%E3%82%93%E3%81%AA%20%23words?page=2 女 おんな #words .. Words — 107 found おんなおや 女親 Links Noun 1. mother; female parent Details ▸ おんなざか 女坂 Links Noun 1. the easier of two slopes Details ▸ おんなきょうだい 女兄弟 Links Noun 1. sisters; female siblings Other forms 女姉妹 【おんなきょうだい】 Details ▸ おんなかぶき 女歌舞伎 Links Noun 1. girls' kabuki Details ▸ ..
- ↑ https://edition.cnn.com/2023/09/21/middleeast/iran-hijab-law-parliament-jail-intl-hnk Iranian women face 10 years in jail for inappropriate dress after ‘hijab bill’ approved By Tara Subramaniam, Adam Pourahmadi and Mostafa Salem, CNN. Published 12:34 PM EDT, Thu September 21, 2023
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=36B2 㚲 36B2 CJK UNIFIED IDEOGRAPH-36B2 Han Script confuse: none .. (kDefinition) small and weak, used in girl's name, a woman's feature; lady's face .. (kJapanese) セン|ショウ|テン ..
- ↑ 奻 597B CJK UNIFIED IDEOGRAPH-597B Han Script confuse: none .. (kJapanese) ダン|ナン ..
- ↑ 好 597D CJK UNIFIED IDEOGRAPH-597D Han Script confuse: none .. (kDefinition) good, excellent, fine; well .. (kJapanese) コウ|このむ|すく|よい|よし ..
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=5999 妙 5999 CJK UNIFIED IDEOGRAPH-5999 Han Script confuse: none .. (kDefinition) mysterious, subtle; exquisite (kJapanese) ミョウ|ビョウ|たえ
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=59B9 妹 59B9 CJK UNIFIED IDEOGRAPH-59B9 Han Script confuse: none .. (kDefinition) younger sister .. (kJapanese) マイ|バイ|メ|いもうと|いも ..
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=59C9 姉 59C9 CJK UNIFIED IDEOGRAPH-59C9 Han Script confuse: none .. (kDefinition) elder sister .. (kJapanese) シ|あね|ねえさん ..
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=59E6 姦 59E6 CJK UNIFIED IDEOGRAPH-59E6 Han Script confuse: none .. (kDefinition) adultery, debauchery; debauch .. (kJapanese) カン|ケン|かしましい|みだら
- ↑ https://util.unicode.org/UnicodeJsps/character.jsp?a=5B89 安 5B89 CJK UNIFIED IDEOGRAPH-5B89 Han Script confuse: none .. (kDefinition) peaceful, tranquil, quiet .. (kJapanese) アン|やすい|いずくに|いずくにか|いずくんぞ|やすんじる ..