https://mizugadro.mydns.jp/t/index.php?title=%E2%BC%A9&feed=atom&action=history
⼩ - Revision history
2024-03-29T07:02:01Z
Revision history for this page on the wiki
MediaWiki 1.31.16
https://mizugadro.mydns.jp/t/index.php?title=%E2%BC%A9&diff=36837&oldid=prev
T: Created page with "<div style="float:right;margin:-64px -14px 0px 12px"> 200px<br><big><center> 森の間の⼩<!--さな!-->男</center></big> </div> ⼩ is uni..."
2021-05-22T08:52:43Z
<p><b>New page</b></p><div><div style="float:right;margin:-64px -14px 0px 12px"><br />
[[File:Lilliput2.jpg|200px]]<br><big><center><br />
森の間の[[⼩]]<!--さな!-->男</center></big><br />
</div><br />
[[⼩]] is [[unicode]] character number 12073 (see [[Utf8table]])<br />
<ref><br />
https://0g0.org/unicode/2F29/<br />
[[⼩]] U+2F29 Unicode文字<br />
</ref><br />
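The decimal number 12073 and the hexadecimal notation U+2F29 quoted in the reference denote the same code point. A minimal sketch of the conversion in both directions, using only the built-in functions printf and hexdec (the variable name is chosen here for illustration):

```php
<?php
// Decimal code point number, as quoted on this page.
$number = 12073;

// Print it in the conventional U+XXXX notation.
printf("U+%04X\n", $number);    // prints U+2F29

// Convert the hexadecimal notation back to decimal.
printf("%d\n", hexdec('2F29')); // prints 12073
```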
<br />
==Phonetic==<br />
In Japanese, [[⼩]] can be pronounced <ref><br />
https://en.wikipedia.org/wiki/List_of_jōyō_kanji<br />
The jōyō kanji system of representing written Japanese consists of 2,136 characters. ..<br />
</ref> as<br><br />
"ショウ", "ちい-さい", "こ", "お";<br><br />
"shō", "chii-sai", "ko", "o"<br />
<br />
==Semantic==<br />
[[⼩]] may mean "small", "tiny", "young"<br />
<ref><br />
https://en.wiktionary.org/wiki/%E5%B0%8F<br><br />
https://en.wiktionary.org/wiki/⼩<br />
(Redirected from [[⼩]])<br />
</ref><br />
<br />
[[⼩]] is often followed by the two hiragana characters さな, as in<br />
[[⼩]]さな;<br><br />
these two hiragana characters show that [[⼩]] refers to the size of the object.<br />
<br />
==Synonym: [[小]] ==<br />
<br />
Character<br><br />
[[小]] ([[&#23567;]] (& # 2 3 5 6 7 ;)) <br><br />
is a synonym of<br><br />
[[⼩]] ([[&#12073;]] (& # 1 2 0 7 3 ;)):<br><br />
In most software, these characters are rendered with similar pictures (this may cause confusion);<br><br />
in [[Japanese]], these characters have<br />
similar meaning ("small")<br />
and similar pronunciation ("chiisai").<br />
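Although the two characters usually look alike, they are different code points, so a plain string comparison treats them as different. The sketch below shows this and one possible way to fold the radical form into the ordinary kanji before comparing. The function name fold_radical and the one-entry map are hypothetical and cover only the pair discussed here; for general text, Unicode compatibility normalization (NFKC) is the usual tool for this kind of folding.

```php
<?php
// Hypothetical one-entry map: U+2F29 (radical form) to U+5C0F (ordinary kanji).
const RADICAL_TO_KANJI = [
    "\u{2F29}" => "\u{5C0F}",
];

// Replace the radical form with the ordinary kanji wherever it occurs.
function fold_radical(string $s): string {
    return strtr($s, RADICAL_TO_KANJI);
}

// The raw strings differ byte by byte.
var_dump("\u{2F29}" === "\u{5C0F}");               // bool(false)

// After folding, the two spellings compare equal.
var_dump(fold_radical("\u{2F29}") === "\u{5C0F}"); // bool(true)
```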
<br />
==Antonyms: [[⼤]] and [[大]] ==<br />
<div style="float:right;margin:-40px -14px 0px 8px"><br />
[[File:Gulliver2.jpg|200px]]<br><big><center>岩の間の[[⼤]][[男]]</center></big><br />
</div><br />
Unicode characters<br><br />
[[⼤]] ([[&#12068;]] (& # 1 2 0 6 8 ;)),<br><br />
[[大]] ([[&#22823;]] (& # 2 2 8 2 3 ;)),<br><br />
can be interpreted as antonyms of [[⼩]].<br />
In [[Japanese]] and in [[Chinese]], these characters may have the<br />
opposite meaning; each of [[⼤]] and [[大]] may mean "big", "large", "huge".<br />
<br />
==Encoding==<br />
HTML input:<br> <br />
[[⼩]] ([[&#12073;]] (& # 1 2 0 7 3 ;))<br><br />
[[⼩]] ([[&#x2F29;]] (& # x 2 F 2 9 ;))<br />
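The decimal and the hexadecimal character references above should decode to the same character. A short sketch that checks this with the built-in html_entity_decode; the variable names are chosen here for illustration:

```php
<?php
// Decode the decimal and the hexadecimal numeric character references.
$decimal = html_entity_decode('&#12073;', ENT_QUOTES | ENT_HTML5, 'UTF-8');
$hex     = html_entity_decode('&#x2F29;', ENT_QUOTES | ENT_HTML5, 'UTF-8');

// Both references yield the same UTF-8 string.
var_dump($decimal === $hex);   // bool(true)

// Show the bytes of the decoded character.
echo bin2hex($decimal), "\n";  // e2bca9
```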
<br />
[[⼩]] is encoded with 3 bytes:<br><br />
xE2 xBC xA9 in the hexadecimal representation and<br><br />
226 188 169 in the decimal representation<br />
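These three bytes follow from the UTF-8 rule for code points between U+0800 and U+FFFF: the 16 bits of the code point are split into groups of 4, 6 and 6 bits, which are placed behind the prefixes 1110, 10 and 10. A minimal sketch of that arithmetic; the function name utf8_three_bytes is hypothetical and is not part of the program on this page:

```php
<?php
// Hypothetical helper: encode a code point in the range U+0800..U+FFFF
// as three UTF-8 bytes (returned as integers).
function utf8_three_bytes(int $cp): array {
    return [
        0xE0 | ($cp >> 12),          // 1110xxxx: top 4 bits
        0x80 | (($cp >> 6) & 0x3F),  // 10xxxxxx: middle 6 bits
        0x80 | ($cp & 0x3F),         // 10xxxxxx: low 6 bits
    ];
}

$bytes = utf8_three_bytes(0x2F29);
printf("%02x %02x %02x\n", ...$bytes); // e2 bc a9
printf("%d %d %d\n", ...$bytes);       // 226 188 169
```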
<br />
This encoding is compared with the encoding of related characters<br />
by the [[PHP]] program below:<br />
<br />
<pre><br />
<?php<br />
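// PHP 7.4 and later ship a built-in mb_str_split() in the mbstring extension;<br />
// define this fallback only when it is missing, to avoid a redeclaration error.<br />
if (!function_exists('mb_str_split')) {<br />
function mb_str_split($str) {<br />
// split a multibyte string into characters:<br />
// split at all positions, except after the start (^)<br />
// and before the end ($)<br />
$pattern = '/(?<!^)(?!$)/u';<br />
return preg_split($pattern,$str);<br />
}<br />
}<br />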
<br />
function uniord($a) <br />
{<br />
$M=strlen($a);<br />
$p=ord($a[0]); if($M==1) return $p;<br />
$p-=194; $p*=64; $p+=ord($a[1]); if($M==2) return $p;<br />
$p-=2050; $p*=64; $p+=ord($a[2]); return $p;<br />
}<br />
<br />
$a='⼤ 大 ⼩ 小'; /* two pairs of different unicode characters, separated by spaces */<br />
<br />
$N=strlen($a);<br />
echo "The array has $N bytes; here is its splitting:\n";<br />
<br />
for($n=0;$n<$N;$n++)<br />
{<br />
printf("%02x ",ord($a[$n]) );<br />
}<br />
echo "\n";<br />
<br />
$b = mb_str_split($a);<br />
<br />
var_dump($b);<br />
$M=count($b);<br />
<br />
#mb_internal_encoding("UTF-8");<br />
<br />
for($m=0;$m<$M;$m++)<br />
{<br />
printf("\n");<br />
$c=$b[$m];<br />
$u=uniord($c);<br />
printf("Unicode character number %05d id est, x%04x\n",$u,$u);<br />
$d=strlen($c);<br />
echo "Picture: $c uses $d bytes. These bytes are:\n";<br />
for($n=0;$n<$d;$n++) printf("x%2x ",ord($c[$n]));<br />
printf("in the hexadecimal representation and\n");<br />
for($n=0;$n<$d;$n++) printf("%3d ",ord($c[$n]));<br />
printf("in the decimal representation\n");<br />
}<br />
?><br />
</pre><br />
<br />
<br />
The output is:<br />
<br />
<pre><br />
The array has 15 bytes; here is its splitting:<br />
e2 bc a4 20 e5 a4 a7 20 e2 bc a9 20 e5 b0 8f <br />
array(7) {<br />
[0]=><br />
string(3) "⼤"<br />
[1]=><br />
string(1) " "<br />
[2]=><br />
string(3) "大"<br />
[3]=><br />
string(1) " "<br />
[4]=><br />
string(3) "⼩"<br />
[5]=><br />
string(1) " "<br />
[6]=><br />
string(3) "小"<br />
}<br />
<br />
Unicode character number 12068 id est, x2f24<br />
Picture: ⼤ uses 3 bytes. These bytes are:<br />
xe2 xbc xa4 in the hexadecimal representation and<br />
226 188 164 in the decimal representation<br />
<br />
Unicode character number 00032 id est, x0020<br />
Picture: uses 1 bytes. These bytes are:<br />
x20 in the hexadecimal representation and<br />
32 in the decimal representation<br />
<br />
Unicode character number 22823 id est, x5927<br />
Picture: 大 uses 3 bytes. These bytes are:<br />
xe5 xa4 xa7 in the hexadecimal representation and<br />
229 164 167 in the decimal representation<br />
<br />
Unicode character number 00032 id est, x0020<br />
Picture: uses 1 bytes. These bytes are:<br />
x20 in the hexadecimal representation and<br />
32 in the decimal representation<br />
<br />
Unicode character number 12073 id est, x2f29<br />
Picture: ⼩ uses 3 bytes. These bytes are:<br />
xe2 xbc xa9 in the hexadecimal representation and<br />
226 188 169 in the decimal representation<br />
<br />
Unicode character number 00032 id est, x0020<br />
Picture: uses 1 bytes. These bytes are:<br />
x20 in the hexadecimal representation and<br />
32 in the decimal representation<br />
<br />
Unicode character number 23567 id est, x5c0f<br />
Picture: 小 uses 3 bytes. These bytes are:<br />
xe5 xb0 x8f in the hexadecimal representation and<br />
229 176 143 in the decimal representation<br />
</pre><br />
<br />
The program reveals the encoding of the four related Unicode characters:<br><br />
[[⼤]] ([[&#12068;]] (& # 1 2 0 6 8 ;)),<br><br />
[[大]] ([[&#22823;]] (& # 2 2 8 2 3 ;)),<br><br />
[[⼩]] ([[&#12073;]] (& # 1 2 0 7 3 ;)),<br><br />
[[小]] ([[&#23567;]] (& # 2 3 5 6 7 ;))<br />
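The constants 194 and 2050 in the function uniord fold the UTF-8 prefix bits into plain arithmetic: for two bytes, (b0-194)*64+b1 equals (b0-192)*64+(b1-128), and for three bytes the subtraction of 2050 = 30*64+128+2 removes the remaining prefix contributions. The same decoding can be written with explicit bit masks, which may be easier to check. The sketch below is an assumed equivalent, not the page's own code; the name utf8_code_point is hypothetical, and, like uniord, it handles only sequences of 1 to 3 bytes.

```php
<?php
// Hypothetical equivalent of uniord: decode a 1-, 2- or 3-byte UTF-8
// sequence into its code point using bit masks.
function utf8_code_point(string $c): int {
    $b = array_values(unpack('C*', $c)); // bytes as integers, indexed from 0
    switch (count($b)) {
        case 1:
            return $b[0];
        case 2:
            return (($b[0] & 0x1F) << 6) | ($b[1] & 0x3F);
        case 3:
            return (($b[0] & 0x0F) << 12) | (($b[1] & 0x3F) << 6) | ($b[2] & 0x3F);
    }
    throw new InvalidArgumentException('expected 1 to 3 bytes');
}

printf("%d\n", utf8_code_point("\xE2\xBC\xA9")); // 12073
printf("%d\n", utf8_code_point("\xE5\xB0\x8F")); // 23567
```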
<br />
==References==<br />
<references/><br />
<br />
<br />
==Keywords==<br />
[[Japanese]],<br />
[[Kanji]],<br />
[[SomeU]],<br />
[[Unicode]],<br />
[[Utf8]],<br />
[[UtfH]],<br />
[[Utf8table]],<br />
<br />
[[⼤]] ([[&#12068;]] (& # 1 2 0 6 8 ;)),<br />
[[大]] ([[&#22823;]] (& # 2 2 8 2 3 ;)),<br />
[[⼩]] ([[&#12073;]] (& # 1 2 0 7 3 ;)),<br />
[[小]] ([[&#23567;]] (& # 2 3 5 6 7 ;))<br />
<br />
[[Category:U12068]]<br />
[[Category:Japanese]]<br />
[[Category:Kanji]]<br />
[[Category:SomeU]]<br />
[[Category:Unicode]]<br />
[[Category:UtfH]]<br />
[[Category:Utf8]]<br />
[[Category:Utf8table]]<br />
[[Category:⼩]]<br />
[[Category:小]]</div>
T