ãããã°ã©ãã®ããã®æåã³ã¼ãæè¡å ¥éããèªãã§èªåãªãã«çè§£ããç¹ãã¶ãã¯ãªã¨ã¾ã¨ãã¦ã¿ãã ããã»ã©æ£ç¢ºæ§ãæ±ãã¦æ¸ãã¦ããããã§ã¯ãªãã®ã§ãééã£ã¦ãå¯è½æ§å¤§ã§ãã ééããªã©ããã°ã³ã¡ã³ããªã©é ããã¨ãããããã§ãã ããããã®æåã³ã¼ãã¯ã©ãéãã®ãï¼ æ¥æ¬èªã®æåã³ã¼ãã¯å¤§ãã以ä¸ã®ï¼ã¤ã«åãããã JIS X 0208 æåéåããã¼ã¹ã«ãããã® Unicodeæåéåããã¼ã¹ã«ãããã® JIS X 0208 æåéåããã¼ã¹ã«ããæåã³ã¼ãã«ã¯ãEUC-JP, Shift_JIS, ISO-2022-JP ãããã Unicodeæåéåããã¼ã¹ã«ããæåã³ã¼ãã«ã¯ãUTF-8, UTF-16 ãªã©ãããã ä¸ã§æãããæåã³ã¼ããã¨ã¯æ£ç¢ºã«ã¯ãã¨ã³ã³ã¼ãã£ã³ã°ï¼æå符å·åæ¹å¼ï¼ãã®äºãæãã æå符å·åæ¹å¼ æåéåã£ã¦ï¼ èªãã§ãã®ã¾ãã¾âæåã®ç¨®é¡ã®éã¾ãâãããã£ã©
ã¯ããã« 2008å¹´11æ27æ¥ãGoogleã¯æ¥æ¬ã®æºå¸¯é»è©±ã®çµµæåãUnicodeã«åé²ããè¨ç»ãå ¬è¡¨ãããããã¾ã§7åã«ããã£ã¦ãä¼ããã¦ããé£è¼ãçµµæåãéãã¦ãã¾ã£ããã³ãã©ã®ç®±ãã¯ããã®å ¬è¡¨ããå¾ã®åãã追ã£ããã®ã ã ã§ã¯ããã以åã®å社ã¯ä½ããã¦ããã®ãï¼ãã¤ã¾ããGoogleã¯ã©ããªããã»ã¹ãçµã¦çµµæåãUnicodeã«ææ¡ããã¨æ±ºããã®ã ãããä»åãå ±åããã®ã¯ãã®ãã¨ã ã ã¤ã³ã¿ãã¥ã¼ã«çãã¦ãããã®ã¯æ¡äºå彦æ°ãæ°ã¯å¤§å¦æä»£ã«ã¹ã«ã©ã·ããï¼å¥¨å¦éï¼ã§æ¸¡ã£ã¦ä»¥æ¥ç±³å½ã«æ®ãã¤ã¥ãã¦ãããè¨èªå¦ã»æ¥æ¬èªå¦ãå°æ»ãã大å¦é¢çã大妿å¡ãªã©ã®çµæ´ãæã¡ã1996å¹´ã«å¦è¡çããNetscapeå½éåé¨éã«å ¥ç¤¾ã2004å¹´ã«Mozilla Japanã®è¨ç«ã«ãããã£ãå¾ã2005å¹´ã«Googleã«ç§»ã£ãçµé¨è±ããªå½éåã¨ã³ã¸ãã¢ã ããã¦ã³ãã³ãã¥ã¼ã«ããç±³æ¬ç¤¾ã«ãã£ã¦ãä»åã®ç¬¦å·
æ®æ®µä½¿ç¨ããæ¼¢åã®æéã¨ãªããå¸¸ç¨æ¼¢å表ããã2010年度ã«ãæ¹æ£ããããæ°ãã«è¿½å ããã196æåã®ä¸ã«ãæåã³ã¼ããã·ããJISãã«ãªãæ¼¢åãå«ã¾ãã¦ãããããæ å ±ã·ã¹ãã ã«å¤§ããªå½±é¿ãä¸ãããã ãææ°ã®JISè¦æ ¼ãJIS X 0213:2004ãã®æ¹æ£ã«å§å¡ã¨ãã¦ãããã£ã京é½å¤§å¦äººæç§å¦ç ç©¶æé屿±ã¢ã¸ã¢äººææ å ±å¦ç ç©¶ã»ã³ã¿ã¼ã®å®å²¡åä¸åææããåé¡ã®æ ¸å¿ã解説ããããããããï¼æ¥çµã³ã³ãã¥ã¼ã¿ï¼ 2009å¹´11æ10æ¥ãæé¨ç§å¦çã®ãæå審è°ä¼å½èªåç§ä¼ãã«ããã¦ãå¸¸ç¨æ¼¢åè¡¨ã®æ¹æ£æ¡ãæ¿èªããããç¾è¡ã®å¸¸ç¨æ¼¢å表ã«ãã1945åãããéããéããåºããåããè¹ãã®5åãåé¤ããæ°ãã«196åã追å ããæ¹æ£æ¡ã§ã2010年度ã®å é£å示ãç®æãã¦ããã æ°ããå¸¸ç¨æ¼¢å表ãå示ãããã¨ããã·ããJISãããEUC-JPãã¨ãã£ã徿¥ããããæåã³ã¼ãã使ç¨ããã·ã¹ãã ã§å¤§ããªåé¡ãçã
ãã¡ã¤ã«åã¯ãå·¦ããå³ã«èªããã¨ã¯éããªã?!ï¼ã»ãã¥ãªãã£Tips for Todayï¼8ï¼ï¼1/3 ãã¼ã¸ï¼ ç§ãã¡ã®å¸¸èãä¸çã§ã¯éç¨ããªããã¨ãããã¾ããæ»æè ã¯ãããªå¿ã®ãããçã£ã¦ãè½ã¨ãç©´ã仿ãã¾ããä»åã¯ãããåèªèããããã®ãããªææ³ã¨ããã®å¯¾çTipsã解説ãã¾ãï¼ç·¨éé¨ï¼ çããããã«ã¡ã¯ã飯ç°ã§ããå æ¥ãã»ãã¥ãªãã£ç®¡çè ã®æ¹ã ã¨ãä»å¾ã®ã¦ã¤ã«ã¹å¯¾çã®ããæ¹ãã«ã¤ãã¦æè¦äº¤æãããæ©ä¼ãããã¾ãããåå è ããã¯æ´»çºãªæè¦ã質åãé£ã³äº¤ããçãä¸ãããè¦ããæè¦äº¤æä¼ã¨ãªãã¾ãããç§èªèº«ãå¤ãã®æ°ä»ããå¦ã³ãå¾ããã¨ãã§ããè²´éãªæéãéãããã¨ãã§ãã¾ããã ãã®æè¦äº¤æä¼ã®ä¸ã§ãUnicodeã®å¶å¾¡æåãå©ç¨ãããã¡ã¤ã«ã®æ¡å¼µåå½è£ ã®è©±é¡ãåºã¾ããããã®ææ³ã¯ç®æ°ããææ³ã§ã¯ãªããæ°å¹´åãããã§ã«ææããã¦ãããã®ã§ããããããä¹ ãã¶ãã«æ¬ææ³ã«ã¤ãã¦è°è«ããã
2009å¹´09æ13æ¥13:00 ã«ãã´ãªLightweight Languages #perl - utf8::decode()ã§ã¯ãªãEncode::decode_utf8()ã使ãã¹ãçç± é§ç®ã§ãã [ã] Perl ã® utf8 ã¾ããã®ãã¾ããªã æè¿è¯ã使ããã¾ããªããã¨ãããã¤ãã£ãªã ã utf8::decode($text) unless utf8::is_utf8($text); ããããå ´åã¯ãEncode::decode_utf8()ã§ãªãã¨ã 以ä¸ãããããã ããã #!/usr/bin/perl use strict; use warnings; use Encode; use Devel::Peek; for my $bytes ( "\x2F", "\xC0\xAF", "\xE0\x80\xAF", "\xF0\x80\x80\xAF" ) { my $u
_æ¢ã«ãããåã«ãªãã¤ã¤ããæåã¨ã³ã³ã¼ãã£ã³ã°ããªãã¼ã·ã§ã³ 大å£éç·ããã®æ¥è¨ã使 ããããåã«ãªããªãæåã¨ã³ã³ã¼ãã£ã³ã°ããªãã¼ã·ã§ã³ãã«ç«¯ãçºãã¦ãå ¥åãã¼ã¿ãªã©ã®æåã¨ã³ã³ã¼ãã£ã³ã°ã®å¦¥å½æ§ãã§ãã¯ãã©ãè¡ãããè°è«ã«ãªã£ã¦ãã¾ãããã§ãã¯èªä½ãå¿ è¦ã§ãããã¨ã¯çããåæã®ããã§ããã ãã§ãã¯æ å½ã¯ã¢ããªã±ã¼ã·ã§ã³ããåºç¤ã½ããï¼è¨èªããã¬ã¼ã ã¯ã¼ã¯ãªã©ï¼ã å ¥åã»å¦çã»åºåã®ã©ãã§ãã§ãã¯ããã®ã ã¨ããç¹ã§ããã¾ãã¾ãªæè¦ãå¯ãããã¦ãã¾ãã大å£ããèªèº«ã¯ãã¢ããªã±ã¼ã·ã§ã³ãå ¥åæç¹ã§ãã§ãã¯ãã¹ãã¨ä¸»å¼µããã¦ãã¾ããããã«å¯¾ãã¦ãããåºç¤ã½ããã§ãã§ãã¯ãã¹ãã ã¨ããæååãã使ãã¨ããã«ãã§ãã¯ãã¹ãã ã¨ããæè¦ãåºã¦ãã¾ãã ãã¨ãã°ãid:ikepyonã®æ¥è¨ã[ã»ãã¥ãªãã£]使 ããããåã«ãªããªãæåã¨ã³ã³ã¼ãã£ã³ã°ããªãã¼ã·ã§ã³ãã§ã¯ããã®ãã§ãã¯ã¯åºç¤ã½ã
çµµæåã®åé²ãããã£ã¦ãå½éè¦æ ¼ã§å¤§è«äº--ãGoogleææ¡ããæ¯ãè¿ã çããããã«ã¡ã¯ãé¢ç½ãã¦ã¿ã¡ã«ãªãï¼ï¼ï¼æåã³ã¼ã漫è«ã®æéããã£ã¦ã¾ããã¾ããã2æããã¨ã³ã¨ã³ã§æ¸ãã¦ããçµµæåã®å ±åããããããä»åãæçµåãã©ãããããããä»ãåããã ããã ãã¦ãååã¯ã©ãã¾ã§ã話ãããã®ã§ããã£ããæ¥æ¬ã®çµµæåãUnicodeã«åé²ãããã¨ããGoogleã¨Appleã«ããææ¡ï¼ä»¥ä¸ã主å°è ã®åãã¨ãGoogleææ¡ã¨ç¥ï¼ã§ãããå»å¹´ã®12æã«ãããªãã¯ã¬ãã¥ã¼ãéå§ãããã¨ãUnicode-MLã§æãªãã¬éé£ã®åµãå¹ãèãããã¨ãããã§ã®åçºãä¸è¨ã§è¨ã表ããªããæ¥æ¬ã®æåã«å¼·ãä¾åããçµµæåãåç´ã«å½éè¦æ ¼ã«åé²ãããã¨ããç¹ã«ãã£ããã¨ã ãªããªãå½éè¦æ ¼ã®å¯©è°ã¯åå åå½ã®ç·æã§æãç«ã£ã¦ãããç¹å®ã®å½ãã便å©ã«ä½¿ããªãæåãåé²ãããã¨ã¯ãå½ç¶å¼·ãå対ããããããã§ããããã«
2009å¹´06æ23æ¥15:30 ã«ãã´ãªLightweight Languages perl - use CGI; use Encode; # éè±èªWebããã°ã©ãã³ã°3åå ããã¯ãå®ã¯Perlã«éããæªã ã«äºå®ã ã£ããããã®ã§ãã.... Perl ã§ãã©ã¼ã ãã¼ã¿ãã UTF-8 æ¥æ¬èªæåãã¨ãã ãæ¹æ³ (ããã°ã©ãã³ã°ã®å°ç³ã»å¤§ç³) UTF-8 ã®ãã©ã¼ã ã«ãã£ã¦ããããããã¼ã¿ã®ãªãããæ¥æ¬èªæåãã¨ãã ããã¨ã¯ï¼æ¥æ¬ã® Perl CGI ããã°ã©ããªãããã¦ãå¿ è¦ã«ãªããã¨ã§ããï¼ ã¨ãããï¼ãã®æ¹æ³ã¯æå¤ã«ç¢ºç«ããã¦ããªãããã«ã¿ããï¼ ããããå çºè¨ã®æ¹æ³ã¯å ç¥å¸°ããããã®ã§ã Perlããã°ã©ãã¼ä»¥å¤ã«ããWebããã°ã©ãã¼ã§ããã°æç¨ãªentryã§ãã Perlã§Webããã°ã©ãã³ã°ããå ´åã®ä¸åå Queryã¯CGIã¢ã¸ã¥ã¼ã«ã§å¦çãã æåã³ã¼ãã¯Encode
æ®éã§ã¯èããããªãåªéç--ãGoogleææ¡ããæ¯ãè¿ã çããããã«ã¡ã¯ãæ¯åº¦ããªãã¿ï¼ï¼ï¼æåã³ã¼ã漫è«ã®æéããã£ã¦ã¾ããã¾ãããååã3æã®æ²è¼ã§ããã3ã«æã¶ãã§ãããä»ã¾ã§3åã«ããã£ã¦çµµæåãUnicodeåã³ISO/IEC 10646ï¼å½é符å·åæåéåï¼ã«åé²ãããã¨ããææ¡ã®åãã«ã¤ãã¦ã説æãã¦ãã¾ããããä»åãã2åã«åãã¦å®çµç·¨ããå±ããã¾ããã©ãããããããä»ãåããã ããã ã²ããã¶ãã§ããããããã¾ã§ã®ãã¤ã³ããæ´çãã¦ããã¾ããããåè¿°ãããææ¡ãã¨ã¯ããã¨ãã¨ã¯Unicodeã«åé²ããããã«GoogleãAppleã¨å ±åã§ä½æãããã®ã§ãã以ä¸ã主å±è ã®ååãã¨ããGoogleææ¡ãã¨å¼ã¶ãã¨ã«ãã¾ããããã¯ãã®2æã«éãããæé«è°æ±ºæ©é¢ãUTCä¼è°ã§æ¿èªããã¦Unicodeã³ã³ã½ã¼ã·ã¢ã ã®ç·æã¨ãªãã¾ãããã¤ãã§Googleææ¡ã¯ISO/IEC 1
ååã¾ã§ãæ¯ãè¿ã--Unicodeã³ã³ã½ã¼ã·ã¢ã ã®å½±é¿å ååã¯ã©ãã¾ã§ã話ããã¾ããã£ããä¸çä¸ã®æåã®åé²ãç®çã¨ããæåã³ã¼ãè¦æ ¼ãUnicodeã¯ãç±³å½ã®IT伿¥ãä¸å¿ã«çµæãããUnicodeã³ã³ã½ã¼ã·ã¢ã ãå¶å®ããããã¡ã¯ãè¦æ ¼ã«éããªããã¨ããããå ¬çãªå½éæ©é¢ãå®ãããã¸ã¥ã¼ã«è¦æ ¼ISO/IEC 10646ã¨åæãããã¨ã§ãWTO/TBTåå®ã«ãã¨ã¥ãä¸çä¸ã®å½ã ã«æ®åãããããã¡ãªãããå¾ããã¨ã ã¾ããUnicodeã³ã³ã½ã¼ã·ã¢ã èªä½ã¯ãªã¼ãã³ãªçµç¹ã ããã©ãæå¿æ±ºå®ãè¡ãUTCï¼Unicode Technical Committee/Unicodeæè¡å§å¡ä¼ï¼ã§ä¸ç¥¨ãæããæ¨©å©ãæã¤ã®ã¯ä¸æ¡ãã®å£ä½ã«éããããã¨ãããã¦UTCã¯ISO/IEC 10646ã®ã¢ã¡ãªã«ã»ãã·ã§ãã«ããã£ã§ããL2å§å¡ä¼ã¨ååã§ããéå¬ããã¦ããããåæã«L2å§å¡ä¼ã¨Unicodeã³ã³ã½ã¼
Unicodeãæºå¸¯é»è©±ã®çµµæåãåé²ã¸ çµµæåã£ã¦ãªã«ï¼ããèããã¦ãå¤ãã®äººã¯ããããããã¯ã¨çããããã¯ããããè¨ãã°ã¡ãã£ã¨åã«ãã¡ã¼ã«ã®ãã¼ããã¼ã¯ã«ã ã¾ããããªï¼ã8å²ã®å¥³æ§ã¯ãæäººä»¥å¤ã«ã使ãããï¼RBB NAVIï¼ãªãã¦ãããã¥ã¼ã¹ãããã¾ãããæºå¸¯é»è©±ã®å人æ®åçã9å²ãä¸åãï¼å¹³æ20å¹´å é£åºæ¶è²»åå調æ»ï¼ãã®å½ã«ããã¦ãçµµæåã¯ãããããµãããã®ã«ãªã£ã¦ããç¾å®ãããã¾ãã 2008å¹´ã®11æ27æ¥ãGoogleãæºå¸¯é»è©±ã§ä½¿ãããçµµæåãå½éçãªæåã³ã¼ãè¦æ ¼ãUnicodeã«åé²ãããã¨ããããã¸ã§ã¯ãé²è¡ä¸ã§ãããã¨ãçºè¡¨ãã¾ãããã§ã¯ããã®ãã¥ã¼ã¹ã¯ä½ãæå³ããã®ã§ããããããã¦ç§ãã¡ã«ä½ãããããã®ã§ããããä»åãã3åã«åãã¦èãã¦ã¿ããã¨æãã¾ãã ã¾ãæ´å²ãæ¯ãè¿ã£ã¦ã¿ã¾ãããããã¤ã¯çµµæåã使ã£ãã®ã¯æºå¸¯é»è©±ãæåã¨ããããã§ããã¾ãããå è¡ãããã®
ã°ã¼ã°ã«ã¯æ¥æ¬ã®æºå¸¯é»è©±ã®çµµæåããä¸çã«åºããåå¨ã«ãªãããããã ã 11æ27æ¥ã«Google Japan Blogã«æç¨¿ãããã¨ã³ããªã«ããã¨ãã°ã¼ã°ã«ã¯æ¥æ¬ã®æºå¸¯é»è©±ã®çµµæåã®å ¨ã¦ããã¦ãã³ã¼ãã®æåã¨ãã¦å ±é符å·åããèãã ã¨ããã çµµæåã¯ãã¨ãã¨æ¥æ¬ã®æºå¸¯é»è©±ä¼ç¤¾ãåºæã®ãã®ã使ã£ã¦ãããç°ãªãéä¿¡ä¼ç¤¾å士ã§çµµæåãéãåããã¨ã¯ã§ããªãã£ããç¾å¨ã§ã¯åãã£ãªã¢ãååãã¦ããäºãä¼¼ããããªçµµæåãããå ´åã«ã¯ã夿ãã¦è¡¨ç¤ºãã¦ããã ã°ã¼ã°ã«ã¯ãããæ¡å¤§ããçµµæåãã¦ãã³ã¼ãã¨ãã¦æ¨æºåãããã¨ã§ãã©ã®éä¿¡äºæ¥è éã§éã£ãçµµæåãåãããã«è¡¨ç¤ºãããä¸çãå®ç¾ãããã¨ãããããã«ããæ¤ç´¢ã¨ã³ã¸ã³ã§çµµæåãæ¢ãã°ãçµæãè¿ã£ã¦ãããï¼ã°ã¼ã°ã« ã¦ãã³ã¼ãã½ããã¦ã§ã¢ã¨ã³ã¸ãã¢ã®Markus Schereræ°)ããã«ããããã¨ã®ãã¨ã ã ãã®ã°ã¼ã°ã«ã®éæãå®ç¾ããããã«ã¯ãç¾å¨
æ£ããä¸¦ã³æ¿ãã§ã¯ã表示ã¯(A)ã®ã¾ã¾ã§ãããééã£ãä¸¦ã³æ¿ãã§ã¯ãæ£è¦çµåã¯ã©ã¹ãäºãã«çããMACRONã¨ACUTEãä¸¦ã³æ¿ããããã表示ã¯(B)ã®ããã«ãeã®ä¸ã®ã¢ã¯ã»ã³ãè¨å·ã®ä½ç½®ãå ¥ãæ¿ãã£ã¦ãã¾ãã¾ãã æ£è¦åè§£ã»äºæåè§£ ããæååã®æ£è¦åè§£ (Canonical Decomposition) ãå¾ãã«ã¯ãã¾ããããããã®æåãæ£è¦ãããã³ã°ã«ãã£ã¦å帰çã«ãå¯è½ãªéããåè§£ãã¾ããããªãã¡ã1ååè§£ããå¾ã«ç¾ããæåããªããåè§£å¯è½ã§ããã°ããã«åè§£ãã¾ããåè§£ãããã³ã°ããã®æåèªèº«ã§ããå ´åã¯ãåè§£ä¸å¯è½ãªã®ã§ããã®ã¾ã¾ã§ãã ããããåè§£ããã ãã§ã¯å¿ ãããæ£ããçµæãå¾ããã¾ãããã¤ã¾ããçµåæåã®é åºã®ä¸ææ§ãä¿è¨¼ãããããåè§£å¾ã®æååã«å¯¾ãã¦æ£è¦é åºã¢ã«ã´ãªãºã ãé©ç¨ããªããã°ãªãã¾ããããã®ããã«ãæ£è¦ãããã³ã°ã«ããå帰çåè§£ã¨ãæ£è¦é åºã¢ã«ã´ãªãºã ã«ã
id:tomi-ru ããã [http://e8y.net/mag/015-encode/:title] ã¨ããã¨ã¦ããã©ã¯ãã£ã«ã«ãª [http://search.cpan.org/perldoc?Encode:title=Encode] å ¥éããæ¸ãã«ãªã£ãã®ã§ï¼ããããéãåãå£ã§æ¸ãã¦ã¿ãããªãã¾ããã ãã¡ããã®åºç¤ï¼èªã¿é£ã°ãå¯ï¼ æåã»ãã, ãã£ã©ã¯ã¿ã»ãã, æåéå, æåéå - Wikipedia ã¨ã³ã³ã¼ãã£ã³ã°, 符å·åæ¹å¼, æå符å·åæ¹å¼ - Wikipedia ãã®2ã¤ã¯ç°ãªãã¾ããã¨ãã«ç¥ããªãã¦ãä¸è¨ã®ææ¸ãèªããã¨ã¯ã§ãã¾ããï¼çè§£ãã¦ããã¨ããã«ãªãã¾ããããããç¥ããã人ã¯èªç¿ãã¦ãã ããã æåã»ããã®ä¾ Unicode JIS X 0208 ã²ãããªã¨ãã«ã¿ã«ãã¨ãæ¼¢åã¨ã ASCII æå ã¨ã³ã³ã¼ãã£ã³ã°ã®ä¾ UTF-8 ISO-202
2008å¹´05æ11æ¥21:00 ã«ãã´ãªLightweight LanguagesTips perl - æååç §ã(en|de)codeãã ãã§ã«æ£è§£ãæ¸ããã¦ãã¾ããã [ã] Unicode ã®16鲿°ã®å®ä½åç §ãæ£è¦è¡¨ç¾ãªã©ã§å ã«æ»ã pack 㨠Encode::decode ã使ãã¨è¯ãã¿ããã ã¯ã¦ãªããã¯ãã¼ã¯ - miyagawaã®ããã¯ãã¼ã¯ / 2008å¹´05æ11æ¥ ãã HTML::Entities::decode / regexp ã§ã chr(hex($1)) ã®ã»ãããããããããªãã㪠繰ãè¿ãã¦ããã ãã®ä¾¡å¤ã¯ããã®ã§ã HTML::Entitiesã使ã ã¾ããHTML::Entitiesã®decode_entities()ã使ãã¨ããæ¹æ³ãããã¾ããããããã¹ããã©ã¯ãã£ã¹ããªã #!/usr/local/bin/perl use strict;
2008å¹´05æ08æ¥04:00 ã«ãã´ãªLightweight Languages perl - Encode ä¸ç´ 以忏ãã 404 Blog Not Found:perl - Encode å ¥é ã¯å¤§å¥½è©ã§ãããã ã¦ã§ãã§å©ç¨ãããæåã³ã¼ããUnicodeãASCIIãä¸åã--ã°ã¼ã°ã«ãæããã«:ãã¼ã±ãã£ã³ã° - CNET Japan UnicodeãASCIIã追ãè¶ããWorld Wide Webä¸ã§æãå¤ãå©ç¨ããã¦ããæåã³ã¼ãä½ç³»ã«ãªã£ãã¨Googleã®ã·ãã¢ã¤ã³ã¿ã¼ãã·ã§ãã«ã½ããã¦ã§ã¢ã¢ã¼ããã¯ãMark Davisæ°ãããã°ã§è¿°ã¹ã¦ããã ã¨ããæä»£ã«å®å ¨å¯¾å¿ããã«ã¯ãå ¥é以ä¸ã®ç¥èãã¡ãã£ã¨å¿ è¦ã«ãªãã¾ãã ä¾ãã°ãæ¬blogããã¹ããã¦ããã¦ããlivedoor blogã®æåã³ã¼ãã¯EUC-JPããæä»£ã¯Unicodeãã ã¨è¨ã£ã¦ããããããäºæ ãã¾ã
2008å¹´02æ18æ¥10:00 ã«ãã´ãªLightweight Languages perl - utf8::is_utf8("\x{ff}") == 0 ã¡ããã©ããæ©ä¼ãªã®ã§ãPerl 5.8以éã«ãããutf8ãã©ã°ã®ç«ã¡æ¹ãã unknownplace.org - 2008/02/17 - utf8::is_utf8 ã¨ãããã¨ã§ã"\x{6751}\x{702c}\x{5927}\x{8f14}" ãªã©ã¨ããData::Dumper表è¨ã§ããªãããã utf-8ãã©ã°ããã¤ãããããªããã¨ãããã¨ãããããã£ããã ã¨æãã®ã ããã©ã \x{UUUUUU}ã¨utf8 flag ã¾ãã¯ã¯ã¤ãºã§ãã以ä¸ãã©ãåºåãããããçããªããã sub pfrag{ print utf8::is_utf8($_[0]) ? 1 : 0, "\n" } pfrag "Hell\xC3, worl
Redirecting⦠Click here if you are not redirected.
Python ã® unicodedata ã¢ã¸ã¥ã¼ã« Unicode ã®ã¡ãã£ã¨ããããã¹ãå¦çããããã¨æãã Python ã® unicodedata ã¢ã¸ã¥ã¼ã«ã使ã£ã¦ã¿ã¾ãããããã¯é常ã«ä¾¿å©ã§ãã unicodedata 㯠Python ã«æ¨æºã§ä»å±ãããããå¥éã®ã¤ã³ã¹ãã¼ã«ã¯ä¸è¦ã§ããæ¬¡ã®ãããªãã¨ãã§ãã¾ãã æåã®ååãåå¾ãã æåã®ååãåå¾ãããã¨ãã§ãã¾ããUnicode ã®æåã«ã¯ãã¹ã¦ä¸æã®ååãã¤ãããã¦ãã¾ããã½ã¼ã¹ã³ã¼ãå ã§ Unicode ã®ã³ã¼ããã¤ã³ãã使ãã¨ã㯠U+20AC (EURO SIGN) ãªã©ã¨ã³ã¡ã³ããã¤ãã¦ããã¨ä¾¿å©ã§ãããã >>> unicodedata.name(u'A') 'LATIN CAPITAL LETTER A' >>> unicodedata.name(u'ã') 'HIRAGANA LETTER A' æåã®
ãªãªã¼ã¹ãé害æ å ±ãªã©ã®ãµã¼ãã¹ã®ãç¥ãã
ææ°ã®äººæ°ã¨ã³ããªã¼ã®é ä¿¡
å¦çãå®è¡ä¸ã§ã
j次ã®ããã¯ãã¼ã¯
kåã®ããã¯ãã¼ã¯
lãã¨ã§èªã
eã³ã¡ã³ãä¸è¦§ãéã
oãã¼ã¸ãéã
{{#tags}}- {{label}}
{{/tags}}