IBM Research
2. Whatâs this about then? ⢠Thereâs loads of groovy content on Wikipedia[citation needed]. ⢠You are lazy. ⢠You want groovy content on your site. 4. API has lots of options Param Values What does it do? format php, json, Output format. TODO redirects 0, 1 Redirect to good pages. rvsection 0, 1, 2, 3, etc Page section to get data for. action query, parse API method. 5. Getting WikiText? Easy ⢠ht
Wikipediaã®ãåé¤ãããè¨äº(AfD: Article for Deletion)ãã«é¢ãããã¼ã¿è¦è¦åããã¸ã§ã¯ã "Notabilia" ãç´¹ä»ãããè¦è¦åãéãã¦è¦åºããããã¿ã¼ã³ãèå³æ·±ãããã®ä»ãç 究ææã®èå³æ·±ãçºè¦ãç´¹ä»ãããã¾ãã触çºãããã¢ã¤ãã¢ãè¿°ã¹ãã Notabiliaã¯ãWikipediaã®è¨äºãåé¤ã«è³ãã¾ã§ã®å¯©è°éç¨ãè¦è¦å (visualize) ãã¦ããã Taraborelli & Ciampagliaã®ç 究 (è«æPDF) ããã¨ã«ããæ å ±è¦è¦å®¶ã (information visualizer) ã® Moritz Stefaner ãå¶ä½ããã Stefaner ã¯ãèªç¥ç§å¦ã®å¦å£«å·ã¨ãã¤ã³ã¿ãã§ã¼ã¹ã»ãã¶ã¤ã³ã®ä¿®å£«å·ãæã£ã¦ããã Wikipedia ã®è¨äºã¯ã次ã®éç¨ã§åé¤ãããããåé¤ä¾é ¼ããåºãããè¨äºã¯ãç¹çæ§ (notabilit
The Content Innovation Cloud Harness the power of a unified content, process and application intelligence platform to unlock the value of enterprise content. Learn more
ãã²ã¼ãã«ã®ä¸å®å ¨æ§å®çãã§å¦å¹´ãä¸ã¤ä¸ãã£ã¦ãããããã以éã«ç»å ´ãããªãµä»¥å¤ã¯ãåç»å ´æã®å¦å¹´ãè¨è¼ãã¦ããã ãåã æ¬ä½åã®ä¸»äººå ¬ãæ¬ä½åã¯å½¼ã®è¦ç¹ãéãã¦èªããããé«æ ¡2å¹´çãååã¯ä¸æãä¸å¦ã®é ã¯3å¹´éãã£ã¨æ¾èª²å¾ã«å³æ¸é¤¨ã§æ°å¼ãå±éããçæ´»ãç¶ãã¦ãããé«æ ¡ã§ã¯ãã«ã«ãããã©ã¨ã¨ãã«æ°å¦ã®åé¡ã«æãæ®ããã ç¼é¡ãããã¦ããã¨ããäºä»¥å¤ã身ä½çãªç¹å¾´ã¯ä¸æãéåã¯è¦æãä¸äººã£åãåºæ¬çã«ã¯å¤§äººããæ§æ ¼ã ããã«ã«ãããã©ã®æ°å¦ã®ã»ã³ã¹ã«ã³ã³ãã¬ãã¯ã¹ãæ±ããããæ°å¦ã®åé¡ããã¾ã解ããªãã¨ãã«æããã£ãããããªã©è² ããå«ãã®é¨åããããã¾ããã«ã«ã交éäºæ ã«éã£ãã¨ãã¯å¦æ ¡ããé£ã³åºãã¦ç é¢ã«èµ°ã£ã¦ãããªã©ãææ çãªè¡åã«èµ°ããã¨ããããå¹´é ã®ç·ã®åããããã«ã«ãããã©ã女ã®åã¨ãã¦æèãã¦ããæåãè¦ãããããèªåããç©æ¥µçã«ææã®ã¢ããã¼ãããããã¨ã¯ãªããã¦ã¼ãªã«å¯¾ãã¦ç¹
ãã¦ã³ãã¼ã http://download.wikimedia.org/ã®ãDatabase XML and SQL dumpsã®ãªã³ã¯ãã, XMLå½¢å¼, ããã³SQLå½¢å¼ã§ã®åå¾ãå¯è½ãã¾ã, ãã¦ã³ãã¼ããã¼ã¸ã«ã¦ãDump in progressãã¨ãªã£ã¦ãããã®ã¯å¦çä¸ã®ãã®ãªã®ã§ãDump completeãã¨ãªã£ã¦ããç®æãæ¢ããè±èªçWikipediaã®ãã¼ã¿ã¯ enwiki, æ¥æ¬èªçWikipediaã®ãã¼ã¿ã¯ jawikiã®ãªã³ã¯ãè¨å®ããã¦ããç®æãã, åå¥ã®ãã¦ã³ãã¼ããã¼ã¸ã«ç§»åãåå¾ããã ãã¦ã³ãã¼ã(è£å£) ä¸è¨ãããã¼ã¸ããã§ã¯, ãdump abortedãããDump in progressãã¸ã®ãªã³ã¯ãããªã, ç®çã®è¨èªçã¸ã®ãªã³ã¯ãè¦ä»ãããªãå ´åãã¾ãã«ããããã®ãããªå ´å, 以ä¸ã®URLããç´æ¥æ¥ç¶ããã æ¥æ¬èªç è±èªç ä¸å½èªç
ãã¼ã¿ã¤ã³ãã¼ãé¢é£ Wikipediaã®æ¬æããã¼ã¸ã¿ã¤ãã«ãå«ãã æ å ±ããã¦ã³ãã¼ãã§ãã¾ããå½¢å¼ã¯SQLã®ãã³ããã¡ã¤ã«ãXMLã§ãã ãã¦ã³ãã¼ãããæ å ±ãæ ¼ç´ãããã¼ãã«ã®æ§æã説æãã¦ã¾ãã Wikipediaã«æ¸ããã¦ãããã¦ã³ãã¼ããã¼ã¿ã®åãæ±ãã«ã«é¢ãã説æãã¼ã¸ã importDump.phpã使ç¨ãããã¼ã¿ã®ã¤ã³ãã¼ãæé ã解説ãã¦ãã¾ãã jawiki-latest-pages-meta-current.xml.bz2ãxml2sqlã使ç¨ãã¦ã¤ã³ãã¼ãããéã®æé ãæ¸ãã¦ããã¾ãã jawiki-latest-pages-meta-current.xml.bz2ãxml2sqlã使ç¨ãã¦ã¤ã³ãã¼ãããéã®æé ãæ¸ãã¦ããã¾ãã ãã¼ã¿å©ç¨é¢é£ Hadoop使ã£ã¦MapReduceã§Wikipediaã®ãã¼ã¿ãåãæ±ã£ã¦ãã人ã®ãã¼ã¸ã tf-idfã§pages-
ååä¸ã¯å ±åç 究ã®ãã¼ãã£ã³ã°ãPolycom ã§é»è©±ä¼è°ããããªç°¡åã«ããã¨ãã§ããã¨ã¯ã便å©ãªæ代ã«ãªã£ããã®ã ã åå¾ã¯ NLP.app åå¼·ä¼(èªç¶è¨èªå¦çã®å¿ç¨åå¼·ä¼)㧠Delip Rao and David Yarowsky. Ranking and Semi-supervised Classification on Large Scale Graphs Using Map-Reduce. In Proc. of TextGraphs-4. 2009. ãèªãããã£ã¦ãããã¨ã¯ MapReduce ãç¨ããã©ãã«ä¼æã§ãããã¾ã§èªç¶è¨èªå¦çã§ä½¿ããã¦ããªãã£ãã®ã ãã©ãåãã¦ããã¾ãããã¨ãã話(åãææã«éå¬ããã ACL-IJCNLP 2009 ã§ã»ã¼åãææ³ãèªåã使ã£ãã®ã§ãèªç¶è¨èªå¦çã«ãã®ææ³ãé©ç¨ããã®ã¯å½¼ã¨èªåãåææã¨ãããã¨ã«ãªã)ã ãããåå¼·ä¼ã®ä¸ã§ã
ã¿ã¤ãã«ã¯é£ãã§ããid:mamorukããã®æ¸ããHadoop 㧠Wikipedia ã®ããã¹ãå¦çã900åé«éå - æ¦èµéæ¥è¨ãèªãã§ããããã1Gç¨åº¦ã®ãã¼ã¿ã®åèªé »åº¦ãæ°ããã®ã«858åãããããã ã£ãã¨æããid:nokunoããã®è³æãèªãã§ã¿ãã¨åèªé »åº¦ãæ±ããéã« a b a aã¿ãããªãã¼ã¿ã a 3 b 1ã«å¤å½¢ããã®ã«sortãããã¡ã¤ã«ãuniq -cã§å¦çããã¨ãããã¨ããã£ã¦ãããããã¯ãã¾ãå¹çã®ããæ¹æ³ã§ã¯ãªãã¦è¡æ°ãNã¨ããã¨ãã«O(N log N)ã®è¨ç®æéã¨ãªã(æååæ¯è¼ã¯O(1)ã§ããããã¨ã«ãã)ã ããã«å¯¾ãã¦ãåèªã®é »åº¦ãããã·ã¥è¡¨ã§ä¿åããã¨çæ³çãªæ¡ä»¶ã®å ã§ã¯O(N)ã®è¨ç®æéã§é »åº¦ãæ±ãããã¨ãåºæ¥ãããé«éã«è¨ç®ãããã¨ãå¯è½ã¨ãªããã¨ãæå¾ ãããã ã¾ããåèªæ°ãWã¨ããã¨ããC++ã®mapã®ãããªäºåæ¢ç´¢æ¨ã使ã£ã¦ãO(N
ä»æä¸ã«å®é¨ã®å®è£ ãçµãããããã§ãªãã¨æ¥æã®æ稿ãåã«éã«åããªãã®ã§ãä»é±ããç 究室ã®ãµã¼ãã« Hadoop ãã¤ã³ã¹ãã¼ã«ãã¦ããã ç 究室ã«ã¯ãµã¼ãã20å°å¼±ããã®ã ãããã®ãã¡10å°å¼·ã使ããã¨ã«ãã¦è¨å®ããããããã®è¦æ¨¡ã ã¨ã大è¦æ¨¡ãã¨è¨ãã®ã¯æããããããããªãã(Yahoo! ã Google ã¨æ¯ã¹ã¦ãã¨ããæå³ã§ã)ãä¸è¦æ¨¡ããããã«ã¯è¨ã£ã¦ãããã ãããããã¶ããå¤ãã®å¤§å¦ãä¼æ¥ã§ä½¿ããå°æ°ããããããã ã¨æããã大ä¼æ¥ã«ããªãã¨ã§ããªãç 究ãããã®ã大å¤ä¾¡å¤ãããããä»ã®äººãã¡ãããæ°ã«ãªãã°çä¼¼ã§ããç 究ãããã®ã(ãã¼ã¿ãã¤ã³ãã©åè² ã§ã¯ãªãã¢ã¤ãã¢åè² ã«ãªãã®ã§è¦ããã¯ããã®ã ã)éè¦ã ã¨èãã¦ããã ãã¨ãã°ãæ°å°ã§ãåæ£ç°å¢ã®æ©æµãåãããããã¨ããã®ã¯PFI ãåºãã Hadoop ã®è§£æè³æã§ç¥ã£ã¦ããã®ã§ãåãã¦å°å ¥ããã¨ãã¯åèã«ãªã£ããããããã
As of April 2024, we have the following colocation facilities (each name except for Magru is derived from an acronym of the facilityâs company and an acronym of a nearby airport): eqiad Application services (primary) at Equinix in Ashburn, Virginia (Washington, DC area). codfw Application services (secondary) at CyrusOne in Carrollton, Texas (Dallas-Fort Worth area). esams Caching at EvoSwitch in
ã¦ã¼ã¶ã¼çæåã®ç¾ç§äºå ¸ãWikipediaããéå¶ãã¦ããéå¶å©å£ä½Wikimedia Foundationã¯ãè¿ãWikipediaã®ã«ãã¯ã¢ã³ããã£ã¼ã«ãå¤§å¹ ã«å¤æ´ããã¨çºè¡¨ãããå財å£ãçå ãã¦ããããåãçµã¿ãå®æ½ããã®ã¯ä»åãåãã¦ã ã Wikimedia Foundationã¯ç±³å½æé3æ25æ¥å¤ãå ¬å¼ããã°ãWikimedia Blogãã¸ã®æ稿ã§æ¬¡ã®ããã«è¿°ã¹ãããããããã¯ããã©ã«ãã®ãã¶ã¤ã³ããVectorãã¨ãããã¼ãã«å¤æ´ãã¦ã主è¦æ©è½ãè¦ã¤ãããããããï¼ä¸ç¥ï¼ãã¹ã¦ã®ã¦ã¼ã¶ã¼ã¯ããµã¤ãã®ã¬ã¤ã¢ã¦ãã大ããå¤ãã£ããã¨ã«æ°ã¥ãã ãããããããã¯ã¦ã¼ã¶ã¼ã®æå¾ ã«å¿ããããã«ããµã¤ãã®ããã²ã¼ã·ã§ã³ãç°¡ç¥åãæ¤ç´¢ããã¯ã¹ã®ä½ç½®ãå¤æ´ããã¨ã¨ãã«ãä»ã®ã¦ã§ãæ¨æºã«æºæ ããããã表示ã®ä¹±ããæ¸ãããå¤æ§ãªè§£å度ããã©ã¦ã¶å½¢å¼ãã¦ã£ã³ãã¦ãµã¤ãºã§ãæ°æ©è½ã確å®ã«æ©è½ããã
ã¦ã£ãããã£ã¢ã®ãã¼ã¸ã®ç·¨éã®æ¹æ³ã«ã¤ãã¦ãåºæ¬çãªæé ãããã¼ã¯ã¢ããã®æ¹æ³ã説æãã¾ããããã§ã¯ã¦ã£ãããã¹ãã¨ãã£ã¿ã¼ã§ã®ãã¼ã¯ã¢ããã説æãã¦ãã¾ãã2016å¹´5æã«å°å ¥ããããã¸ã¥ã¢ã«ã¨ãã£ã¿ã¼ã«ã¤ãã¦ã¯Help:ãã¸ã¥ã¢ã«ã¨ãã£ã¿ã¼ããèªã¿ãã ããã ããã§ã®èª¬æã¯ç·¨éç»é¢ã®åºãæ¹ãç·¨éçµæã®ãã¬ãã¥ã¼ã®æ¹æ³ãããã¦æ稿ããéã®æ¹æ³ã注æç¹ã§ãããã¼ã¯ã¢ããã¯ã¦ã§ããã©ã¦ã¶ã§è¡¨ç¤ºããããã®è¡¨è¨æ¹æ³ã§ãããè¨èã«ãªã³ã¯ããããã表示ãå¤ããããç»åã表ã使ããã¨ãã§ãã¾ããä¸éãã®ãã¼ã¯ã¢ãããç´¹ä»ããã®éã®è«¸æ³¨æã説æãã¾ãã代表çãªãã¼ã¯ã¢ããã¯ãHelp:æ©è¦è¡¨ãã覧ãã ããã 試ãæ¸ãã¯ãç·´ç¿ç¨ã®ãµã³ãããã¯ã¹ã使ã£ã¦ãã ããã
Unicodeã§è¦å®ããã¦ããæåã«å¿ è¦ãªãã®ãããã°ããã¹ã¦ä½¿ããã¨ãã§ãã¾ãããã ããJIS X 0201ã®ã©ãã³æåé¡ãJIS X 0213ãIBMæ¡å¼µæ¼¢åã®ãããã«ãè¦å®ããã¦ããªãæåã¯ãã§ããã ã使ããªãããã«ãã¦ãã ããã ã©ãã³æåï¼è±åï¼ãã¢ã©ãã¢æ°åãªã©ãJIS X 0201ã®ã©ãã³æåé¡ï¼ããããåè§è±æ°åï¼ã§è¦å®ããã¦ãããã®ã¯ãããç¨ãã¾ããããã§ãªãæ¼¢åã»å¹³ä»®åã»çä»®åãªã©ã¯ãJIS X 0213ã«è¦å®ããã¦ãããã®ï¼ããããå ¨è§æåï¼ãããã°ãããç¨ãã¾ãããã ããç°ä½åã«ã¤ãã¦ã¯åºæåè©ãªã©ãé¤ãJIS X 0208ã«è¦å®ããã¦ãããã®ãåªå ãã¦ãã ãããJIS X 0201ã®ä»®åæåé¡ï¼ããããåè§ã«ãï¼ã¯å¼ç¨ãªã©ç¹æ®ãªå ´åãé¤ã使ããªãã§ãã ããã JIS X 0201ã®ã©ãã³æåé¡ã®è¨å·ã®ãªãã«ã¯ãå ´åã«ãã£ã¦ã¯å ¨è§å½¢ãç¨ããå¿ è¦ããããã®ããå ¨è§å½¢ã
ã¹ã¿ã¤ã«ããã¥ã¢ã«ã§ã¯ãã¦ã£ãããã£ã¢ã«ããã¦è¨äºãæ¸ãéã®æç« ã¹ã¿ã¤ã«ã«ã¤ãã¦è§£èª¬ãã¾ããã¦ã£ãããã£ã¢ã®è¨äºã§ã¯ãæç« ã®ã¹ã¿ã¤ã«ããããè¨äºã®å 容ã®æ¹ã大äºã§ãããå·çè ã¯å½ããã¥ã¢ã«ã§è¦å®ããã¹ã¿ã¤ã«ã«çµ¶å¯¾ã«å¾ããªããã°ãªããªãããã§ã¯ããã¾ãããããããè¨äºã®èªã¿ããããç·¨éã®ãããããä¿ã¤ããã«ãä¸è²«ããã¹ã¿ã¤ã«ã«æ²¿ã£ã¦å·çãããã¨ãæ¨å¥¨ããã¦ãã¾ãã è¨äºå ¨ä½ã®é ç½®é ã®ãããªã¬ã¤ã¢ã¦ãã¯ã¹ã¿ã¤ã«ããã¥ã¢ã«ã®ä¸é¨ã§ããããã«é¢ãã¦ã¯Wikipedia:ã¹ã¿ã¤ã«ããã¥ã¢ã«/ã¬ã¤ã¢ã¦ããåç §ãã¦ãã ãããåèæç®ãå¤é¨ãªã³ã¯çè¨äºã®ãããªåºæ¬è¦ç´ ã«é¢ãã¦ãããã«è©³è¿°ããã¦ãã¾ãã ã¹ã¿ã¤ã«ããã¥ã¢ã«ã§ã¯ãåèªã強調ããããã«å¤ªåã«ãããã表ã表è¨ããæ¹æ³ã«ã¤ãã¦ã¯è§£èª¬ãã¦ãã¾ãããããããæç« ã®ãã¼ã¯ã¢ããæ¹æ³ã«é¢ãã¦ã¯Help:ãã¼ã¸ã®ç·¨éãåç §ãã¦ãã ããã ããã§èª¬æããã¹ã¿ã¤
è¨äºåã¯#è¨äºåã®ä»ãæ¹ã®ç®å®ã«é©åããããåºæ¬çã«ã¯æ¥æ¬èªã§ã®æ£å¼å称ã使ç¨ãã¾ãããã®éãå称ãå¤å½èªã®ãã®ã¯æå種ã«å¿ãã¦#ç¥å·ã»è¨å·ã»çä»®åèªãªãã³ã«#æ¼¢åã«å¾ã£ã¦ãã ããããã使ãããç¥ç§°ãå¥åãå¥è¡¨è¨ãªã©ã¯è¨äºåã«ä½¿ãã®ã§ã¯ãªããæ£å¼ãªè¨äºåã¸ã®ãªãã¤ã¬ã¯ãï¼è»¢éï¼ãã¼ã¸ã«ãã¾ãããã詳ããã¯Wikipedia:ãªãã¤ã¬ã¯ããåç §ãã¦ãã ããã Ããæ¥æ¯è°·é«æ ¡ã â âãæ±äº¬é½ç«æ¥æ¯è°·é«çå¦æ ¡ãâ»å¦æ ¡åã«é¢ãã¦ã¯ããã¸ã§ã¯ã:å¦æ ¡#è¨äºåãåç §ãã¦ãã ããã Ããç§ç«éæé«æ ¡ã â âãéæä¸å¦æ ¡ã»é«çå¦æ ¡ã Ããåæé®®ã â âãæé®®æ°ä¸»ä¸»ç¾©äººæ°å ±åå½ã â»å½åã«é¢ãã¦ã¯æ £ä¾ãããã¾ãã#å½åãåç §ãã¦ãã ããã ÃãTBSã â âãTBSãã¬ãã â»æ³äººã»å£ä½åã«é¢ãã¦ã¯#æ³äººã»å£ä½åãåç §ãã¦ãã ããã 訳èªãããäºè±¡ã«é¢ããè¨äºãªã©ãæ£å¼ãªå称ããªãå ´åã¯æ¥æ¬èªã§ã®é©
ã¦ã£ãããã£ã¢ã§æ°ãããã¼ã¸ï¼é ç®ï¼ãä½ãæ¹æ³ã¨æ³¨æç¹ã«ã¤ãã¦èª¬æãã¾ããæ¢ã«é¡ä¼¼ã®è¨äºãåå¨ããªãããè¨äºåã®ä»ãæ¹ãæ°è¦ä½æããã»ã©ã®ãã¼ããã©ãããè¨äºå 容ã®æ³¨æãèä½æ¨©ã®æ³¨æãªã©ã§ãã試ãæ¸ãã¯ãç·´ç¿ç¨ã®ãµã³ãããã¯ã¹ã§ãé¡ããã¾ãï¼ãã°ã¤ã³ããã°ãèªèº«å°ç¨ã®å©ç¨è ãµã³ãããã¯ã¹ã使ãã¾ãï¼ã å ã«Wikipedia:è¨äºãå·çããã®æ³¨æç¹ãçèªãã¦ãã ãããè¨äºã®ä½æãç·´ç¿ãããå ´åã¯ãç·´ç¿ç¨ã®Wikipedia:ãµã³ãããã¯ã¹ããæ´»ç¨ãã ããï¼ãã°ã¤ã³ããã°ãèªèº«å°ç¨ã®å©ç¨è ãµã³ãããã¯ã¹ã使ãã¾ãï¼ã æ°è¦ãã¼ã¸ãä½æããéã«ã¯ã以ä¸ã®ãããªæ³¨æç¹ãããã¾ãã ã¦ã£ãããã£ã¢å ã«é¡ä¼¼ã®è¨äºãæ¢ã«åå¨ããªãã確èªãã¦ãã ãããæ¤ç´¢ããã¯ã¹ã«ã¦ãè¨äºåã¨ãããæååãæ¤ç´¢ãã¾ããããå¼ç§°ããåãäºæã¸ãªãã¤ã¬ã¯ãï¼è»¢éï¼ããã¦ããå ´åãããã¾ãããã®ãªãã¤ã¬ã¯ããä½æããã¦ããªã
ã¡ã³ããã³ã¹
ãç¥ãã
é害
ãªãªã¼ã¹ãé害æ å ±ãªã©ã®ãµã¼ãã¹ã®ãç¥ãã
ææ°ã®äººæ°ã¨ã³ããªã¼ã®é ä¿¡
å¦çãå®è¡ä¸ã§ã
j次ã®ããã¯ãã¼ã¯
kåã®ããã¯ãã¼ã¯
lãã¨ã§èªã
eã³ã¡ã³ãä¸è¦§ãéã
oãã¼ã¸ãéã
{{#tags}}- {{label}}
{{/tags}}