2022å¹´04æ25æ¥ NDLã©ãã®GitHubããã次ã®2件ãå ¬éãã¾ãããã©ã¤ã»ã³ã¹ã詳細ã«ã¤ãã¦ã¯ãåãªãã¸ããªã®READMEããåç §ãã ããã NDLOCR å½ç«å½ä¼å³æ¸é¤¨ï¼ä»¥ä¸ããå½é¤¨ãã¨ãã¾ããï¼ã令å3年度ã«æ ªå¼ä¼ç¤¾ã¢ã«ãã©AIã½ãªã¥ã¼ã·ã§ã³ãºã«å§è¨ãã¦å®æ½ããOCRå¦çããã°ã©ã ã®ç 究éçºäºæ¥ã®ææã§ãããæ¥æ¬èªã®OCRå¦çããã°ã©ã ã§ãã ãã®ããã°ã©ã ã¯ãå½ç«å½ä¼å³æ¸é¤¨ãCC BY 4.0ã©ã¤ã»ã³ã¹ã§å ¬éãããã®ã§ãããªããæ¢åã®ã©ã¤ãã©ãªçãå©ç¨ãã¦ããé¨åã«ã¤ãã¦ã¯å¯å®¹åãªã¼ãã³ã©ã¤ã»ã³ã¹ã®ãã®ãæ¡ç¨ãã¦ãããããåç¨éåç¨ãåããèªç±ãªæ¹å¤ãå©ç¨ãå¯è½ã§ãã æ©è½ãã¨ã«7ã¤ã®ãªãã¸ããªã«åããã¦ãã¾ãããä¸è¨ãªãã¸ããªã®æé ã«å¾ããã¨ã§ãDockerã³ã³ããã¨ãã¦æ§ç¯ã»å©ç¨ãããã¨ãã§ãã¾ãã ãªãã¸ã㪠: https://github.com/ndl-lab/
ãããã£ã¦æ°åã ããåãåºãããã§ãã éãåè§ã®ï¼é ç¹ã®åº§æ¨ããããã°ã¢ãã£ã³å¤æãå®æ½ãã¦ãæ°åã ããåãåºããã¨ãã§ãã¾ããï¼é ç¹ã®åº§æ¨ãèªåã§åå¾ãããã¨æãã¾ãã 試è¡é¯èª¤ã®è¨é²ãæ®ãã¦ããã¾ããï¼æçµçã«ã¯arucoãã¼ã«ã¼ã使ãã¹ãã¨ã®çµè«ã«è³ãã¾ããï¼ è©¦è¡ï¼ãRGBã®éè²ã ããåãåºãã°ããã®ã§ã¯ãªããï¼ éè²ã®æ ã¯ãBã®æ°å¤ãé«ãã®ã§ãããå©ç¨ãã¦edge detectionãããã°ããã®ã§ã¯ãªããã img = cv2.imread("C:\\Users\\tegaki_1.jpg") img_resize = cv2.resize(img, (img.shape[1]//7,img.shape[0]//7)) img_B = img_resize[:,:,0] cv2.imshow('image',img_B) cv2.waitKey() çµè«ï¼å¤±æãéãç½ã«æ¶ãã¦
é·ã»çæè¨æ¶ (LSTM) ã»ã«ã¯ãã¼ã¿ãé£ç¶çã«å¦çããé·æéã«ãã£ã¦ãã®é ãç¶æ ãä¿æãããã¨ãã§ããã é·ã»çæè¨æ¶ï¼ã¡ããã»ãããããããè±: Long short-term memoryãç¥ç§°: LSTMï¼ã¯ã深層å¦ç¿ï¼ãã£ã¼ãã©ã¼ãã³ã°ï¼ã®åéã«ããã¦ç¨ãããã人工å帰åãã¥ã¼ã©ã«ãããã¯ã¼ã¯ï¼RNNï¼ã¢ã¼ããã¯ãã£ã§ãã[1]ãæ¨æºçãªé ä¼æåãã¥ã¼ã©ã«ãããã¯ã¼ã¯ã¨ã¯ç°ãªããLSTMã¯èªèº«ããæ±ç¨è¨ç®æ©ãï¼ããªãã¡ããã¥ã¼ãªã³ã°ãã·ã³ãè¨ç®å¯è½ãªãã¨ãä½ã§ãè¨ç®ã§ããï¼ã«ãããã£ã¼ãããã¯çµåãæãã[2]ãLSTMã¯ï¼ç»åã¨ãã£ãï¼åä¸ã®ãã¼ã¿ç¹ã ãã§ãªããï¼é³å£°ãããã¯åç»ã¨ãã£ãï¼å ¨ãã¼ã¿é åãå¦çã§ãããä¾ãã°ãLSTMã¯åå²ããã¦ããªããã¤ãªãã£ãææ¸ãæåèªè[3]ãé³å£°èªè[4][5]ã¨ãã£ã課é¡ã«é©ç¨å¯è½ã§ããããã«ã¼ã ãã¼ã° ãã¸ãã¹ã¦ã£ã¼ã¯èªã¯ããããã®
æ£è¦è¡¨ç¾ãå©ç¨ããOCRããã¹ãã®ã¯ãªã¼ãã³ã°ææ³ 2020å¹´3æ11æ¥ 2020å¹´6æ24æ¥ The Programming Historian æ¥æ¬èªè¨³ OCR, programing, Python, ä¸ç´, æ£è¦è¡¨ç¾ dh_portal Laura Turner OâHara ã¹ãã£ã³ç»åãããã¹ããã¼ã¿ã«å¤æããå å¦çæåèªèï¼Optical Character Recognition; OCRï¼ã¯ãæ´å²ç 究ã«ã¨ã£ã¦å¤©ããã®è´ãç©ã§ãããã¨ã¯æããã§ãããã®ã¬ãã¹ã³ã§ã¯ãOCRã§ããã¹ãåããããã¼ã¿ããã使ããããããæ¹æ³ãå¦ã³ã¾ãã ç®æ¬¡ ã¯ããã«æ£è¦è¡¨ç¾ï¼Regexï¼Pythonã¨æ£è¦è¡¨ç¾å§ããåã«è¦ãã¦ããã¹ã2ã¤ã®ãã¨ãµã³ãã«Pythonãã¡ã¤ã«VERBOSEã¢ã¼ããæ´»ç¨ ã¯ããã« ã¹ãã£ã³ç»åãããã¹ããã¼ã¿ã«å¤æããå å¦çæåèªèï¼Optical Character
ãã®è¨äºã®ã²ã¨ãã¨ã¾ã¨ã GASã§google Driveã®ãã©ã«ãã¼ã«ã¢ãããã¼ããããç»åï¼PDFãããã¾ãï¼ããgoogle drive APIã¨DocumentApp Classãã¤ãã£ã¦OCRãæ å ±ãããã¹ãã¨ãã¦ã¨ãã ãæ¹æ³ãç´¹ä»ãã¾ãã ããã使ããã¨ã§ã以ä¸ã®ãããªãã¨ãã§ããã¨æãã¾ãã ç´ã®ã·ãã表ãã¹ããã§ã¨ã£ã¦Driveã«ããã¦ã°ã¼ã°ã«ã«ã¬ã³ãã¼ãã¿ã¤ã ããªã¼ã«ã·ãããèªåç»é²ãã ã¬ã·ã¼ããèªã¿è¾¼ãã§ååãéé¡ãã¹ãã¬ããã·ã¼ãã«èªåç»é²ãã ã¯ããã« OCRï¼å å¦æåèªèï¼ãgoogleã®APIã§ããGoogle Cloud Vision APIã§ã§ãããã¨ã¯ç¥ã£ã¦ãããã§ããããã®APIã¯GAS(google app script)ã§ã¯ãµã¼ãã¹ã¨ãã¦æä¾ããã¦ããªãã®ã§ãã£ã¨ã¯ä½¿ãã¾ããã§ããã ã§ããããµã¼ãã¹ã¨ãã¦æä¾ããã¦ããDrive APIãã¤ããã
ä½ k ä¸ã® n 次å å°å½±ç©ºé Pn(k) ã¨ã¯ããã¯ãã«ç©ºé kn+1 ããåç¹ãé¤ãã空éãä½ k ã®ä¹æ³ç¾¤ k* ã®ã¹ã«ã©ã¼åã®ä½ç¨ã§å²ã£ã空é ã®ãã¨ã§ãããããã¨ãkn+1 ã®éã®åååå f ã¯ãã¹ã«ã©ã¼åã¨å¯æã§ãããã¾ã 0 ã§ãªããã¯ãã«ã 0 ã§ãªããã¯ãã«ã«åããããPn(k) ã®éã®ååååãèªå°ãããããã Pn(k) ã®å°å½±å¤æã§ããã
2次å ã¬ãã¼ã«ãã£ã«ã¿ãæ¼¢åã«ä½ç¨ãããä¾ãå·¦ä¸ããæè¨åãã«åç»åãæ¬éæ³¢ã®æ¹åã45°ã¥ã¤é ã«å¤ãã4ã¤ã®ãã£ã«ã¿ã®ä½ç¨çµæãåã³ããã4ã¤ã®ãã£ã«ã¿çµæãéãåãããå³ã表ãã ã¬ãã¼ã«ãã£ã«ã¿ï¼è±: Gabor filterï¼ã¯ãç»åå¦çã®ãã¯ã¹ãã£ã¼è§£æçã«ç¨ããããç·åãã£ã«ã¿ã®ä¸ç¨®ãï¼2次å ã®ã¬ãã¼ã«ãã£ã«ã¿ã§ã¯ï¼ç»åã®åç¹å¨ãã®å±æé åã«ããã¦ãæ¹åæ¯ã«ç¹å®ã®å¨æ³¢æ°æåãæ½åºãããã¨ãã§ããã è¹å½©èªèãæç´èªè¨¼ã«ãå¿ç¨ããã¦ããä»ãåºä¹³é¡ã®è³ã®ä¸æ¬¡è¦è¦éã«ããåç´åç´°èã®æ´»åãã¢ãã«åã§ãããã¨ã示ããã¦ãããå称ã¯ã¬ã¼ãã«ã»ãã¼ãã·ã¥ã«å ã[1]ã å®ç¾©[ç·¨é] 2次å ã¬ãã¼ã«ãã£ã«ã¿ã®ã¤ã³ãã«ã¹å¿çã®ä¾ ç´°é¨ãç°ãªã種ã ã®å®ç¾©ãããããåºæ¬çã«ã¯ã¬ã¦ã¹é¢æ°ï¼ã¬ã¦ã·ã¢ã³ã¨ã³ããã¼ãã¨ãå¼ã°ããï¼ã¨ä¸è§é¢æ°ï¼æ¬éæ³¢ã¨ãå¼ã°ããï¼ã®ç©ã¨ãã¦å®ç¾©ããã[2][3][4]ï¼ ã®
â ã¤ãã³ã â¨ï¼ãSenseTime Japan à Sansanãç»åå¦çåå¼·ä¼ https://sansan.connpass.com/event/230636/ â ç»å£æ¦è¦ ã¿ã¤ãã«ï¼æ·±å±¤å¦ç¿æ代ã®æåèªèã¨ãã®å¨è¾º çºè¡¨è ï¼ â¨æè¡æ¬é¨ DSOC R&Dç ç©¶å¡ ãå®®æ¬ åªä¸ â¼Twitter https://twitter.com/SansanRandD
èããæ´çããããã®å人çãªã¡ã¢çãªãã®ã§ãããããããªã§ãã ç»åã»åç»ã»é³å£°ãªã©ããç¹å®ã®ããã®ããèªèããããã¨ã£ã¦å¤ã ããã¾ãããã å°ãªãã¨ãç§ã®ä¸ã§ã¯ããã£ã¡ã ãããã¾ãã ãã¦ãä»åã®è¨äºã¯ç©ä½èªèã®åé¡è²ã 調ã¹ãçµæãèªåã®ä¸ã§æ´çããããã®ã¡ã¢ã§ãã åºæ¬æ¦å¿µã®æ´çãç®çã§ãã ç©ä½èªèã¨ã¯ ã¾ãç©ä½èªèã¨ã¯ä½ããï¼ã¨ããåãã«ã¤ãã¦ãç©ä½èªèãåé¡é åã®è¦³ç¹ãã大å¥ãã¦ï¼ç³»çµ±ããããã§ãã ç¹å®ç©ä½èªè ä¸è¬ç©ä½èªè ã¾ã1ã«ã¤ãã¦ããããç§ã®æ¬²ããç©ä½èªèã§ãã æ¢ç¥ã®ç©ä½Aã«ã¤ãã¦ãç»åä¸ã®ã©ãã«ç©ä½Aãåå¨ããã(ãããã¯åå¨ããªã)ã調ã¹ã 2ã§ãããç»åãä½ã示ãã¦ãããã®ãªã®ããè¨ãå½ã¦ã(è»ã®ç»åï¼ã¨ã)ç©ä½èªèã§ãã ç»åå¦ç以å¤ã®æ¹é¢ã®ç¥èãå¿ è¦ã¨ãªã£ã¦ããã®ã§ããã¡ãã¯å½é¢ã¯ä¿çã¨ãã¾ãã ç¹å®ç©ä½èªèã®å¤å ¸çææ³ å¤ãããããã¢ã«ã´ãªãºã ã¨ãã¦ã次
ç»åã«å«ã¾ããæåãããã¹ããã¼ã¿åããå å¦æåèªè(OCR)ã¯ãè«æ±æ¸ãã¬ã·ã¼ããååºãªã©ã®å°å·ç©ããã¸ã¿ã«åããææ³ã¨ãã¦åºã使ããã¦ãã¾ãããããªOCRããã£ã¼ãã©ã¼ãã³ã°ãã¬ã¼ã ã¯ã¼ã¯ã§å®ç¾ããã®ãããªã¼ãã³ã½ã¼ã¹ã®OCRã·ã¹ãã ãPP-OCRv2ãã®ãã¢çã¨ãªããPaddleOCRãã§ãã PaddleOCR - a Hugging Face Space by akhaliq https://huggingface.co/spaces/akhaliq/PaddleOCR GitHub - PaddlePaddle/PaddleOCR: Awesome multilingual OCR toolkits based on PaddlePaddle ï¼practical ultra lightweight OCR system, support 80+ languages recog
ãã¼ã¿ãã«ã¹ãã£ãã使ã£ã¦ãªã¢ã«ã¿ã¤ã ã§å å¦æåèªè (OCR) ãè¡ã£ã¦ããåç» å å¦æåèªèï¼ããããããã«ããããè±: Optical character recognitionï¼ã¯ãæ´»åãææ¸ãããã¹ãã®ç»åãæåã³ã¼ãã®åã«å¤æããã½ããã¦ã§ã¢ã§ãããç»åã¯ã¤ã¡ã¼ã¸ã¹ãã£ãã¼ãåçã§åãè¾¼ã¾ããææ¸ã風æ¯åçï¼é¢¨æ¯å ã®çæ¿ã®æåãªã©ï¼ãç»åå ã®åå¹ï¼ãã¬ãæ¾éç»åå ãªã©ï¼ã使ããã[1]ãä¸è¬ã«OCRã¨ç¥è¨ãããã ãã¹ãã¼ããè«æ±æ¸ãéè¡åå¼æç´°æ¸ãã¬ã·ã¼ããååºãã¡ã¼ã«ããã¼ã¿ãææ¸ã®å°å·ç©ãªã©ãç´ã«è¨è¼ããããã¼ã¿ããã¼ã¿å ¥åããææ³ã¨ãã¦åºã使ãããç´ã«å°å·ãããææ¸ããã¸ã¿ã¤ãºããããã³ã³ãã¯ããªå½¢ã§è¨é²ããã®ã«å¿ è¦ã¨ããããããã«ãæåã³ã¼ãã«å¤æãããã¨ã§ã³ã°ããã£ãã³ã³ãã¥ã¼ãã£ã³ã°ãæ©æ¢°ç¿»è¨³ãé³å£°åæã®å ¥åã«ã使ããããã«ãªããããã¹ããã¤ãã³ã°ãå¯è½ã¨ãªããç
ã¯ããã« ããã¿ã¼ã³èªèã¨æ©æ¢°å¦ç¿ãã®ç¬å¦æã®ã¾ã¨ãã§ããä¸é£ã®è¨äºã¯ãæ°å¼ã®è¡éåããã¾ãã¯ãRã»Pythonã§ã®å®è£ ãããã¢ã«ã´ãªãºã ã®ç解ãè£å©ãããã¨ãç®çã¨ãã¦ãã¾ããæ¬ã¨ãããã¦èªãã§ãã ããã ãã®è¨äºã¯ã9.3.3é ã®å 容ã§ããæ··åãã«ãã¼ã¤åå¸ã«å¯¾ããEMã¢ã«ã´ãªãºã ã«ããæå°¤æ¨å®ãPythonã§å®è£ ãã¾ãã ãæ°çç·¨ã www.anarchive-beta.com ãä»ã®ç¯ä¸è¦§ã www.anarchive-beta.com ããã®ç¯ã®å 容ã ã¯ããã« ã»Pythonã§å®è£ ã»MNISTãã¼ã¿ã»ããã®æºå ã»åæå¤ã®è¨å® ã»æ¨è«å¦ç ã»ã³ã¼ãå ¨ä½ ã»å¦çã®è§£èª¬ ã»Eã¹ããã ã»Mã¹ããã ã»å¯¾æ°å°¤åº¦ã®è¨ç® ã»æ¨è«çµæã®ç¢ºèª ã»ãã©ã¡ã¼ã¿ã®ç¢ºèª ã»å¦ç¿ã®æ¨ç§»ã®ç¢ºèª ã»åé¡çµæã®ç¢ºèª ã»ä»ã®çµæ åèæç® ãããã« ã»Pythonã§å®è£ MNISTãã¼ã¿ã»ãããç¨ãã¦ãæ··å
ç¡æãã¼ã«ã§ãããã¾ãããé«ãã»ãã¥ãªãã£ã¬ãã«ã«ã¦ãã¼ã¿ã¯ç®¡çããã¦ãã¾ãã ã¾ãããå ¥åããã ãããã¼ã¿ãæåèªèããæ å ±ãåæã«ä»ã®ç®çã«äºæ¬¡å©ç¨ãããã¨ã¯ãããã¾ããã
R&D ãã¼ã ã®å¾³ç°ï¼@dakutonï¼ã§ãã æè¿ã¯ç»åã¨ããã¹ãã®çéã«ãã¾ãã ä»åè¨äºã®ã¾ã¨ã ç°¡åã«ã¾ã¨ããã¨ä»¥ä¸ã®ã¨ããã§ãã ããã¤ãã®è¶ 解å(é«è§£å度å)ã¢ãã«ãOpenCV extra modules(opencv_contrib)ã¤ã³ã¹ãã¼ã« + ã³ã¼ãæ°è¡è¨è¿°ã§å°å ¥å¯è½ è¶ è§£åã«éãããæåãä¸å®ãµã¤ãºä»¥ä¸ã«ãªããããªåå¦ç -> OCR解æ ãå®æ½ããã¨ãOCR精度æ¹åã«ã¤ãªãããã¨ããã è¶ è§£åã«ããè¦ãç®ã®æ»ãããã«æ¯ä¾ãã¦ãOCR精度æ¹åã«ã¤ãªããããã§ã¯ãªã ä½è¨ç®ã³ã¹ããªç»åæ¡å¤§ããè¶ è§£åã«å¤æ´ããæ©æµã¯çºçãã«ãã ãã¹ãæ¡ä»¶ãå¤ããå ´åãéã£ãçµæã«ãªãå¯è½æ§ãã(ç¨ããOCRã¨ã³ã¸ã³ãç»åã®å£åæ¡ä»¶ãOpenCVæªæä¾ã®å¾çºã¢ãã«å©ç¨ãªã©) å®é¨å 容 å©ç¨ããOCRã¨ã³ã¸ã³ã®å®è¡æ¡ä»¶ã¯å¤ããã«ãåå¦çé¨åã®ã¿å¤æ´ããå ´åã®OCR精度ã»é度å¤åã調ã¹ã¾
ã©ã³ãã³ã°
ã©ã³ãã³ã°
ã©ã³ãã³ã°
ãªãªã¼ã¹ãé害æ å ±ãªã©ã®ãµã¼ãã¹ã®ãç¥ãã
ææ°ã®äººæ°ã¨ã³ããªã¼ã®é ä¿¡
å¦çãå®è¡ä¸ã§ã
j次ã®ããã¯ãã¼ã¯
kåã®ããã¯ãã¼ã¯
lãã¨ã§èªã
eã³ã¡ã³ãä¸è¦§ãéã
oãã¼ã¸ãéã
{{#tags}}- {{label}}
{{/tags}}