2024-02-01ãã1ã¶æéã®è¨äºä¸è¦§
åãã« ã㢠éçºç°å¢ æºå åææé ãã¼ã¿ããã¦ã³ãã¼ãã»ãã¼ã WAND-SNRã使ã£ã¦é³å£°ãã¼ã¿ã®åæ åæçµæããã¹ãã°ã©ã ã§è¡¨ç¤º WADA-SNRå¤ã100以ä¸ã®ãã¼ã¿åæ°ãåå¾ åè Windowsã§ã®ããã»ã¹ã¨ã©ã¼å¯¾å¿ 並åå¦çå¯¾å¿ åãã« reazon-research/reazâ¦
åãã« ã㢠éçºç°å¢ æºå å®è£ åãã« WADA SNRã使ã£ã¦é³å£°ã®è©ä¾¡ãã§ããã¿ãããªã®ã§ã以ä¸ã使ã£ã¦å®éã«è©¦ãã¦ã¿ã¾ã gist.github.com ã㢠å®è¡ããã¨ä»¥ä¸ã®ãããªãã°ã表示ããã¾ã Calculated SNR: 13.775574879980502 éçºç°å¢ Windows 11 Pythoâ¦
åãã« ç°å¢ æºå å®è¡ åãã« TTSã®ãã¼ã¿ã»ããã«ã¯ã¯ãªã¼ã³ãªé³å£°ãå¿ è¦ã§ãããã¯ãªã¼ã³ãªé³å£°ãéããããã«ã¯å´åãããªã大å¤ã«ãªãã¾ãã 以ä¸ã¯Google ãçºè¡¨ããå£åããé³å£°ãé«å質ã«å¤æããé³å£°å¾©å (SR)æè¡ã§ããä»åã¯ãã¡ãã®åç¾ãªãã¸ãâ¦
åãã« ã㢠éçºç°å¢ æºå æåãè¶ãåã³ãã¡ã¤ã«ä¿å ã¨ã©ã¼å¯¾å¿ CUDAãenabledã«ãªã£ã¦ããªãå ´å ReazonSpeech/pkg/nemo-asrã®ã¤ã³ã¹ãã¼ã«ã失æãã åèãµã¤ã åãã« STTãTTSã®å¦ç¿çãããéã«æåãã¼ã¿ãå¿ è¦ã«ãªãã®ã§ãããé³å£°ã ãããå ´åâ¦
åãã« ããã£ã¦ãã人åã ã㢠éçºç°å¢ ç°å¢ã®æºå ãã¼ã¿ã»ããã®æºå ãã¼ã¿ã®é ç½® åå¦çã®å®è¡ äºåå¦ç¿ã®éå§ pthãsafetensorsã«å¤æãã ãã«ãGPUã§å¦ç¿ãããå ´å åãã« Style-Bert-VITS2ã¯ãæ¥æ¬èªã«ãããã¢ã¯ã»ã³ããªã©ã®æ¹åã«ããTTSã§ã¯â¦
åãã« ç°å¢ æºå å®è¡ åã㫠以ä¸ã試ãã¦ã¿ã¾ã huggingface.co ç°å¢ L4 GPU(Jupyter Notebook) ubuntu22.04 æºå 以ä¸ã®ã©ã¤ãã©ãªãå ¥ãã¾ã !pip install transformers bitsandbytes accelerate ã¢ãã«ã®ãã¦ã³ãã¼ãããã¾ã # pip install bitsandbyâ¦
åãã« éçºç°å¢ ãã¼ã¿æºå datasetsã使ã£ããã¼ã«ã«ã®jsonã®ãã¼ã åã㫠以ä¸ã®ããã«QLoRAã使ã£ã¦fine turningãè¡ãã¾ããããç¬èªãã¼ã¿ã使ãããå ´åã®é©å½æ¹æ³ã«ã¤ãã¦ãã£ã¦ã¿ã¾ã ayousanz.hatenadiary.jp éçºç°å¢ cuda:12.2.0-base-ubuntu22â¦
ç°å¢ æºå å®è¡ ç°å¢ python 3.11 æºå 以ä¸ã®ã©ã¤ãã©ãªãã¤ã³ã¹ãã¼ã«ãã¾ã pip install pyarrow pip install pandas å®è¡ import pandas as pd # æ¢ã«ããDataFrameãParquetå½¢å¼ã§ä¿åãã¾ãã file_path = "./data.parquet" # Parquetãã¡ã¤ã«ãèªã¿è¾¼â¦
åãã« ç°å¢ æºå å®è¡ åãã« 35,000æéã®ã³ã¼ãã¹ã§ãããReazonSpeech v2 ã³ã¼ãã¹ããå ¬éãããã®ã§ãå®éã«ã©ã®ãããªãã¼ã¿ãå ¥ã£ã¦ããã®ãã確èªãããã¨æãã¾ãã prtimes.jp ç°å¢ Google Colob (CPU) æºå å¿ è¦ãªã©ã¤ãã©ãªãå ¥ãã¦ããã¾ã !pâ¦