åãã«
text to audioãããããããä¸ã§åºã¦ãã AudioLDMã¨ããCLAPã使ã£ã¦é³å£°ãçæããã¢ãã«ã触ã£ã¦ã¿ã¾ã
éçºç°å¢
- uv
- python 3.8
- windows 11(powershell)
ã»ããã¢ãã
ã¾ãã¯å ¬å¼ãconda 3.8ãæå®ãã¦ããã®ã§ãuv ã§python=3.8ã®ç°å¢ãä½ãã¾ãã
uv venv -p 3.8 .venv\Scripts\activate
ã©ã¤ãã©ãªãå ¥ãã¦ããã¾ã
uv pip install audioldm
ãã®ã¾ã¾ã ã¨å®è¡ããã¨ã¨ã©ã¼ã«ãªãã®ã§ãsetuptoolsãã¢ãããã¼ããã¾ã
ã¨ã©ã¼æ
from pkg_resources import resource_filename ModuleNotFoundError: No module named 'pkg_resources'
uv pip install --upgrade setuptools
gradioã4.x.x以ä¸ã ã¨åããªãã®ã§ãgradioã3ç³»ã§ã¤ã³ã¹ãã¼ã«ãã¾ã
4ç³»ã§å®è¡ããã¨ä»¥ä¸ã®ã¨ã©ã¼ãåºã¾ã
Traceback (most recent call last): File ".\app.py", line 225, in <module> with gr.Box(): AttributeError: module 'gradio' has no attribute 'Box'
uv pip install gradio==3.50.2
torchãcpuã«ãªã£ã¦ããã®ã§ãCUDAçãã¤ã³ã¹ãã¼ã«ãã¾ã
uv pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121 --force-reinstall
ä¸é¨ã©ã¤ãã©ãªã§ã¨ã©ã¼ã«ãªãã®ã§ãã¢ãããã¼ããã¾ã
uv pip install --upgrade typing_extensions
æ¨è«
以ä¸ã§gradioã®GUIæ¨è«ç»é¢ãéãã¾ã
python .\app.py
ååã®submitã¯ã¢ãã«ã®ãã¦ã³ãã¼ããèµ°ãã¾ã
Load AudioLDM: %s audioldm-m-full
Downloading the main structure of audioldm-m-full into C:\Users\user\.cache/audioldm
41% |#############################
以ä¸ã®ããã«é³ãçæããã¾ã
requirements.txt
requirements.txt
ã¯ä»¥ä¸ã§ã
aiofiles==23.2.1 altair==5.4.1 annotated-types==0.7.0 anyio==4.5.2 attrs==24.2.0 audioldm==0.1.1 audioread==3.0.1 certifi==2024.8.30 cffi==1.17.1 chardet==5.2.0 charset-normalizer==3.4.0 click==8.1.7 colorama==0.4.6 contourpy==1.1.1 cycler==0.12.1 decorator==5.1.1 einops==0.8.0 exceptiongroup==1.2.2 fastapi==0.115.5 ffmpy==0.4.0 filelock==3.13.1 fonttools==4.55.0 fsspec==2024.2.0 ftfy==6.2.3 gradio==3.50.2 gradio-client==0.6.1 h11==0.14.0 httpcore==1.0.7 httpx==0.27.2 huggingface-hub==0.26.2 idna==3.10 importlib-metadata==8.5.0 importlib-resources==6.4.5 jinja2==3.1.3 joblib==1.4.2 jsonschema==4.23.0 jsonschema-specifications==2023.12.1 kiwisolver==1.4.7 librosa==0.9.2 llvmlite==0.41.1 markdown-it-py==3.0.0 markupsafe==2.1.5 matplotlib==3.7.5 mdurl==0.1.2 mpmath==1.3.0 narwhals==1.14.1 networkx==3.2.1 numba==0.58.1 numpy==1.24.1 orjson==3.10.11 packaging==24.2 pandas==2.0.3 pillow==10.2.0 pkgutil-resolve-name==1.3.10 platformdirs==4.3.6 pooch==1.8.2 progressbar==2.5 pycparser==2.22 pydantic==2.10.1 pydantic-core==2.27.1 pydub==0.25.1 pygments==2.18.0 pyparsing==3.1.4 python-dateutil==2.9.0.post0 python-multipart==0.0.17 pytz==2024.2 pyyaml==6.0.2 referencing==0.35.1 regex==2024.11.6 requests==2.32.3 resampy==0.4.3 rich==13.9.4 rpds-py==0.20.1 ruff==0.8.0 safetensors==0.4.5 scikit-learn==1.3.2 scipy==1.10.1 semantic-version==2.10.0 setuptools==75.3.0 shellingham==1.5.4 six==1.16.0 sniffio==1.3.1 soundfile==0.12.1 starlette==0.41.3 sympy==1.13.1 threadpoolctl==3.5.0 tokenizers==0.20.3 tomlkit==0.12.0 torch==2.4.1+cu121 torchaudio==2.4.1+cu121 torchlibrosa==0.0.9 torchvision==0.19.1+cu121 tqdm==4.67.0 transformers==4.46.3 typer==0.13.1 typing-extensions==4.12.2 tzdata==2024.2 urllib3==2.2.3 uvicorn==0.32.1 wcwidth==0.2.13 websockets==11.0.3 zipp==3.20.2
以ä¸ã§ãªãã¸ããªã«ãã¢ãããã¼ããã¦ãã¾ã