Running OmniParser with Docker

Introduction

Microsoft has released a model and library that can understand the contents of screenshots, so let's try it out. To set it up, we will build a Docker environment.

github.com

The Docker setup described below is also available in a repository I have published.

Development environment

Details

The master branch does not include a Docker setup, so I will create one based on the following:

github.com

Create the Dockerfile and download.py shown below.

# Dockerfile for OmniParser with GPU and OpenGL support.
#
# Base: nvidia/cuda:12.3.1-devel-ubuntu22.04
# Features:
# - Python 3.12 with Miniconda environment.
# - Git LFS for large file support.
# - Required libraries: OpenCV, Hugging Face, Gradio, OpenGL.
# - Gradio server on port 7861.
#
# 1. Build the image with CUDA support.
# ```
# sudo docker build -t omniparser .
# ```
#
# 2. Run the Docker container with GPU access and port mapping for Gradio.
# ```bash
# sudo docker run -d -p 7861:7861 --gpus all --name omniparser-container omniparser
# ```
#
# Author: Richard Abrich (richard@openadapt.ai)

FROM nvidia/cuda:12.3.1-devel-ubuntu22.04

# Install system dependencies with explicit OpenGL libraries
RUN apt-get update && DEBIAN_FRONTEND=noninteractive apt-get install -y \
    git-lfs \
    wget \
    libgl1 \
    libglib2.0-0 \
    && apt-get clean \
    && rm -rf /var/lib/apt/lists/* \
    && git lfs install

# Install Miniconda for Python 3.12
RUN wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh && \
    bash miniconda.sh -b -p /opt/conda && \
    rm miniconda.sh
ENV PATH="/opt/conda/bin:$PATH"

# Create and activate Conda environment with Python 3.12, and set it as the default
RUN conda create -n omni python=3.12 && \
    echo "source activate omni" > ~/.bashrc
ENV CONDA_DEFAULT_ENV=omni
ENV PATH="/opt/conda/envs/omni/bin:$PATH"

# Set the working directory in the container
WORKDIR /usr/src/app

# Copy project files and requirements
COPY . .
COPY requirements.txt /usr/src/app/requirements.txt

# Initialize Git LFS and pull LFS files
RUN git lfs install && \
    git lfs pull

# Install dependencies from requirements.txt with specific opencv-python-headless version
RUN . /opt/conda/etc/profile.d/conda.sh && conda activate omni && \
    pip uninstall -y opencv-python opencv-python-headless && \
    pip install --no-cache-dir opencv-python-headless==4.8.1.78 && \
    pip install -r requirements.txt && \
    pip install huggingface_hub

# Run download.py to fetch model weights and convert safetensors to .pt format
RUN . /opt/conda/etc/profile.d/conda.sh && conda activate omni && \
    python download.py && \
    echo "Contents of weights directory:" && \
    ls -lR weights && \
    python weights/convert_safetensor_to_pt.py

# Expose the default Gradio port
EXPOSE 7861

# Configure Gradio to be accessible externally
ENV GRADIO_SERVER_NAME="0.0.0.0"
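# Note: the reference Dockerfile this is based on presumably also defines a default command,
# since the header shows a detached `docker run -d ...` that would need one to start the demo
# by itself. If you want that behavior, a CMD along these lines should work (my assumption,
# not part of this article, which instead starts the demo manually inside the container):
# CMD ["python", "gradio_demo.py"]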

download.py:

import os
from huggingface_hub import snapshot_download

# Set the repository name
repo_id = "microsoft/OmniParser"

# Set the local directory where you want to save the files
local_dir = "weights"

# Create the local directory if it doesn't exist
os.makedirs(local_dir, exist_ok=True)

# Download the entire repository
snapshot_download(repo_id, local_dir=local_dir, ignore_patterns=["*.md"])

print(f"All files and folders have been downloaded to {local_dir}")

Build the image and start a container:

docker build -t omniparser .
docker run -it --gpus all -p 7861:7861 --name omniparser-container omniparser /bin/bash

Once inside the container, launch the Gradio demo:

python gradio_demo.py

If you pass it an image (below is a sample image from the repository), it recognizes the elements in the image and presents them.
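
You can also call the running demo from the host programmatically instead of through the browser. The snippet below is my own sketch, not part of the original article: it assumes the demo is reachable at http://localhost:7861 (matching the port mapping above), and since the exact endpoint names and parameters depend on gradio_demo.py, it only inspects the API rather than sending an image.

from gradio_client import Client

# Connect to the Gradio demo running inside the container (port 7861 mapped to the host).
client = Client("http://localhost:7861/")

# Print the available endpoints with their input/output signatures. Use this output to see
# how to pass an image to the demo, since the exact API depends on gradio_demo.py.
client.view_api()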