ãç²ãæ§ã§ãã
SAM3ã®å®è¡ç°å¢ãWSL+Dockerã§ä½æããå®éã«å®è¡ãã¦è©¦ãã¦ã¿ãè¨é²ã§ãã
- SAM3ã«ã¤ãã¦
ai.meta.com
2025å¹´11æã«ãªãªã¼ã¹ãããSAMï¼Segment Anything Modelï¼ã·ãªã¼ãºã®ææ°ã¢ãã«ã§ãã SAM3ã§ã¯ãããã³ããã§ç»åå ã®æ¤åºãããç©ä½ãæç¤ºãããã¨ã§ç®çã®ç©ä½ã®ã»ã°ã¡ã³ãã¼ã·ã§ã³ã¨BBoxã®åºåãã§ãã¾ãã
ï¼ä»ã«ã3Dãªãã¸ã§ã¯ãã«å¯¾å¿ããSAM3Dãããã¾ããä»åã¯æ±ãã¾ãããï¼
ç°å¢æ§ç¯
- å®è¡ç°å¢
OS: Windows 11 Pro
CPU: Intel Core i7-13700
ã¡ã¢ãª: 32GB
GPU: NVIDIA GeForce RTX 4060 Ti (VRAM: 16GB)
ç°å¢ã¯ä¸è¿°ã®éãWSL+Dockerã使ç¨ãã¾ãããã¾ããPythonç°å¢ã¯uvã使ç¨ãã¦ãã¾ãã
ãã¼ã¹ã®ç°å¢ã®ä½æã«ã¤ãã¦ã¯éå»è¨äºããåèãã ããã
Windowsç°å¢ã®å ´åãä¸é¨ã®ã©ã¤ãã©ãªãLinuxã§ãã使ããèªåã§ãã«ãããå¿ è¦ãããã®ã§WSLãä½¿ãæ¹ãè¯ãã¨æãã¾ãã
ä»å使ç¨ããç°å¢è¨å®ãå«ãããªãã¸ããªãGitHubã«æ®ãã¦ãã¾ãã SAM3ã®å ¬å¼ãªãã¸ããªãforkãã¦ç°å¢è¨å®ãã¡ã¤ã«ã追å ããã®ã¿ã§ããâ¦ã
å®è¡
å ¬å¼ãããã¦ãããã¢ç¨ã®ã³ã¼ããåèã«ä½æããä¸è¨ã®ã½ã¼ã¹ã³ã¼ããå®è¡ãã¾ããã
注æç¹ã¨ãã¦ãã¢ãã«ã®éã¿ã®ãã¦ã³ãã¼ãã«ã¯HuggingFaceã®ã¢ãã«ãã¼ã¸ã§å©ç¨ç³è«ãå¿ è¦ã«ãªãã¾ãã
import os from PIL import Image import matplotlib.pyplot as plt from sam3.model_builder import build_sam3_image_model from sam3.model.sam3_image_processor import Sam3Processor from sam3.visualization_utils import plot_results from huggingface_hub import login from dotenv import load_dotenv load_dotenv() login(token=os.getenv("HF_TOKEN")) # ã¢ãã«ã®æºå model = build_sam3_image_model() processor = Sam3Processor(model) # ç»åã®èªã¿è¾¼ã¿ image = Image.open("data/1624777685449_985774_photo1.jpeg") inference_state = processor.set_image(image) # ããã¹ãããã³ãããè¨å®ãã¦æ¨è«ãå®è¡ output = processor.set_text_prompt(state=inference_state, prompt="tomato") plot_results(image, output) plt.show() plt.close()
ä¸è¨ãå®è¡ããã¨ãããªæãã§åºåããã¾ãã

ããã³ããã®æç¤ºã§ããç¨åº¦æ¤åºãããç©ä½ãçµããã¨ãå¯è½ã§ããä¾ãã°prompt="red tomato"ã¨å¤æ´ããã¨åºåãå¤ããã¾ãã

ç§ã®ç°å¢ã§ã®è©±ã«ã¯ãªãã¾ãããVRAMã大ä½5GBããã使ç¨ãã¦ããã®ã§æ¯è¼ç軽ããã§ãã
ã¾ããç»å1æãããã®æ¨è«æéã¯0.20sã»ã©ã ã£ãã®ã§ãã¡ãããªããªãéãã§ãã
