å°è¥¿ç§åã§ãã
ä»åã¯2023-09-28ã«General Availability(GA)ã«ãªã£ãAmazon Bedrockã®åºæ¬æ
å ±ãRuntime APIã®å®è¡ä¾ã«ã¤ãã¦ã¾ã¨ãã¾ãããã¾ãããã¼ã¯ã³ããã©ã¡ã¼ã¿ã®ã¤ã¡ã¼ã¸ãã¤ããããã®å¿
è¦æå°éã®ç¨èªèª¬æãæã
å
¥ãã¦ãã¾ãã
æçµæ´æ°æ¥ï¼2024/06/21
â»æ¬è¨äºããã³å½å·çè
ã®ãã®ä»ã®è¨äºã§æ²è¼ããã¦ããã½ã¼ã¹ã³ã¼ãã¯èªä¸»ç 究活åã®ä¸è²«ã¨ãã¦ä½æãããã®ã§ãããåä½ãä¿è¨¼ãããã®ã§ã¯ããã¾ããã使ç¨ããå ´åã¯èªå·±è²¬ä»»ã§ãé¡ãè´ãã¾ããã¾ããäºåãªãä¿®æ£ãããã¨ãããã¾ãã®ã§ãäºæ¿ãã ããã
ä»åã®è¨äºã®å
容ã¯æ¬¡ã®ãããªæ§æã«ãªã£ã¦ãã¾ãã
- Amazon Bedrockã®åºæ¬æ å ±
- Amazon Bedrockã®åºæ¬çãªä½¿ãæ¹
- Amazon Bedrockã®å§ãæ¹ã»æºå
- Amazon Bedrock Runtime APIã®InvokeModelãInvokeModelWithResponseStreamã¨ãã©ã¡ã¼ã¿
- ä¸è¬çãªæ¨è«ãã©ã¡ã¼ã¿ã®æå³
- AWS SDK for Python(Boto3)ã«ããAmazon Bedrock Runtimeã®invoke_modelå®è¡ä¾
- AWS CLIã«ããAmazon Bedrock Runtimeã®invoke-modelå®è¡ä¾
- ã¾ã¨ã
Amazon Bedrockã®åºæ¬æ å ±
Amazon Bedrockã®åèè³æã»å¦ç¿ãªã½ã¼ã¹
Amazon Bedrockã®ç解ã«å½¹ç«ã¤ä¸»ãªåèè³æã»å¦ç¿ãªã½ã¼ã¹ã«ã¯æ¬¡ã®ãã®ãæãããã¾ãã
ãã®è¨äºã®å
容ã¯ãããã®åèè³æã»å¦ç¿ãªã½ã¼ã¹ã«ããæ
å ±ãåºã¨ãã¦ãã¾ãã
- What's Newï¼ Amazon Bedrock is now generally available
- AWS Blog: Amazon Bedrock Is Now Generally Available â Build and Scale Generative AI Applications with Foundation Models
- ã¢ãã«ãã¨ã®ä¾¡æ ¼ï¼ Amazon Bedrock Pricing
- Workshopï¼ GitHub - aws-samples/amazon-bedrock-workshop: This is a workshop designed for Amazon Bedrock a foundational model service.
- AWS Documentation(User Guide)ï¼ What is Amazon Bedrock? - Amazon Bedrock
- AWS Documentation(API Reference)ï¼ Bedrock API Reference - Amazon Bedrock
- AWS SDK for Python(Boto3) Documentation(Bedrock)ï¼ Bedrock - Boto3 documentation
- AWS SDK for Python(Boto3) Documentation(BedrockRuntime)ï¼ BedrockRuntime - Boto3 documentation
- AWS CLI Command Reference(bedrock)ï¼ bedrock â AWS CLI Command Reference
- AWS CLI Command Reference(bedrock-runtime)ï¼ bedrock-runtime â AWS CLI Command Reference
- AWS Management Console(Amazon Bedrock Model Providers): Amazon Bedrock Model Providers - AWS Management Console
Amazon Bedrockã¨ã¯
Amazon Bedrockã¯AI21 Labsã®Jurassic-2, Amazonã®Titan, Anthropicã®Claude, Cohereã®Command, Metaã®Llama 2, Stability AIã®Stable Diffusionã¨ãã£ãåºç¤ã¢ãã«(Foundation Models:FMs)ã使ç¨ããããã®APIçµç±ã®ã¢ã¯ã»ã¹ãç¬èªãã¼ã¿ã使ç¨ãã¦FMsããã©ã¤ãã¼ãã«ã«ã¹ã¿ãã¤ãºããæ©è½ãæä¾ãããµã¼ãã¹ã§ãã
ããã¹ãçæããã£ããããããæ¤ç´¢ãããã¹ãè¦ç´ãç»åçæããã¼ã½ãã©ã¤ãºããã¬ã³ã¡ã³ãã¼ã·ã§ã³ãªã©ã®ã¦ã¼ã¹ã±ã¼ã¹ã«å¿ãã¦åºç¤ã¢ãã«ãé¸æãã¦Generative AIã¢ããªã±ã¼ã·ã§ã³ã®æ§ç¯ãæ¡å¼µãã§ãã¾ãã
ããã¹ããæ±ãGenerative AIã«ããããã¼ã¯ã³(tokens)ã¨ã¯
Amazon Bedrockã®ã¢ãã«ä¸è¦§ãä¾¡æ ¼è¡¨ãè¦ã¦ããåã«å¶éã課éã®åä½ã¨ãªã£ã¦ãããã¼ã¯ã³(tokens)ã«ã¤ãã¦ç°¡åã«èª¬æãã¾ãã
ãã ããããã§ã¯ã¤ã¡ã¼ã¸ã®ãããããéè¦ããããå³å¯ãªå®ç¾©ã¨ã¯ç°ãªãå¯è½æ§ããããã¨ããæ¿ç¥ãããã ããã
ããã¹ããæ±ãGenerative AIã«ããããã¼ã¯ã³(tokens)ã¨ã¯ãããã¹ããæå³ã®ããé¨åã«åå²ããåä½ã®ãã¨ã§ãã
ãã¼ã¯ã³ã¯åèªã«è©²å½ããå ´åãããã¾ãããå¿
ãããåèªã¨å義ã§ã¯ãªãæåããµãã¯ã¼ããªã©ã«åå²ãããå ´åãããã¾ãã
ä¾ãã°Amazon Bedrock is amazing!
ã¨ããæååãåèªãã¼ã¹ã«ãã¼ã¯ã³åããã¨æ¬¡ã®ããã«ãªãã¾ãã
["Amazon", "Bedrock", "is", "amazing", "!"]
ããããåèªãã¼ã¹ã§ã¯ãªãå¥ã®ãã¼ã¯ã³åææ³(Tokenization)ã使ç¨ããã¨æ¬¡ã®ããã«ã¹ãã¼ã¹ãå«ãã¦åå²ãããå ´åãããã¾ãã
["Amazon", " ", "Bedrock", " ", "is", " ", "amazing", "!"]
ãã¼ã¯ã³åææ³ã«ã¯åèªãã¼ã¹ä»¥å¤ã«Unigram TokenizationãWordPieceãSentencePieceãByte Pair Encoding(BPE)ãªã©é«åº¦ãªãã®ããããã¢ãã«ãã¨ã«æ¡ç¨ãã¦ããææ³ã¯æ§ã
ãªã®ã§ããã®ç¹ã¯æèãã¦ããå¿
è¦ãããã¾ãã
ç¹ã«ããã¼ã¯ã³ãã¼ã¹ã®æéè¨ç®ãè¡ãéã«ã¯ã対象ã¨ãªãã¢ãã«ã®ãã¼ã¯ã³åææ³ã«å¾ã£ã¦ãå®éã«ä½¿ç¨ããæ¡ä»¶ã«è¿ãã·ããªãªã§ãã¼ã¯ã³æ°ãç®åºããã®ãæè¯ã ã¨æãã¾ãã
ãããå人çã«ã¯ãèªåãå©ç¨ãã¦ããGenerative AIãµã¼ãã¹ã®æé¡äºç®ãèããã¨ããªã©ãã¼ã¯ã³æ°ã®è©³ããäºæ¸¬ã«æéã¨ä½æ¥éããããããªãå ´åã«ã¯ãGenerative AIãã®ãã®ã使ç¨ãã¦è¨ç®ããããè¨ç®ããããããã«1æåï¼1ãã¼ã¯ã³ã¨ãã¦é«ãã«æéãè¦ç©ãã£ãããã¦ãã¾ãã
ç¨æããã¦ããã¢ãã«ã®ä¸è¦§
製åãã¼ã¸ã®Amazon Bedrock â AWSã¾ãã¯AWS Management Consoleã®Amazon Bedrock Model Providersãåèã«æ¬è¨äºå·çæç¹ã®ãã¼ã¿ãã¾ã¨ãã¾ããã
â»Embeddings (Embed)ããµãã¼ããã¦ããã¢ãã«ã¯ãããã¹ãå
¥å(åèªããã¬ã¼ãºã大ããªããã¹ãåä½ãªã©)ãããã¹ãã®æå³å
容ãå«ãæ°å¤è¡¨ç¾(Embedding: åãè¾¼ã¿)ã«å¤æãããã¨ãã§ãã¾ãã
Model Provider | Model | Model ID | Max tokens | Modality (Data Type) |
Languages | Supported use cases |
---|---|---|---|---|---|---|
AI21 Labs | Jurassic-2 Ultra (v1) |
ai21.j2-ultra-v1 | 8191 | Text | English Spanish French German Portuguese Italian Dutch |
Open book question answering summarization draft generation information extraction ideation |
AI21 Labs | Jurassic-2 Mid (v1) |
ai21.j2-mid-v1 | 8191 | Text | English Spanish French German Portuguese Italian Dutch |
Open book question answering summarization draft generation information extraction ideation |
Amazon | Titan Embeddings G1 - Text (v1.2) |
amazon.titan-embed-text-v1 | 8k | Embedding | English, Arabic, Chinese (Sim.), French, German, Hindi, Japanese, Spanish, Czech, Filipino, Hebrew, Italian, Korean, Portuguese, Russian, Swedish, Turkish, Chinese (trad), Dutch, Kannada, Malayalam, Marathi, Polish, Tamil, Telugu and others. | Translate text inputs (words, phrases or possibly large units of text) into numerical representations (known as embeddings) that contain the semantic meaning of the text. |
Amazon | Titan Text G1 - Lite | amazon.titan-text-lite-v1 | 4k | Text | English | Summarization and copywriting. |
Amazon | Titan Text G1 - Express | amazon.titan-text-express-v1 | 8k | Text | English (GA), Multilingual in 100+ languages (Preview) | Open ended text generation brainstorming summarization code generation table creation data formatting paraphrasing chain of though rewrite extraction Q&A chat |
Amazon | Titan Image Generator G1 | amazon.titan-image-generator-v1 | 77 | Image | English | Text to image generation image editing image variations |
Amazon | Titan Multimodal Embeddings G1 | amazon.titan-embed-image-v1 | 128 | Embedding | English | Search recommendation personalization |
Anthropic | Claude 3.5 Sonnet | anthropic.claude-3-5-sonnet-20240620-v1:0 | 200k | Text | English and multiple other languages | Complex tasks like customer support Coding Data Analysis and Visual Processing. Streamlining of Workflows Generation of Insights and Production of High-Quality Natural-Sounding Content. |
Anthropic | Claude 3 Opus | anthropic.claude-3-opus-20240229-v1:0 | 200k | Text | English and multiple other languages | Task automation: plan and execute complex actions across APIs and databases, interactive coding R&D: research review, brainstorming and hypothesis generation, drug discovery Strategy: advanced analysis of charts & graphs, financials and market trends, forecasting |
Anthropic | Claude 3 Sonnet | anthropic.claude-3-sonnet-20240229-v1:0 | 200k | Text | English and multiple other languages | Data processing: RAG or search & retrieval over vast amounts of knowledge Sales: product recommendations, forecasting, targeted marketing Time-saving tasks: code generation, quality control, parse text from images |
Anthropic | Claude 3 Haiku | anthropic.claude-3-haiku-20240307-v1:0 | 200k | Text | English and multiple other languages | Customer interactions: quick and accurate support in live interactions, translations Content moderation: catch risky behavior or customer requests Cost-saving tasks: optimized logistics, inventory management, extract knowledge from unstructured data |
Anthropic | Claude v2.1 | anthropic.claude-v2:1 | 200k | Text | English and multiple other languages | Question answering information extraction removing PII content generation multiple choice classification Roleplay comparing text summarization document Q&A with citation |
Anthropic | Claude v2 | anthropic.claude-v2 | 100k | Text | English and multiple other languages | Question answering information extraction removing PII content generation multiple choice classification Roleplay comparing text summarization document Q&A with citation |
Anthropic | [Legacy version] Claude v1.3 |
anthropic.claude-v1 | 100k | Text | English and multiple other languages | Question answering information extraction removing PII content generation multiple choice classification Roleplay comparing text summarization document Q&A with citation |
Anthropic | Claude Instant v1.2 | anthropic.claude-instant-v1 | 100k | Text | English and multiple other languages | Question answering information extraction removing PII content generation multiple choice classification Roleplay comparing text summarization document Q&A with citation |
Cohere | Command R+ (v1) |
cohere.command-r-plus-v1:0 | 128k | Text | English, French, Spanish, Italian, German, Portuguese, Japanese, Korean, Arabic, and Chinese | Complex RAG on large amounts of data Q&A Multi-step tool use chat text generation text summarization |
Cohere | Command R (v1) |
cohere.command-r-v1:0 | 128k | Text | English, French, Spanish, Italian, German, Portuguese, Japanese, Korean, Arabic, and Chinese | Chat text generation text summarization RAG on large amounts of data Q&A function calling |
Cohere | Command (v14.7) |
cohere.command-text-v14 | 4000 | Text | English | Summarization copywriting dialogue extraction question answering |
Cohere | Command Light (v14.7) |
cohere.command-light-text-v14 | 4000 | Text | English | Summarization copywriting dialogue extraction question answering |
Cohere | Embed English (v3) |
cohere.embed-english-v3 | 512 | Embedding | English | Semantic search retrieval-augmented generation (RAG) classification clustering |
Cohere | Embed Multilingual (v3) |
cohere.embed-multilingual-v3 | 512 | Embedding | 108 Languages | Semantic search retrieval-augmented generation (RAG) classification clustering |
Meta | Llama 3 70B Instruct | meta.llama3-70b-instruct-v1:0 | 8k | Text | English | Language modeling Dialog systems Code generation Following instructions Sentiment analysis with nuances in reasoning Text classification with improved accuracy and nuance Text summarization with accuracy and nuance |
Meta | Llama 3 8B Instruct | meta.llama3-8b-instruct-v1:0 | 8k | Text | English | Text summarization Text classification Sentiment analysis |
Meta | Llama 2 Chat 13B | meta.llama2-13b-chat-v1 | 4096 | Text | English | Text generation Conversation Chat based applications |
Meta | Llama 2 Chat 70B | meta.llama2-70b-chat-v1 | 4096 | Text | English | Text generation Conversation Chat based applications |
Mistral AI | Mistral 7B Instruct | mistral.mistral-7b-instruct-v0:2 | 32K | Text | English | Classification Text generation Code generation |
Mistral AI | Mixtral 8x7B Instruct | mistral.mixtral-8x7b-instruct-v0:1 | 32K | Text | English, French, Italian, German and Spanish | Complex reasoning & analysis Text generation Code generation |
Mistral AI | Mistral Large | mistral.mistral-large-2402-v1:0 | 32K | Text | English, French, Italian, German and Spanish | Complex reasoning & analysis Text generation Code generation RAG Agents |
Mistral AI | Mistral Small | mistral.mistral-small-2402-v1:0 | 32K | Text | English, French, Italian, German and Spanish | Text generation Code generation Classification RAG Conversation |
Stability AI | [Legacy version] Stable Diffusion XL (v0.8) |
stability.stable-diffusion-xl-v0 | 77 | Image | English | image generation image editing |
Stability AI | Stable Diffusion XL (v1.0) |
stability.stable-diffusion-xl-v1 | 77 | Image | English | image generation image editing |
ç¨æããã¦ããã¢ãã«ã®ä¾¡æ ¼
Amazon Bedrock Pricingãåèã«æ¬è¨äºå·çæç¹ã®ãã¼ã¿ãã¾ã¨ãã¾ããã
ä¾¡æ ¼ãè¨è¼ããã¦ããªãã¢ãã«ã®é
ç®ã¯ãã®ä¾¡æ ¼ãªãã·ã§ã³ãæä¾ããã¦ããªããã¾ãã¯ã¢ãã«ã®ã«ã¹ã¿ãã¤ãºã®æ©è½ãã®ãã®ããµãã¼ãããã¦ããªããã¨ã示ãã¦ãã¾ãã
ããã¹ããæ±ãã¢ãã«ã®ä¾¡æ ¼
ããã¹ããæ±ãã¢ãã«ã®ä¾¡æ ¼ã¯æ¬¡ã®é
ç®ã§ä¾¡æ ¼ãè¨å®ããã¦ãã¾ãã
- On-Demand
On-Demandã¯1,000ãã¨ã®å ¥åãã¼ã¯ã³ã¨1,000ãã¨ã®åºåãã¼ã¯ã³ã§ä¾¡æ ¼è¨ç®ããã¾ã(æéãã¼ã¹ã®æ¯æãã§ã¯ããã¾ãã)ã - Provisioned Throughput
Provisioned Throughputã¯æå®ããæéã«ãããæéãã¼ã¹ã®æ¯æãã確ç´(commitment)ãããã¨ã§ã大è¦æ¨¡å©ç¨ãªã©è¦ä»¶ãæºããããã®ååãªã¹ã«ã¼ãããããããã¸ã§ãã³ã°ãã¾ãã
commitmentã®æéã«ã¯ãç¡ãã1ã¶æã6ã¶æãããé·æéã§ããã»ã©å²å¼ãããã¾ãã - Model customization(Fine-tuning)
Fine-tuningã使ç¨ããã«ã¹ã¿ã ã¢ãã«ãä½æããå ´åã¯ã1,000ãã¼ã¯ã³ãã¨ã®ãã¬ã¼ãã³ã°æéãã«ã¹ã¿ã ã¢ãã«ãã¨ã®æé¡ä¿åæéãçºçãã¾ãã
Model Provider | Model | On-Demand (per 1000 input tokens) |
On-Demand (per 1000 output tokens) |
Provisioned Throughput (per hour per model) |
Model customization through Fine-tuning |
---|---|---|---|---|---|
AI21 Labs | Jurassic-2 Ultra | 0.0188 USD | 0.0188 USD | - | - |
AI21 Labs | Jurassic-2 Mid | 0.0125 USD | 0.0125 USD | - | - |
Amazon | Titan Text Lite(Titan Text G1 - Lite) | 0.0003 USD | 0.0004 USD | no commitment: 7.10 USD 1-month commitment: 6.40 USD 6-month commitment: 5.10 USD |
Train(per 1000 tokens): 0.0004 USD Store each custom model(per month): 1.95 USD |
Amazon | Titan Text Express(Titan Text G1 - Express) | 0.0008 USD | 0.0016 USD | no commitment: 20.50 USD 1-month commitment: 18.40 USD 6-month commitment: 14.80 USD |
Train(per 1000 tokens): 0.008 USD Store each custom model(per month): 1.95 USD |
Amazon | Titan Embeddings(Titan Embeddings G1 - Text) | 0.0001 USD | N/A | no commitment: N/A 1-month commitment: 6.40 USD 6-month commitment: 5.10 USD |
- |
Anthropic | Claude 3.5 Sonnet | 0.00300 USD | 0.01500 USD | no commitment: N/A 1-month commitment: N/A 6-month commitment: N/A |
- |
Anthropic | Claude 3 Opus | 0.01500 USD | 0.07500 USD | no commitment: N/A 1-month commitment: N/A 6-month commitment: N/A |
- |
Anthropic | Claude 3 Sonnet | 0.00300 USD | 0.01500 USD | no commitment: N/A 1-month commitment: N/A 6-month commitment: N/A |
- |
Anthropic | Claude 3 Haiku | 0.00025 USD | 0.00125 USD | no commitment: N/A 1-month commitment: N/A 6-month commitment: N/A |
- |
Anthropic | Claude(v2.0, v2.1) | 0.00800 USD | 0.02400 USD | no commitment: N/A 1-month commitment: 63.00 USD 6-month commitment: 35.00 USD |
- |
Anthropic | Claude Instant(v1.2) | 0.00080 USD | 0.00240 USD | no commitment: N/A 1-month commitment: 39.60 USD 6-month commitment: 22.00 USD |
- |
Cohere | Command R+ | 0.0030 USD | 0.0150 USD | - | - |
Cohere | Command R | 0.0005 USD | 0.0015 USD | - | - |
Cohere | Command | 0.0015 USD | 0.0020 USD | no commitment: 49.50 USD 1-month commitment: 39.60 USD 6-month commitment: 23.77 USD |
Train(per 1000 tokens): 0.004 USD Store each custom model(per month): 1.95 USD |
Cohere | Command-Light | 0.0003 USD | 0.0006 USD | no commitment: 8.56 USD 1-month commitment: 6.85 USD 6-month commitment: 4.11 USD |
Train(per 1000 tokens): 0.001 USD Store each custom model(per month): 1.95 USD |
Cohere | Embed â English | 0.0001 USD | N/A | no commitment: 7.12 USD 1-month commitment: 6.76 USD 6-month commitment: 6.41 USD |
- |
Cohere | Embed â Multilingual | 0.0001 USD | N/A | no commitment: 7.12 USD 1-month commitment: 6.76 USD 6-month commitment: 6.41 USD |
- |
Meta | Llama 3 Instruct 8B | 0.0003 USD | 0.0006 USD | - | - |
Meta | Llama 3 Instruct 70B | 0.00265 USD | 0.0035 USD | - | - |
Meta | Llama 2 Chat 13B | 0.00075 USD | 0.00100 USD | no commitment: N/A 1-month commitment: 21.18 USD 6-month commitment: 13.08 USD |
Train(per 1000 tokens): 0.00149 USD Store each custom model(per month): 1.95 USD |
Meta | Llama 2 Chat 70B | 0.00195 USD | 0.00256 USD | no commitment: N/A 1-month commitment: 21.18 USD 6-month commitment: 13.08 USD |
Train(per 1000 tokens): 0.00799 USD Store each custom model(per month): 1.95 USD |
Mistral AI | Mistral 7B Instruct | 0.00015 USD | 0.0002 USD | - | - |
Mistral AI | Mixtral 8x7B Instruct | 0.00045 USD | 0.0007 USD | - | - |
Mistral AI | Mistral Small | 0.001 USD | 0.003 USD | - | - |
Mistral AI | Mistral Large | 0.004 USD | 0.012 USD | - | - |
ãã«ãã¢ã¼ãã«ã¢ãã«ã®ä¾¡æ ¼
ç»åããã®ä»ã®ã¡ãã£ã¢ãå¦çãããã«ãã¢ã¼ãã«ã¢ãã«ã®ä¾¡æ ¼è¨å®ã¯ãç»åã®æ°ã解å度ãªã©æ§ã
ãªåºæºã«åºã¥ãã¦ãããããããã®ã¢ãã«ãã¨ã«ã¾ã¨ãã¾ããã
Model Provider | Model | Standard quality(<51 steps) (per image) |
Premium quality(>51 steps) (per image) |
Provisioned Throughput (per hour per model) |
Model customization through Fine-tuning |
---|---|---|---|---|---|
Stability AI | Stable Diffusion XL (v0.8) |
512x512 or smaller: 0.018 USD Larger than 512x512: 0.036 USD |
512x512 or smaller: 0.036 USD Larger than 512x512: 0.072 USD |
- | - |
Stability AI | Stable Diffusion XL (v1.0) |
Up to 1024 x 1024: 0.04 USD | Up to 1024 x 1024: 0.08 USD | no commitment: N/A 1-month commitment: 49.86 USD 6-month commitment: 46.18 USD |
- |
Model Provider | Model | Standard quality (per image) |
Premium quality (per image) |
Provisioned Throughput (per hour per model) |
Model customization through Fine-tuning |
---|---|---|---|---|---|
Amazon | Titan Image Generator | 512x512: 0.008 USD 1024X1024: 0.01 USD |
512x512: 0.01 USD 1024X1024: 0.012 USD |
no commitment: N/A 1-month commitment: 16.20 USD 6-month commitment: 13.00 USD |
Train(per image seen): 0.005 USD Store each custom model(per month): 1.95 USD |
Amazon | Titan Image Generator(custom models) | 512x512: 0.018 USD 1024X1024: 0.02 USD |
512x512: 0.02 USD 1024X1024: 0.022 USD |
no commitment: 23.40 USD 1-month commitment: 21.00 USD 6-month commitment: 16.85 USD |
- |
Model Provider | Model | On-Demand (per 1000 input tokens) |
On-Demand (per 1000 input image) |
Provisioned Throughput (per hour per model) |
Model customization through Fine-tuning |
---|---|---|---|---|---|
Amazon | Titan Multimodal Embeddings | 0.0008 USD | 0.00006 USD | no commitment: 9.38 USD 1-month commitment: 8.45 USD 6-month commitment: 6.75 USD |
Train(per image seen): 0.0002 USD Store each custom model(per month): 1.95 USD |
Amazon Bedrockã®åºæ¬çãªä½¿ãæ¹
Amazon Bedrockã®å§ãæ¹ã»æºå
Amazon Bedrockãå§ããã«ã¯AWS Management Consoleã§Amazon Bedrockã®Model accessç»é¢ã«é·ç§»ããEdit
ãã¯ãªãã¯ããã¦ä½¿ç¨ããã¢ãã«ãé¸æããSave changes
ã§ã¢ãã«ã¸ã®ã¢ã¯ã»ã¹ããªã¯ã¨ã¹ããã¾ãã
Amazon Bedrock > Model access - AWS Management Console
â»Anthropicã®ã¢ãã«ã¯ä¼ç¤¾æ
å ±ãç®çãªã©ãå
¥åãã¦ãªã¯ã¨ã¹ãããå¿
è¦ãããã¾ãã
ãªã¯ã¨ã¹ããæ¿èªãããã¨ã¢ãã«ã®ã¢ã¯ã»ã¹ãæå¹ã«ãªã使ç¨ã§ããããã«ãªãã¾ãã
Amazon Bedrock Runtime APIã®InvokeModelãInvokeModelWithResponseStreamã¨ãã©ã¡ã¼ã¿
å®éã«Amazon Bedrockã使ç¨ããããã®APIã«ã¤ãã¦èª¬æãã¾ãã
Amazon Bedrockã«é¢ããAPIã«ã¯å¤§ããåãã¦Bedrock APIã¨Bedrock Runtime APIãããã¾ãã
Bedrock APIã¯Fine-tuningã«ããã«ã¹ã¿ã ã¢ãã«ã®ä½æãã¢ãã«ã®Provisioned Throughputã®è³¼å
¥ãªã©AWSãªã½ã¼ã¹ã®æä½ã«ä½¿ç¨ãã¾ãã
ä¸æ¹ã®Bedrock Runtime APIã¯ãã¼ã¹ã¢ãã«ãã«ã¹ã¿ã ã¢ãã«ãæå®ãã¦å
¥åãã¼ã¿(Prompt)ããªã¯ã¨ã¹ãããã¬ã¹ãã³ã¹ããåºåãã¼ã¿(Completions)ãåå¾ããå®éã®å®è¡ã«ä½¿ç¨ãã¾ãã
Amazon Bedrock Runtime APIã«ã¯å®éã«ã¢ãã«ãå¼ã³åºãã¦ä½¿ç¨ããããã®InvokeModelã¨InvokeModelWithResponseStreamãããã¾ãã
Amazon Bedrock Runtime APIã®InvokeModelã¯ãªã¯ã¨ã¹ãã«å¯¾ããã¬ã¹ãã³ã¹ã®å
容ãä¸åº¦ã«ãã¹ã¦åå¾ããAPIã§ãã
ä¸æ¹ã§Amazon Bedrock Runtime APIã®InvokeModelWithResponseStreamã¯ãªã¯ã¨ã¹ãã«å¯¾ããã¬ã¹ãã³ã¹ã®å
容ãå°éã®æåãã¤å¾ã
ã«ã¹ããªã¼ã ã¨ãã¦åå¾ããAPIã§ãã
ãã§ã«ãã£ããå½¢å¼ã®Generative AIãµã¼ãã¹ã使ã£ããã¨ãããæ¹ã¯Promptã«å¯¾ããçµæãæ°æåãã¤è¡¨ç¤ºãããç»é¢ãè¦ããã¨ãããã¨æãã¾ããããã®è¡¨ç¤ºæ¹æ³ã«ä½¿ç¨ã§ããã®ãInvokeModelWithResponseStreamã§ãã
Amazon Bedrock Runtime APIã®InvokeModelã¨InvokeModelWithResponseStreamã®ãªã¯ã¨ã¹ãã§æå®ãããã©ã¡ã¼ã¿ã¼ã¯å
±éã§æ¬¡ã®ãã®ã使ç¨ãã¾ãã
accept: ã¬ã¹ãã³ã¹ã®æ¨è«Bodyã®MIMEã¿ã¤ãã(Default: application/json) contentType: ãªã¯ã¨ã¹ãã®å ¥åãã¼ã¿ã®MIMEã¿ã¤ãã(Default: application/json) modelId: [Required]ã¢ãã«ã®èå¥åã(ä¾ï¼ ai21.j2-ultra-v1) body: [Required]contentTypeã§æå®ããå½¢å¼ã®å ¥åãã¼ã¿ãåã¢ãã«ã§ãµãã¼ãããã¦ããæ¨è«ãã©ã¡ã¼ã¿ã¼ã«ãããã¦bodyã®ãã£ã¼ã«ãã®ãã©ã¼ããããæå®ããã
ä¸è¬çãªæ¨è«ãã©ã¡ã¼ã¿ã®æå³
以éã§ã¯Amazon Bedrock Runtime APIã®å®è¡ä¾ãç´¹ä»ãã¾ããããã®åã«ã¢ãã«ã¸ã®ãªã¯ã¨ã¹ãã®Bodyå
ã§ãã使ç¨ããä¸è¬çãªæ¨è«ãã©ã¡ã¼ã¿ã¼ã«ã¤ãã¦ç°¡åã«èª¬æãã¾ãã
ãã ããããã§ã¯ã¤ã¡ã¼ã¸ã®ãããããéè¦ããããå³å¯ãªå®ç¾©ã¨ã¯ç°ãªãå¯è½æ§ããããã¨ããæ¿ç¥ãããã ããã
- temperature
ã¢ãã«ã®åºå確çåå¸ã®ã©ã³ãã æ§ã¨å¤æ§æ§ã調æ´ãããã©ã¡ã¼ã¿ã§ãå¤ã大ããã¨ã©ã³ãã æ§ã¨å¤æ§æ§ã®é«ãæå¤ãªåçãè¿ãå¾åã«ãªããå¤ãå°ããã¨ããé«ã確çã§æ¨å®ãããåçãè¿ãå¾åã«ãªãã¾ããtemperatureã®é常ã®ç¯å²ã¯0 - 1
ã®éã§ããã1ãè¶ ããå¤ãè¨å®ã§ããã¢ãã«ãããã¾ããä¾ãã°ãtemperature=1.0
ã¨temperature=0.1
ã§ã¯temperature=1.0
ã®æ¹ãã©ã³ãã æ§ã¨å¤æ§æ§ãé«ãåçããtemperature=0.1
ã®æ¹ãããé«ã確çã§æ¨å®ãããåçãè¿ãå¾åã«ãªãã¾ãã - topK
ã¢ãã«ãèæ ®ãããã¼ã¯ã³ã®ä¸ä½Kåãå¶éãããã¨ã§ã©ã³ãã æ§ã¨å¤æ§æ§ã調æ´ãããã©ã¡ã¼ã¿ã§ããtopKã®æé©ãªç¯å²ã¯ä½¿ç¨ããã¢ãã«ã«ãã£ã¦ç°ãªãã¾ãããã®å¤ãã»ããããã¨åºåãã¼ã¯ã³ã¯ãã®ä¸ä½Kã®ä¸ããé¸æããã¾ããä¾ãã°ãtopK=10
ã®å ´åã¯ã¢ãã«ãåççææã«ç¢ºçã®é«ãä¸ä½10ã®ãã¼ã¯ã³ã®ã¿ãèæ ®ããããã«ãªãã¾ããç°¡åã«è¨ãã°ãtopKã¯é¸æå¯è½ãªãã¼ã¯ã³ã®ç¯å²ãåºåãã¼ã¯ã³ã®æ°ã§å¶éãããã®çµæã¨ãã¦å¤æ§æ§ã調æ´ãã¾ãã - topP
ãã¼ã¯ã³ã®ç´¯ç©ç¢ºçãæå®ããPãè¶ ããåã®ãã¼ã¯ã³ã®éåãããµã³ããªã³ã°ãããã¨ã§ã©ã³ãã æ§ã¨å¤æ§æ§ã調æ´ãããã©ã¡ã¼ã¿ã§ããé常ã®topPã®ç¯å²ã¯0 - 1
ã®éã§ããä¾ãã°topP=0.9
ã®å ´åã¯ã¢ãã«ãåççææã«ç¢ºçãé«ããã¼ã¯ã³ããé ã«ç´¯è¨ç¢ºçã0.9ãè¶ ããåã®ãã¼ã¯ã³ã¾ã§ãèæ ®ããããã«ãªãã¾ããç°¡åã«è¨ãã°ãtopPã¯é¸æå¯è½ãªãã¼ã¯ã³ã®ç¯å²ãåºåãã¼ã¯ã³ã®ç¢ºçã®ç´¯è¨ã«åºã¥ãã¦å¶éãããã®çµæã¨ãã¦ã©ã³ãã æ§ã¨å¤æ§æ§ã調æ´ãã¾ãã - maxTokens
çæããããã¼ã¯ã³ã®æ大æ°ãå¶éããçæãããããã¹ãã®é·ããã³ã³ããã¼ã«ããããã®ãã©ã¡ã¼ã¿ã§ããä¾ãã°maxTokens=800
ã®å ´åã¯ã¢ãã«ã800ãã¼ã¯ã³ãè¶ ããããã¹ããçæããªãããã«ãªãã¾ãã
APIã¸ã®ãªã¯ã¨ã¹ãã§ã¯temperatureãtopKãtopPã®ãã©ã¡ã¼ã¿ãçµã¿åããã¦ç¢ºä¿¡åº¦ã¨å¤æ§æ§ã®ãã©ã³ã¹ã調æ´ããmaxTokensã§åºåããããã¼ã¯ã³æ°ãå¶éãã¾ãã
Amazon Bedrockã«ããåã¢ãã«ã®è©³ç´°ãªæ¨è«ãã©ã¡ã¼ã¿ã«ã¤ãã¦ã¯ãInference parameters for foundation models - Amazon Bedrockããåç §ãã¦ãã ããã
AWS SDK for Python(Boto3)ã«ããAmazon Bedrock Runtimeã®invoke_modelå®è¡ä¾
ããã§ã¯AWS SDK for Python(Boto3)ã«ããAmazon Bedrock Runtimeã®invoke_modelãAWS Lambdaé¢æ°ã§å®è¡ããä¾ãç´¹ä»ãã¾ãã
æ¬è¨äºå·çæç¹ã§ã¯AWS Lambdaé¢æ°ã®ããã©ã«ãã®AWS SDK for Python(Boto3)ã§ã¯bedrock
ãbedrock-runtime
ã®Clientãã¾ã å¼ã³åºãã¾ããã§ããã
ãã®ããã以ä¸ã¯ææ°ã®AWS SDK for Python(Boto3)ãLambda Layerã«è¿½å ãã¦bedrock-runtime
ã®Clientã使ç¨ããä¾ã§ãã
ã»å®è¡ä¾(AWS Lambdaé¢æ°)
import boto3 import json import os region = os.environ.get('AWS_REGION') bedrock_runtime_client = boto3.client('bedrock-runtime', region_name=region) def lambda_handler(event, context): modelId = 'ai21.j2-ultra-v1' contentType = 'application/json' accept = 'application/json' body = json.dumps({ "prompt": "Please tell us all the states in the U.S.", "maxTokens": 800, "temperature": 0.7, "topP": 0.95 }) response = bedrock_runtime_client.invoke_model( modelId=modelId, contentType=contentType, accept=accept, body=body ) response_body = json.loads(response.get('body').read()) return response_body
ã»å®è¡çµæä¾(ä¸è¨AWS Lambdaé¢æ°ã®è¿å´å¤)
{ "id": 1234, "prompt": { "text": "Please tell us all the states in the U.S.", "tokens": [ ãçç¥ã ] }, "completions": [ { "data": { "text": "\nUnited States of America is a federal republic consisting of 50 states, a federal district (Washington, D.C., the capital city of the United States), five major territories, and various minor islands. The 50 states are Alabama, Alaska, Arizona, Arkansas, California, Colorado, Connecticut, Delaware, Florida, Georgia, Hawaii, Idaho, Illinois, Indiana, Iowa, Kansas, Kentucky, Louisiana, Maine, Maryland, Massachusetts, Michigan, Minnesota, Mississippi, Missouri, Montana, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, New York, North Carolina, North Dakota, Ohio, Oklahoma, Oregon, Pennsylvania, Rhode Island, South Carolina, South Dakota, Tennessee, Texas, Utah, Vermont, Virginia, Washington, West Virginia, Wisconsin, and Wyoming.", "tokens": [ ãçç¥ã ] }, "finishReason": { "reason": "endoftext" } } ] }
â»æ¬è¨äºå·çæç¹ã§ææ°ã®AWS SDK for Python(Boto3)ã«ã¯Amazon Bedrock Runtimeã®invoke_model_with_response_streamã³ãã³ããç¨æããã¦ãã¾ãã
ãã ãå¥è¨äºã§è©³ç´°ã«ã¤ãã¦èª¬æããäºå®ã®ãããä»åã®è¨äºã§ã¯å²æãã¾ãã
AWS CLIã«ããAmazon Bedrock Runtimeã®invoke-modelå®è¡ä¾
ããã§ã¯ãAWS CLIã«ããAmazon Bedrock Runtimeã®invoke-modelå®è¡ä¾ãç´¹ä»ãã¾ãã
æ¬è¨äºå·çæç¹ã§ã¯Amazon Bedrock Runtime APIã¯AWS CLI Version 2ç³»ã«ã¯ã¾ã 対å¿ãã¦ãã¾ããã§ããã
ãã®ããã以ä¸ã¯Amazon Bedrock Runtime APIã対å¿ãã¦ããAWS CLI Version 1ç³»ãå¥éã¤ã³ã¹ãã¼ã«ãã¦å®è¡ããä¾ã§ãã
ã»ãã©ã¼ããã
aws bedrock-runtime invoke-model \ --region [Region] \ --model-id "[modelId]" \ --content-type "[contentType]" \ --accept "[accept]" \ --body "[body]" [Output FileName]
ã»å®è¡ä¾
aws bedrock-runtime invoke-model \ --region us-east-1 \ --model-id "ai21.j2-ultra-v1" \ --content-type "application/json" \ --accept "application/json" \ --body "{\"prompt\": \"Please tell us all the states in the U.S.\", \"maxTokens\": 800,\"temperature\": 0.7,\"topP\": 0.95}" invoke-model-output.txt
ã»ã¬ã¹ãã³ã¹ä¾
* ç»é¢è¡¨ç¤º {"contentType": "application/json"} * ãã¡ã¤ã«å 容(invoke-model-output.txt) {"id": 1234,"prompt": {"text": "Please tell us all the states in the U.S.","tokens": [ãçç¥ã]},"completions": [{"data": {"text": "\nUnited States of America is a federal republic consisting of 50 states, a federal district (Washington, D.C., the capital city of the United States), five major territories, and various minor islands. The 50 states are Alabama, Alaska, Arizona, Arkansas, California, Colorado, Connecticut, Delaware, Florida, Georgia, Hawaii, Idaho, Illinois, Indiana, Iowa, Kansas, Kentucky, Louisiana, Maine, Maryland, Massachusetts, Michigan, Minnesota, Mississippi, Missouri, Montana, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, New York, North Carolina, North Dakota, Ohio, Oklahoma, Oregon, Pennsylvania, Rhode Island, South Carolina, South Dakota, Tennessee, Texas, Utah, Vermont, Virginia, Washington, West Virginia, Wisconsin, and Wyoming.","tokens": [ãçç¥ã]},"finishReason": {"reason": "endoftext"}}]}
â»æ¬è¨äºå·çæç¹ã§AWS CLIã«Amazon Bedrock Runtimeã®invoke-model-with-response-streamã³ãã³ãã¯ç¨æããã¦ãã¾ããã
åèï¼
Amazon Bedrock is now generally available
Amazon Bedrock Is Now Generally Available â Build and Scale Generative AI Applications with Foundation Models
Amazon Bedrock Pricing
GitHub - aws-samples/amazon-bedrock-workshop: This is a workshop designed for Amazon Bedrock a foundational model service.
What is Amazon Bedrock? - Amazon Bedrock
Bedrock API Reference - Amazon Bedrock
Bedrock - Boto3 documentation
BedrockRuntime - Boto3 documentation
bedrock â AWS CLI Command Reference
bedrock-runtime â AWS CLI Command Reference
Amazon Bedrock Model Providers - AWS Management Console
Tech Blog with related articles referenced
ã¾ã¨ã
ä»åã¯Amazon Bedrockã®åèè³æãã¢ãã«ä¸è¦§ãä¾¡æ ¼ã使ãæ¹ããã¼ã¯ã³ããã©ã¡ã¼ã¿ã®ç¨èªèª¬æãRuntime APIã®å®è¡ä¾ã«ã¤ãã¦ç´¹ä»ãã¾ããã
æ
å ±ãã¾ã¨ãã¦ããä¸ã§Amazon Bedrockã¯æ§ã
ãªç¨®é¡ã®ã¢ãã«ããã¦ã¼ã¹ã±ã¼ã¹ã«å¿ãããã®ãé¸æã§ããä»ã®AWSãµã¼ãã¹ã¨è¦ªåæ§ã®é«ãAWS SDKãAWS CLIã®ã¤ã³ã¿ãã§ã¼ã¹ã§å¼ã³åºããããã¨ããããã¾ããã
ãããããAmazon Bedrockãã¢ãããã¼ããå®è£
æ¹æ³ãä»ã®ãµã¼ãã¹ã¨ã®çµã¿åãããªã©ã®è¦³ç¹ã§ã¦ã©ãããã¦ããããã¨æãã¾ãã
- [English Edition] Basic Information about Amazon Bedrock with API Examples - Model Features, Pricing, How to Use, Explanation of Tokens and Inference Parameters