Version 1.107 is now available! Read about the new features and fixes from November.
Dismiss this update
The AI Toolkit Copilot tools help you build AI agent applications faster. These tools give you ready-made features, templates, and best practices. You can create smart applications that use AI more quickly.
The AI Toolkit Copilot tools use the VS Code Language Model Tool API. These tools extend a large language model (LLM) with domain-specific features in chat. In agent mode, VS Code can automatically use these tools to handle user prompts and perform tasks.
The AI Toolkit Copilot tools include four main tools:
After installing the prerequisites, you can use the tools AI Toolkit provides when using agents in chat:
Open the Chat view ⌃⌘I (Windows, Linux Ctrl+Alt+I), and select Agent from the dropdown.
Select the Tools button to view the list of available tools.
Optionally, select or deselect the tools you want to use. You can search tools by typing in the search box.

The Agent Code Gen tool helps developers create agent code more easily. Use this tool to quickly make code snippets and templates that work well for AI agents. This approach makes development faster and ensures your code follows best practices for AI agent development.
The Agent Code Gen tool has several important features:
Intelligent agent code generation: The tool creates agent code based on what you need.
Default framework selection: If you don't choose a framework, the tool picks Microsoft Agent Framework SDK for you.
Example requirement:
Create an AI app that helps me to manage travel queries.
Integrated model guidance: The tool uses the AI Model Guide to give you model details while building agents. It picks GPT4.1 as the default model unless you choose another.
Example requirement:
Create an AI app to manage travel queries, use Microsoft Foundry models.
Various agent framework functionality support: The tool supports many features like function calling, MCP, and streaming responses.
Example requirement:
Create an AI app to check the CNN headline, use local MCP playwright to fetch CNN web page.
Workflow support: The tool supports many workflows in Agent Framework, like Sequential, Switch-case, Loop, and Human-In-The-Loop.
Example requirement:
Build a conditional routing workflow based on email classification:
- "Email Classifier": determines if email is spam or legitimate.
- "Spam Handler": processes spam emails (if spam detected).
- "Email Assistant": drafts responses for legitimate emails (if not spam).
The AI Model Guide tool helps developers pick the best AI models for their apps. It recommends Microsoft Foundry and GitHub models, including the latest and most popular ones. The tool provides details like input types, context length, cost, and metrics (quality, speed, safety). It also explains how to connect to models, such as GitHub endpoints and tokens.
This tool supports:
Basic model recommendation:
Example information prompt:
Which models are specifically designed for reasoning or math tasks?
For this example, Copilot gathers model information from this tool and recommends o-series models for reasoning or math tasks.
Working together with the Code Gen tool to provide model selection during agent code generation:
Example requirement prompt:
Create a quick demo AI Chat app
For this example, Copilot selects a model like the free GitHub GPT-4.1 for the chat app.
Model selection based on specific requirements:
Example requirement prompt:
Create an AI app to manage travel queries using a cheap and fast azure model.
For this example, Copilot selects a model like MicrosoftFoundry GPT-4.1-mini model.
The Evaluation Code Gen tool helps you add evaluation to your AI apps or agents. It uses a Plan tool to gather details and prepare assets before generating code.
The Evaluation Code Gen tool has four main features:
These tools are helpful regardless of your skill level. New developers get guidance on tests and tools, and experienced developers can save time. The Plan tool finds your test files or asks where they are, making it easy to add testing to your app.
Example requirement prompts:
Example 1:
Create an evaluation for my AI travel assistant application that assesses response accuracy and user satisfaction.
Example 2:
Measure the tool call accuracy of the agent.
The Tracing Code Gen tool shares best practices for adding tracing to AI apps. It helps developers monitor and debug AI agents and workflows with ease. This tool supports local tracing for these languages and SDKs:
Example requirement prompt:
Enable tracing for my AI travel assistant application built with the agent-framework SDK in Python.
In this article, you learned how to: