This guide walks you through setting up and making API calls to Claude in Foundry using Python, TypeScript, or direct HTTP requests. When you access Claude in Foundry, Claude usage is billed through the Microsoft Marketplace against your Azure subscription, so you can use Claude’s latest capabilities while managing costs through Azure.
Regional availability: At launch, Claude is available as a Global Standard deployment type in Foundry resources, with US DataZone coming soon.
Pricing: Claude in the Microsoft Marketplace uses Anthropic’s standard API pricing. Visit our pricing page for details.

Preview

In this preview platform integration, Claude models run on Anthropic’s infrastructure; the integration provides commercial billing and access through Azure. Anthropic acts as an independent processor for Microsoft, so customers using Claude through Microsoft Foundry are subject to Anthropic’s data use terms. Anthropic continues to provide its industry-leading safety and data commitments, including zero data retention availability.

Prerequisites

Before you begin, ensure you have:
  • An active Azure subscription
  • Access to Foundry
  • The Azure CLI installed (optional, for resource management)

Install an SDK

Anthropic’s client SDKs support Foundry through platform-specific packages.
# Python
pip install -U "anthropic"

# TypeScript
npm install @anthropic-ai/foundry-sdk

Provisioning

Foundry uses a two-level hierarchy: resources contain your security and billing configuration, while deployments are the model instances you call via API. You’ll first create a Foundry resource, then create one or more Claude deployments within it.

Provisioning Foundry resources

Create a Foundry resource, which is required to use and manage services in Azure. You can follow these instructions to create one, or start by creating a Foundry project, which creates a Foundry resource as part of the process. To provision your resource:
  1. Navigate to the Foundry portal
  2. Create a new Foundry resource or select an existing one
  3. Configure access management using Azure-issued API keys or Entra ID for role-based access control
  4. Optionally configure the resource to be part of a private network (Azure Virtual Network) for enhanced security
  5. Note your resource name—you’ll use this as {resource} in API endpoints (e.g., https://{resource}.services.ai.azure.com/anthropic/v1/*)

Creating Foundry deployments

After creating your resource, deploy a Claude model to make it available for API calls:
  1. In the Foundry portal, navigate to your resource
  2. Go to Models + endpoints and select + Deploy model > Deploy base model
  3. Search for and select a Claude model (e.g., claude-sonnet-4-5)
  4. Configure deployment settings:
    • Deployment name: Defaults to the model ID, but you can customize it (e.g., my-claude-deployment). The deployment name cannot be changed after the deployment is created.
    • Deployment type: Select Global Standard (recommended for Claude)
  5. Select Deploy and wait for provisioning to complete
  6. Once deployed, you can find your endpoint URL and keys under Keys and Endpoint
The deployment name you choose becomes the value you pass in the model parameter of your API requests. You can create multiple deployments of the same model with different names to manage separate configurations or rate limits.

Authentication

Claude on Foundry supports two authentication methods: API keys and Entra ID tokens. Both methods use Azure-hosted endpoints in the format https://{resource}.services.ai.azure.com/anthropic/v1/*.

API key authentication

After provisioning your Foundry Claude resource, you can obtain an API key from the Foundry portal:
  1. Navigate to your resource in the Foundry portal
  2. Go to Keys and Endpoint section
  3. Copy one of the provided API keys
  4. Use either the api-key or x-api-key header in your requests
The Python and TypeScript SDKs require an API key and a resource name. The SDKs will automatically read these from the ANTHROPIC_FOUNDRY_API_KEY and ANTHROPIC_FOUNDRY_RESOURCE environment variables if they are defined. Example using API key:
import os
from anthropic import AnthropicFoundry

client = AnthropicFoundry(
    api_key=os.environ.get("ANTHROPIC_FOUNDRY_API_KEY"),
    resource_name="{resource}",
)

message = client.messages.create(
    model="claude-sonnet-4-5",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}]
)
print(message.content)
Keep your API keys secure. Never commit them to version control or share them publicly. Anyone with access to your API key can make requests to Claude through your Foundry resource.
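If you prefer direct HTTP requests over an SDK, you can call the Azure-hosted endpoint with the api-key header. The sketch below uses the third-party requests library and assumes the Messages endpoint is exposed at /anthropic/v1/messages under your resource (following the /anthropic/v1/* format shown above); the anthropic-version header is included as on the first-party API and may not be required here:
import os
import requests

# Direct HTTP sketch; the exact path and required headers are assumptions
# based on the endpoint format documented above.
resource = os.environ["ANTHROPIC_FOUNDRY_RESOURCE"]  # your Foundry resource name
api_key = os.environ["ANTHROPIC_FOUNDRY_API_KEY"]    # from Keys and Endpoint

url = f"https://{resource}.services.ai.azure.com/anthropic/v1/messages"

response = requests.post(
    url,
    headers={
        "api-key": api_key,                 # the x-api-key header is also accepted
        "anthropic-version": "2023-06-01",  # as on the first-party API; may not be required
        "content-type": "application/json",
    },
    json={
        "model": "claude-sonnet-4-5",       # your deployment name
        "max_tokens": 1024,
        "messages": [{"role": "user", "content": "Hello!"}],
    },
)
response.raise_for_status()
print(response.json()["content"])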

Microsoft Entra authentication

For enhanced security and centralized access management, you can use Entra ID (formerly Azure Active Directory) tokens:
  1. Enable Entra authentication for your Foundry resource
  2. Obtain an access token from Entra ID
  3. Use the token in the Authorization: Bearer {TOKEN} header
Example using Entra ID:
import os
from anthropic import AnthropicFoundry
from azure.identity import DefaultAzureCredential, get_bearer_token_provider

# Get Azure Entra ID token using token provider pattern
token_provider = get_bearer_token_provider(
    DefaultAzureCredential(),
    "https://cognitiveservices.azure.com/.default"
)

# Create client with Entra ID authentication
client = AnthropicFoundry(
    resource_name="{resource}",  # Your Azure resource name
    azure_ad_token_provider=token_provider  # Use token provider for Entra ID auth
)

# Make request
message = client.messages.create(
    model="claude-sonnet-4-5",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}]
)
print(message.content)
Microsoft Entra ID authentication allows you to manage access using Azure RBAC, integrate with your organization’s identity management, and avoid managing API keys manually.
Replace {resource} with your actual Azure resource name. You can use either the api-key header (shown above) or the x-api-key header; both are supported.

Model parameter and deployments

The model parameter in your API requests accepts deployment names. The default deployment name suggested in the portal is the model ID (e.g., claude-sonnet-4-5), but you can customize the name when you create the deployment in the Foundry portal; it cannot be changed afterwards. Example with custom deployment:
# If you've created a custom deployment named "my-claude-deployment"
message = client.messages.create(
    model="my-claude-deployment",  # Your custom deployment name
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}]
)
Deployments allow you to manage different model configurations, versions, or rate limits through Azure without changing your application code. See our client SDKs for more details, and the official Foundry docs here.

Correlation request IDs

Foundry includes request identifiers in HTTP response headers for debugging and tracing. When contacting support, provide both the request-id and apim-request-id values to help teams quickly locate and investigate your request across both Anthropic and Azure systems.
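If you use the Python SDK, one way to capture these identifiers is the raw-response interface, sketched below; this assumes AnthropicFoundry exposes the same with_raw_response accessor as the standard Anthropic Python client:
import os
from anthropic import AnthropicFoundry

client = AnthropicFoundry(
    api_key=os.environ.get("ANTHROPIC_FOUNDRY_API_KEY"),
    resource_name="{resource}",
)

# with_raw_response returns the HTTP headers alongside the parsed message;
# assumed to behave as in the standard Anthropic Python SDK.
raw = client.messages.with_raw_response.create(
    model="claude-sonnet-4-5",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}],
)

print(raw.headers.get("request-id"))       # Anthropic-side request identifier
print(raw.headers.get("apim-request-id"))  # Azure API Management identifier

message = raw.parse()  # the usual Message object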

Supported features

Claude on Foundry supports most of Claude’s powerful features. You can find all the features currently supported here.

Features not supported

  • Admin API (/v1/organizations/* endpoints)
  • Models API (/v1/models)
  • Message Batch API (/v1/messages/batches)

API responses

API responses from Claude on Foundry follow the standard Anthropic API response format. This includes the usage object in response bodies, which provides detailed token consumption information for your requests. The usage object is consistent across all platforms (first-party API, Foundry, Amazon Bedrock, and Google Vertex AI). For details on response headers specific to Foundry, see the correlation request IDs section.
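For example, assuming a client configured as in the authentication examples above, token counts can be read directly from the usage object:
message = client.messages.create(
    model="claude-sonnet-4-5",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}],
)

# The usage object reports token consumption for this request
print(message.usage.input_tokens)   # tokens in the request
print(message.usage.output_tokens)  # tokens generated in the response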

API model IDs and deployments

The following Claude models are available through Foundry. The latest generation models (Sonnet 4.5, Opus 4.1, and Haiku 4.5) offer the most advanced capabilities:
Model             | Default Deployment Name
Claude Sonnet 4.5 | claude-sonnet-4-5
Claude Opus 4.1   | claude-opus-4-1
Claude Haiku 4.5  | claude-haiku-4-5
By default, deployment names match the model IDs shown above. However, you can create custom deployments with different names in the Foundry portal to manage different configurations, versions, or rate limits. Use the deployment name (not necessarily the model ID) in your API requests.

Monitoring and logging

Azure provides comprehensive monitoring and logging capabilities for your Claude usage through standard Azure patterns:
  • Azure Monitor: Track API usage, latency, and error rates
  • Azure Log Analytics: Query and analyze request/response logs
  • Cost Management: Monitor and forecast costs associated with Claude usage
Anthropic recommends logging your activity on at least a 30-day rolling basis to understand usage patterns and investigate any potential issues.
Azure’s logging services are configured within your Azure subscription. Enabling logging does not provide Microsoft or Anthropic access to your content beyond what’s necessary for billing and service operation.
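One way to follow the 30-day recommendation on the application side is to log per-request metadata (not message content) to a rotating log file; the sketch below is illustrative and assumes a client configured as in the authentication examples above:
import logging
from logging.handlers import TimedRotatingFileHandler

# Illustrative sketch: rotate the log daily and keep 30 days of history
handler = TimedRotatingFileHandler("claude_usage.log", when="D", interval=1, backupCount=30)
handler.setFormatter(logging.Formatter("%(asctime)s %(message)s"))

logger = logging.getLogger("claude_usage")
logger.setLevel(logging.INFO)
logger.addHandler(handler)

message = client.messages.create(
    model="claude-sonnet-4-5",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}],
)

# Log request metadata only, not prompt or completion content
logger.info(
    "model=%s input_tokens=%s output_tokens=%s",
    message.model,
    message.usage.input_tokens,
    message.usage.output_tokens,
)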

Troubleshooting

Authentication errors

Error: 401 Unauthorized or Invalid API key
  • Solution: Verify your API key is correct. You can obtain a new API key from the Azure portal under Keys and Endpoint for your Claude resource.
  • Solution: If using Microsoft Entra ID, ensure your access token is valid and hasn’t expired. Tokens typically expire after 1 hour.
Error: 403 Forbidden
  • Solution: Your Azure account may lack the necessary permissions. Ensure you have the appropriate Azure RBAC role assigned (e.g., “Cognitive Services OpenAI User”).

Rate limiting

Error: 429 Too Many Requests
  • Solution: You’ve exceeded your rate limit. Implement exponential backoff and retry logic in your application, as in the sketch after this list.
  • Solution: Consider requesting rate limit increases through the Azure portal or Azure support.
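A minimal retry sketch, assuming a client configured as in the authentication examples above and that the Python SDK raises anthropic.RateLimitError on HTTP 429 responses (the SDK also has configurable built-in retries via the max_retries client option):
import time
import anthropic

def create_with_backoff(client, max_attempts=5, **kwargs):
    # Retry on 429 responses with exponential backoff: 1s, 2s, 4s, 8s, ...
    for attempt in range(max_attempts):
        try:
            return client.messages.create(**kwargs)
        except anthropic.RateLimitError:
            if attempt == max_attempts - 1:
                raise
            time.sleep(2 ** attempt)

message = create_with_backoff(
    client,
    model="claude-sonnet-4-5",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}],
)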

Rate limit headers

Foundry does not include Anthropic’s standard rate limit headers in responses:
  • anthropic-ratelimit-tokens-limit, anthropic-ratelimit-tokens-remaining, anthropic-ratelimit-tokens-reset
  • anthropic-ratelimit-input-tokens-limit, anthropic-ratelimit-input-tokens-remaining, anthropic-ratelimit-input-tokens-reset
  • anthropic-ratelimit-output-tokens-limit, anthropic-ratelimit-output-tokens-remaining, anthropic-ratelimit-output-tokens-reset
Manage rate limiting through Azure’s monitoring tools instead.

Model and deployment errors

Error: Model not found or Deployment not found
  • Solution: Verify you’re using the correct deployment name. If you haven’t created a custom deployment, use the default model ID (e.g., claude-sonnet-4-5).
  • Solution: Ensure the model/deployment is available in your Azure region.
Error: Invalid model parameter
  • Solution: The model parameter should contain your deployment name, which can be customized in the Foundry portal. Verify the deployment exists and is properly configured.

Additional resources