OpenAI-compatible Bedrock Mantle model available in Stockholm. Direct inference without cross-region routing.
Provider: All Qwen models | AWS Bedrock
Inference regions: eu-north-1
https://bedrock-mantle.eu-north-1.api.aws/v1/responsesInstall: pip install boto3
import boto3
client = boto3.client("bedrock-runtime", region_name="eu-north-1")
response = client.converse(
modelId="qwen.qwen3-coder-next",
messages=[
{
"role": "user",
"content": [{"text": "Hello, how are you?"}],
}
],
inferenceConfig={
"maxTokens": 1024,
"temperature": 0.7,
},
)
print(response["output"]["message"]["content"][0]["text"])Additional examples: Basic invoke, Streaming
| Parameter | Type | Description |
|---|---|---|
| max_output_tokens | integer | Maximum number of visible output tokens to generate. (≥1) |
| stream | boolean | Stream response events as they are generated. Default: false. |
| store | boolean | Store response state for follow-up turns. Set false for zero-retention request handling. Default: false. |
Use the OpenAI SDK with a Bedrock API key and the regional bedrock-mantle endpoint.