OpenAI-compatible Bedrock Mantle model available in Stockholm. Direct inference without cross-region routing.
Provider: All NVIDIA models | AWS Bedrock
Inference regions: eu-north-1
https://bedrock-mantle.eu-north-1.api.aws/v1/responsesInstall: pip install openai
from openai import OpenAI
import os
client = OpenAI(
api_key=os.environ["AWS_BEARER_TOKEN_BEDROCK"],
base_url="https://bedrock-mantle.eu-north-1.api.aws/v1",
)
response = client.responses.create(
model="nvidia.nemotron-nano-3-30b",
input="Hello, how are you?",
max_output_tokens=1024,
store=False,
)
print(response.output_text)Additional examples: Basic invoke, Streaming
| Parameter | Type | Description |
|---|---|---|
| max_output_tokens | integer | Maximum number of visible output tokens to generate. (≥1) |
| stream | boolean | Stream response events as they are generated. Default: false. |
| store | boolean | Store response state for follow-up turns. Set false for zero-retention request handling. Default: false. |
Use the OpenAI SDK with a Bedrock API key and the regional bedrock-mantle endpoint.