GPT OSS 20b (Sydney)

GPT OSS 20b served through the OpenAI-compatible Bedrock Mantle endpoint in Sydney (ap-southeast-2). Requests are handled in-region, without cross-region routing.

Provider: OpenAI | AWS Bedrock

Inference regions: ap-southeast-2

API Endpoint

https://bedrock-mantle.ap-southeast-2.api.aws/v1/responses
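
For readers who want to see what goes over the wire, here is a minimal sketch of a request to this endpoint using only the Python standard library. It assumes the Bedrock API key is sent as a bearer token in the Authorization header (which is what the OpenAI SDK does with its api_key) and that the body fields match the parameters listed on this page. The helper names build_request and send are illustrative, not part of any SDK.

```python
import json
import urllib.request

ENDPOINT = "https://bedrock-mantle.ap-southeast-2.api.aws/v1/responses"


def build_request(token: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the Responses endpoint."""
    body = {
        "model": "openai.gpt-oss-20b",
        "input": prompt,
        "max_output_tokens": 1024,
        "store": False,
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Assumption: the key is passed as a bearer token, as the OpenAI SDK does.
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request) as resp:
        return json.load(resp)
```

To try it, call send(build_request(os.environ["AWS_BEARER_TOKEN_BEDROCK"], "Hello, how are you?")) after importing os and inspect the returned dictionary.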

Quick Start (Python)

Install: pip install openai

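from openai import OpenAI
import os

# The Bedrock API key is read from the environment and passed as the SDK's api_key.
client = OpenAI(
    api_key=os.environ["AWS_BEARER_TOKEN_BEDROCK"],
    # Base URL stops at /v1; the SDK appends /responses itself.
    base_url="https://bedrock-mantle.ap-southeast-2.api.aws/v1",
)

response = client.responses.create(
    model="openai.gpt-oss-20b",
    input="Hello, how are you?",
    max_output_tokens=1024,
    store=False,  # do not retain response state
)

# output_text is the generated text of the response.
print(response.output_text)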

Additional examples: Basic invoke, Streaming

Supported Parameters

Parameter | Type | Description
max_output_tokens | integer | Maximum number of visible output tokens to generate. (≥1)
stream | boolean | Stream response events as they are generated. Default: false.
store | boolean | Store response state for follow-up turns. Set false for zero-retention request handling. Default: false.
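
As a rough illustration of the stream parameter, the sketch below passes stream=True and prints text as it arrives. It assumes the endpoint emits the same event types as the OpenAI Responses API, in particular "response.output_text.delta" events carrying a delta string; that is not confirmed on this page. The helpers collect_text and stream_reply are hypothetical names used only for this example.

```python
import os
from typing import Iterable


def collect_text(events: Iterable) -> str:
    """Print text deltas as they arrive and return the concatenated text."""
    parts = []
    for event in events:
        # Assumption: text chunks arrive as "response.output_text.delta" events.
        if getattr(event, "type", None) == "response.output_text.delta":
            print(event.delta, end="", flush=True)
            parts.append(event.delta)
    print()
    return "".join(parts)


def stream_reply(prompt: str) -> str:
    """Send one streaming request and return the full reply text."""
    # Imported here so collect_text can be used without the SDK installed.
    from openai import OpenAI

    client = OpenAI(
        api_key=os.environ["AWS_BEARER_TOKEN_BEDROCK"],
        base_url="https://bedrock-mantle.ap-southeast-2.api.aws/v1",
    )
    events = client.responses.create(
        model="openai.gpt-oss-20b",
        input=prompt,
        max_output_tokens=1024,
        store=False,
        stream=True,  # return an iterator of events instead of one response
    )
    return collect_text(events)
```

Calling stream_reply("Hello, how are you?") prints the reply incrementally; events of any other type are ignored.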

Feature Guides

OpenAI-compatible Responses API

Use the OpenAI SDK with a Bedrock API key and the regional bedrock-mantle endpoint.
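
One way to wire the key and endpoint together is a small settings helper that fails early when the key is missing. This is only a sketch: mantle_settings is a hypothetical helper, and the environment variable name is taken from the Quick Start above.

```python
import os
from typing import Mapping

BASE_URL = "https://bedrock-mantle.ap-southeast-2.api.aws/v1"


def mantle_settings(env: Mapping[str, str] = os.environ) -> dict:
    """Return keyword arguments for openai.OpenAI(...), or raise if no key is set."""
    token = env.get("AWS_BEARER_TOKEN_BEDROCK")
    if not token:
        raise RuntimeError("Set AWS_BEARER_TOKEN_BEDROCK to a Bedrock API key.")
    return {"api_key": token, "base_url": BASE_URL}
```

With the SDK installed, the client can then be created as OpenAI(**mantle_settings()).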
