Pegasus 1 2

Routes inference requests from Seoul to Twelve Labs models in Hyderabad, Mumbai and 6 more regions

Inference regions: ap-south-2, ap-south-1, ap-southeast-1, ap-northeast-3, ap-southeast-2, ap-northeast-2, ap-northeast-1, ap-southeast-4

API Endpoint

https://bedrock-runtime.ap-northeast-2.amazonaws.com

Parameter	Type	Description
max_tokens	integer	Maximum number of tokens to generate in the response. (≥1)
temperature	float	Controls randomness. Lower values make output more deterministic. (0–1) Default: 1.
top_p	float	Nucleus sampling threshold. Considers tokens with cumulative probability up to this value. (0–1) Default: 1.
stop_sequences	string	Up to 4 sequences where the model will stop generating.
top_k	integer	Only sample from the top K most likely tokens at each step. (0–500) Default: 250.

Default mode. Pay per token with no upfront commitment.