Fireworks Qwen3.6 Plus (Global)

Qwen 3.6 Plus is Alibaba's latest flagship closed model. Outside of Alibaba's own infrastructure, it is available exclusively through Fireworks AI. To request dedicated instances of Qwen 3.6 Plus, contact Fireworks AI.

Provider: Fireworks AI

API Endpoint

https://api.fireworks.ai/inference/v1/chat/completions
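The endpoint can also be called without a client library. The following is a minimal sketch using only the Python standard library. It assumes the endpoint accepts a Bearer token in the Authorization header and returns an OpenAI-style response body, as is usual for OpenAI-compatible APIs. The helper names build_chat_request and send_chat_request are illustrative and not part of any Fireworks SDK.

```python
import json
import urllib.request

API_URL = "https://api.fireworks.ai/inference/v1/chat/completions"


def build_chat_request(api_key, prompt, model="accounts/fireworks/models/qwen3p6-plus"):
    """Build (but do not send) a POST request for the chat completions endpoint."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 1024,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Assumption: Bearer-token authentication, as in the OpenAI API.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat_request(request):
    """Send the request and return the text of the first choice.

    Assumes an OpenAI-style response: choices[0].message.content.
    """
    with urllib.request.urlopen(request) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

Building the request separately from sending it makes the payload and headers easy to inspect before any network call is made.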

Quick Start (Python)

Install: pip install openai

from openai import OpenAI

client = OpenAI(
    api_key="your-fireworks-api-key",
    base_url="https://api.fireworks.ai/inference/v1",
)

response = client.chat.completions.create(
    model="accounts/fireworks/models/qwen3p6-plus",
    messages=[
        {"role": "user", "content": "Hello, how are you?"},
    ],
    max_tokens=1024,
    temperature=0.7,
)

print(response.choices[0].message.content)

Additional examples: Basic invoke, Streaming
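For streaming, a rough sketch along the same lines as the quick start is shown here. It assumes the streamed chunks follow the OpenAI chat-completions shape (choices[0].delta.content), which is what the openai client yields when stream=True. The helpers collect_stream_text, stream_reply, and fake_chunk are hypothetical names introduced for illustration; fake_chunk exists only so the text-joining logic can be checked offline.

```python
from types import SimpleNamespace


def collect_stream_text(chunks):
    """Concatenate the text deltas from an iterable of streamed chat chunks."""
    parts = []
    for chunk in chunks:
        if not chunk.choices:
            continue  # some chunks may carry no choices
        text = getattr(chunk.choices[0].delta, "content", None)
        if text:
            parts.append(text)
    return "".join(parts)


def stream_reply(prompt, api_key):
    """Open a streaming request and return the full reply text."""
    from openai import OpenAI  # imported lazily; requires `pip install openai`

    client = OpenAI(
        api_key=api_key,
        base_url="https://api.fireworks.ai/inference/v1",
    )
    stream = client.chat.completions.create(
        model="accounts/fireworks/models/qwen3p6-plus",
        messages=[{"role": "user", "content": prompt}],
        max_tokens=1024,
        stream=True,
    )
    return collect_stream_text(stream)


def fake_chunk(text):
    """Stand-in for a streamed chunk, for offline checks only."""
    delta = SimpleNamespace(content=text)
    return SimpleNamespace(choices=[SimpleNamespace(delta=delta)])
```

To display tokens as they arrive instead of collecting them, the loop body in collect_stream_text could print each text fragment with end="" and flush=True.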

Supported Parameters

Parameter    Type     Description
max_tokens   integer  Maximum tokens to generate. (≥1)
temperature  float    Controls randomness. (0–2) Default: 0.7.
top_p        float    Nucleus sampling threshold. (0–1) Default: 1.
stream       boolean  Stream response chunks as they are generated. Default: false.
stop         string   Stop sequence or array of stop sequences.
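The ranges and defaults listed for these parameters can be checked on the client side before a request is sent. The sketch below is one way to do that. It assumes the ranges are inclusive at both ends, and build_sampling_params is a hypothetical helper, not part of the Fireworks or OpenAI libraries.

```python
def build_sampling_params(max_tokens, temperature=0.7, top_p=1.0, stream=False, stop=None):
    """Validate request options against the documented ranges and return them as a dict."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_tokens, bool) or not isinstance(max_tokens, int) or max_tokens < 1:
        raise ValueError("max_tokens must be an integer >= 1")
    if not 0 <= temperature <= 2:
        raise ValueError("temperature must be between 0 and 2")
    if not 0 <= top_p <= 1:
        raise ValueError("top_p must be between 0 and 1")

    params = {
        "max_tokens": max_tokens,
        "temperature": temperature,
        "top_p": top_p,
        "stream": bool(stream),
    }
    if stop is not None:
        # Either a single stop sequence or a list of stop sequences.
        if not isinstance(stop, str) and not all(isinstance(s, str) for s in stop):
            raise ValueError("stop must be a string or a list of strings")
        params["stop"] = stop
    return params
```

The returned dict can be unpacked into the call from the quick start, for example client.chat.completions.create(model=..., messages=..., **build_sampling_params(256)).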

Feature Guides

Serverless Inference

Pay per token for public open models without managing GPU deployments.

OpenAI Compatibility

Use OpenAI-compatible client libraries by changing the API base URL.
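As an illustration of the base-URL switch, the sketch below gathers the client settings in one place and reads the key from the environment instead of hard-coding it. The environment variable name FIREWORKS_API_KEY and the helper fireworks_client_kwargs are assumptions made for this example, not names defined by Fireworks AI.

```python
import os

FIREWORKS_BASE_URL = "https://api.fireworks.ai/inference/v1"


def fireworks_client_kwargs(env=os.environ):
    """Return keyword arguments that point an OpenAI-compatible client at Fireworks."""
    api_key = env.get("FIREWORKS_API_KEY")  # assumed variable name
    if not api_key:
        raise RuntimeError("Set FIREWORKS_API_KEY before creating the client")
    return {"api_key": api_key, "base_url": FIREWORKS_BASE_URL}


# Usage (requires `pip install openai`):
#   from openai import OpenAI
#   client = OpenAI(**fireworks_client_kwargs())
```

Only the base URL and the key differ from a stock OpenAI setup; the rest of the client code stays the same.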
