Fireworks Qwen3.6 Plus (Global)

Qwen 3.6 Plus is Alibaba's latest flagship closed model, available exclusively through Fireworks AI outside of Alibaba's own infrastructure. Please contact Fireworks AI to get dedicated instances for Qwen 3.6 Plus.

Provider: All Qwen models | Fireworks AI

API Endpoint

https://api.fireworks.ai/inference/v1/chat/completions

Quick Start (Python)

Install: pip install openai

from openai import OpenAI

client = OpenAI(
    api_key="your-fireworks-api-key",
    base_url="https://api.fireworks.ai/inference/v1",
)

response = client.chat.completions.create(
    model="accounts/fireworks/models/qwen3p6-plus",
    messages=[
        {"role": "user", "content": "Hello, how are you?"},
    ],
    max_tokens=1024,
    temperature=0.7,
)

print(response.choices[0].message.content)

Additional examples: Basic invoke, Streaming

Supported Parameters

Parameter	Type	Description
max_tokens	integer	Maximum tokens to generate. (≥1)
temperature	float	Controls randomness. (0–2) Default: 0.7.
top_p	float	Nucleus sampling threshold. (0–1) Default: 1.
stream	boolean	Stream response chunks as they are generated. Default: false.
stop	string	Stop sequence or array of stop sequences.

Feature Guides

Serverless Inference

Pay per token for public open models without managing GPU deployments.

Documentation

OpenAI Compatibility

Use OpenAI-compatible client libraries by changing the API base URL.

Documentation