Alibaba: qwen3.5-plus

qwen3.5-plus is the Plus model of the Qwen3.5 native vision-language series. It is built on a hybrid architecture that combines linear attention with a sparse mixture-of-experts (MoE) design for higher inference efficiency. Across many benchmarks, the 3.5 series reaches performance comparable to top frontier models, with major gains over the 3 series on both text-only and multimodal tasks. This release is functionally equivalent to the qwen3.5-plus-2026-02-15 snapshot.

Provider: Alibaba
Input types: not listed
Output types: not listed
Publish Time: None
Group price
Price information for different user groups
Auto Group routing: default

Group      Billing type     Input Price           Output Price
default    Pay as you go    $0.1100 / M tokens    $0.6600 / M tokens
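
To make the per-token pricing concrete, here is a minimal sketch that turns the default-group prices above into a per-request cost estimate. The function name `estimate_cost` and the token counts are hypothetical and chosen only for illustration. The sketch also assumes that billing is strictly linear in token count, which the page does not confirm.

```python
# Prices from the "default" group above, in USD per million tokens.
INPUT_PRICE_PER_M = 0.11
OUTPUT_PRICE_PER_M = 0.66


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return an estimated pay-as-you-go cost in USD for one request.

    Assumes simple linear billing; rounding and minimum charges,
    if any, are not modeled.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 12,000 prompt tokens and 800 completion tokens.
cost = estimate_cost(12_000, 800)
print(f"Estimated cost: ${cost:.6f}")
```

Output tokens cost six times as much as input tokens at these prices, so long completions dominate the bill even when prompts are large.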
Provider: Alibaba
Pricing: $0.1100 / M tokens
Input video: -
Input audio: -
Web Search: -
Cache pricing: -
Context window: -
Max output: -
Latency: 397 ms
Throughput: 170 TPS
Availability: 97.50%
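
The latency and throughput figures can be combined into a rough estimate of how long a response takes. The sketch below assumes the listed latency is the time to the first token and the listed throughput is a steady generation rate in tokens per second. The page does not define either metric, so treat the result as a ballpark figure only. The function name `estimate_response_time` is hypothetical.

```python
LATENCY_S = 0.397      # listed latency of 397 ms (assumed: time to first token)
THROUGHPUT_TPS = 170   # listed throughput in tokens per second


def estimate_response_time(output_tokens: int) -> float:
    """Return a rough end-to-end response time in seconds.

    Model: fixed startup latency plus generation time at a constant rate.
    """
    return LATENCY_S + output_tokens / THROUGHPUT_TPS


# Hypothetical completion of 850 tokens.
seconds = estimate_response_time(850)
print(f"Estimated response time: {seconds:.2f} s")
```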
API call example
Connect quickly using the standard OpenAI-compatible API
Python
import openai

# Create a client that points at the OpenAI-compatible endpoint.
client = openai.OpenAI(
    api_key="<YOUR_API_KEY>",
    base_url="http://localhost:3000/v1"
)

# Send a single-turn chat request to qwen3.5-plus.
response = client.chat.completions.create(
    model="qwen3.5-plus",
    messages=[
        {"role": "user", "content": "What model are you?"}
    ]
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)
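
For repeated calls, the request above can be wrapped in a small helper. This is a sketch rather than part of any official SDK: `build_request` and `ask` are hypothetical names, and the optional system prompt and `max_tokens` cap are illustrative additions that use standard Chat Completions parameters. The helper takes any client object that exposes `chat.completions.create`, such as the `openai.OpenAI` client constructed above.

```python
from typing import Any, Optional

MODEL = "qwen3.5-plus"


def build_request(prompt: str,
                  system: Optional[str] = None,
                  max_tokens: Optional[int] = None) -> dict[str, Any]:
    """Build keyword arguments for client.chat.completions.create()."""
    messages = []
    if system is not None:
        # An optional system message goes before the user message.
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})

    request: dict[str, Any] = {"model": MODEL, "messages": messages}
    if max_tokens is not None:
        # Only include the cap when the caller asks for one.
        request["max_tokens"] = max_tokens
    return request


def ask(client: Any, prompt: str, **options: Any) -> str:
    """Send one prompt and return the text of the first choice."""
    response = client.chat.completions.create(**build_request(prompt, **options))
    return response.choices[0].message.content
```

With the client from the example above, `ask(client, "What model are you?")` returns the reply text. Passing `max_tokens` caps the completion length, which also bounds the output-token cost of the request.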
