LaunchDevelopers
Jun 15, 11:34 PM
vLLM adds new streaming parser for Qwen3+ in nightly
GitHub PR #45413 introduces a streaming parser that fixes Qwen3.6-27b stopping mid-turn and failing tool calls due to chunk boundaries. Available in vLLM nightly.
GitHub PR #45413 introduces a streaming parser that fixes Qwen3.6-27b stopping mid-turn and failing tool calls due to chunk boundaries. Available in vLLM nightly.