AnalysisAI Models
23 days ago
Qwen3.6 27B achieves 1000 t/s with V100s in concurrent setup
The setup achieved 1000 t/s generation on Qwen3.6 27B with 128 concurrent requests using V100 GPUs. For single user, generation speed drops to 80 t/s with 3000 t/s processing.
·
23 days ago
