TTFT deteriorates rapidly after Concurrency reaches 72.
#5
by
theGreatGuy
- opened
When I use vLLM-benchmark to test performance of Kimi-Dev-72B, I find that TTFT deteriorates rapidly after Concurrency reaches 72. Anyone knows reason?