TTFT deteriorates rapidly after Concurrency reaches 72.

by theGreatGuy - opened 6 days ago

6 days ago

When I use vLLM-benchmark to test performance of Kimi-Dev-72B, I find that TTFT deteriorates rapidly after Concurrency reaches 72. Anyone knows reason?

theGreatGuy

6 days ago

In addition, I use evalScope to test the model accuracy and found that its accuracy was only 0.5488 in the humaneval dataset.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment