fix: subsequent requests cannot be sent until 'num_concurrent_requests' requests have all finished in non-block mode #59
…s' requests have all finished in non-blocking mode
Signed-off-by: Sungjae Lee <[email protected]>
d75db69 to 570f780
Hey, has there been any progress on this issue? It seems detrimental to the correct operation of llmperf, yet no one is giving it any attention.
It might be better to move it to a different file in an overlay manner instead of modifying the Anyway, I have been using llmperf without any errors since applying this commit.
I made a commit 5 months ago to move the prompt construction outside the send loop so that the benchmark doesn't get slowed down. I didn't know it affected the num_concurrent_requests mode. Let me know if I can help in some way.
@gracehonv
It does look like your commit prepares all of the prompts before launching the concurrent requests, so there shouldn't be any slowdown. Thanks for fixing!
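For readers following the thread, here is a minimal sketch of the behavior under discussion: prompts are built up front, and a new request is sent as soon as any in-flight request finishes, rather than waiting for a whole batch of `num_concurrent_requests` to complete. This is not llmperf's actual code; `fake_request`, `run_sliding_window`, and the `delays` argument are hypothetical names used only for illustration, and the standard-library `concurrent.futures` module stands in for whatever client llmperf really uses.

```python
import concurrent.futures as cf
import time


def fake_request(prompt: str, delay: float) -> str:
    """Hypothetical stand-in for an LLM call: sleep, then echo the prompt."""
    time.sleep(delay)
    return prompt.upper()


def run_sliding_window(prompts, delays, num_concurrent_requests):
    """Keep up to num_concurrent_requests in flight, refilling as each finishes."""
    results = {}
    future_to_idx = {}
    pending = set()
    next_idx = 0
    with cf.ThreadPoolExecutor(max_workers=num_concurrent_requests) as pool:
        while next_idx < len(prompts) or pending:
            # Top up the window: submit until the concurrency limit is reached.
            while next_idx < len(prompts) and len(pending) < num_concurrent_requests:
                fut = pool.submit(fake_request, prompts[next_idx], delays[next_idx])
                future_to_idx[fut] = next_idx
                pending.add(fut)
                next_idx += 1
            # Wake up as soon as ANY request finishes, not when all of them do.
            done, pending = cf.wait(pending, return_when=cf.FIRST_COMPLETED)
            for fut in done:
                results[future_to_idx[fut]] = fut.result()
    # Return results in the original prompt order.
    return [results[i] for i in range(len(prompts))]


if __name__ == "__main__":
    # Prompts are prepared before any request is sent, so prompt
    # construction never delays the send loop.
    prompts = [f"prompt {i}" for i in range(5)]
    delays = [0.05, 0.01, 0.01, 0.01, 0.01]
    print(run_sliding_window(prompts, delays, num_concurrent_requests=2))
```

With `return_when=cf.FIRST_COMPLETED`, one slow request (the first one above) occupies only its own slot while the other slot keeps cycling through the remaining prompts. Waiting with `cf.ALL_COMPLETED` instead would reproduce the batch-style stall described in the PR title.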
@cpwan, |
Thank you!
issues
#43
#56
Summary