Fix ANN bench latency #1940
The initial version of this PR adds a helper ANN benchmark class: FixLatencyWorkload. This can be configured to spend
We can see the following issues:
In throughput mode I believe we want to define vector search
It would be nice to even be able to set the min/max threads (and set them equal if one desires to run only a single-threaded experiment).
LGTM!
Fixed the problem. Pending CI it is ready to merge. Example output
It is recommended to use
Co-authored-by: Tamas Bela Feher <[email protected]>
…ix_ann_bench_latency
/merge
auto duration = std::chrono::duration_cast<std::chrono::duration<double>>(end - start).count();
state.counters.insert({{"end_to_end", duration}});
}
cudaDeviceSynchronize();
CPU-only builds fail at this point:
benchmark.hpp:299:3: error: there are no arguments to 'cudaDeviceSynchronize' that depend on a template parameter, so a declaration of 'cudaDeviceSynchronize' must be available [-fpermissive]
299 | cudaDeviceSynchronize()
I thought we had cuda_stub.hpp to help with this, but it does not work. If we are in a hurry, we could disable the sync here, but it would be better to fix the CUDA stubs.
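One way the stub approach could look is sketched below. This is not the contents of cuda_stub.hpp: the macro `BENCH_CPU_ONLY` and the function `timed_run` are hypothetical names chosen for illustration. The point is that the compiler error comes from calling an undeclared function with no template-dependent arguments inside a template, so a no-op declaration must be visible before the template is defined.

```cpp
#include <chrono>

// Hypothetical build flag for this sketch; the real project may use a
// different macro name to mark CPU-only builds.
#ifndef BENCH_CPU_ONLY
#define BENCH_CPU_ONLY 1
#endif

#if BENCH_CPU_ONLY
// CPU-only build: CUDA headers are unavailable, so declare a no-op with the
// same name. It must appear before any template that calls it, because the
// call has no template-dependent arguments and is looked up at definition.
using cudaError_t = int;
constexpr cudaError_t cudaSuccess = 0;
inline cudaError_t cudaDeviceSynchronize() { return cudaSuccess; }
#else
#include <cuda_runtime_api.h>
#endif

// Hypothetical stand-in for the templated benchmark body: runs the work,
// synchronizes the device (a no-op on CPU), and returns elapsed seconds.
template <typename Work>
double timed_run(Work&& work)
{
  auto start = std::chrono::steady_clock::now();
  work();
  cudaDeviceSynchronize();
  auto end = std::chrono::steady_clock::now();
  return std::chrono::duration_cast<std::chrono::duration<double>>(end - start).count();
}
```

With a stub like this in place the sync call can stay in the benchmark for both build types, rather than being disabled for CPU-only builds.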
This adds an explicit latency column to the benchmark.
Authors:
- Tamas Bela Feher (https://github.com/tfeher)
- Corey J. Nolet (https://github.com/cjnolet)
Approvers:
- Corey J. Nolet (https://github.com/cjnolet)
URL: rapidsai#1940