Add more info to ANN_BENCH context #248

achirkin · 2024-07-24T09:30:46Z

Add extra information to benchmark context for better reproducibility and performance analysis:

Full command line used to call the executable (so you can copy-paste and run again).
More CUDA device information: whether HMM, AST, or host atomics are available (how GPU can efficiently communicate with CPU).
Host information: min/max frequences, used virtual processors and cores, available physical memory and swap (does the benchmark segfault due to not enough host memory? is SMT enabled? etc).

Addresses parts of #160

achirkin · 2024-07-24T09:33:28Z

I'm thinking whether we should include some environment variables if present or the machine hostname? Wouldn't that be a bit fishy in terms of user privacy when sharing the output files?

tfeher

Thanks Artem, LGTM, it is useful to extend the context information. I do not see any issues with adding the hostname.

tfeher · 2024-07-24T11:54:17Z

For env vars, I am not sure what would be relevant.
For OpenMP context, you could add omp_get_max_threads()

achirkin · 2024-07-24T12:00:38Z

We could get the openmp-related env vars to see if the limit to the number of threads is set explicitly.
I wouldn't like to add omp_get_max_threads() and other OpenMP functions to avoid introducing openmp dependency in the benchmark (for the algorithms that do not use OpenMP and for the ANN_BENCH executable in the single-exe mode). I'm also not sure if cuVS team decides to move away from using OpenMP in favor of standard C++ threads in future.

…UDA version

tfeher · 2024-07-29T10:58:35Z

/merge

Add extra information to benchmark context for better reproducibility and performance analysis: 1. Full command line used to call the executable (so you can copy-paste and run again). 2. More CUDA device information: whether HMM, AST, or host atomics are available (how GPU can efficiently communicate with CPU). 3. Host information: min/max frequences, used virtual processors and cores, available physical memory and swap (does the benchmark segfault due to not enough host memory? is SMT enabled? etc). Addresses parts of rapidsai#160 Authors: - Artem M. Chirkin (https://github.com/achirkin) Approvers: - Tamas Bela Feher (https://github.com/tfeher) URL: rapidsai#248

Add more info to ANN_BENCH context

4194296

achirkin requested a review from a team as a code owner July 24, 2024 09:30

github-actions bot added the cpp label Jul 24, 2024

tfeher approved these changes Jul 24, 2024

View reviewed changes

Use CUDART_VERSION to hide cuda properties not available in earlier C…

b0c6b8c

…UDA version

cjnolet added improvement Improves an existing functionality non-breaking Introduces a non-breaking change labels Jul 24, 2024

achirkin added 2 commits July 26, 2024 08:38

Merge branch 'branch-24.08' into fea-ann-bench-host-info

0b8c6bf

Merge branch 'branch-24.08' into fea-ann-bench-host-info

8ab3434

rapids-bot bot merged commit 98c07f9 into rapidsai:branch-24.08 Jul 29, 2024
54 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more info to ANN_BENCH context #248

Add more info to ANN_BENCH context #248

achirkin commented Jul 24, 2024

achirkin commented Jul 24, 2024

tfeher left a comment

tfeher commented Jul 24, 2024

achirkin commented Jul 24, 2024

tfeher commented Jul 29, 2024

Add more info to ANN_BENCH context #248

Add more info to ANN_BENCH context #248

Conversation

achirkin commented Jul 24, 2024

achirkin commented Jul 24, 2024

tfeher left a comment

Choose a reason for hiding this comment

tfeher commented Jul 24, 2024

achirkin commented Jul 24, 2024

tfeher commented Jul 29, 2024