-
Notifications
You must be signed in to change notification settings - Fork 235
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support spilling to disk directly via cuFile/GDS #1313
Conversation
Signed-off-by: Rong Ou <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My only comment on this is that either we need to hide the config, or we need to document it better and explain it is an experimental feature and point to more information about GDS if we want others to try it out.
sql-plugin/src/main/scala/com/nvidia/spark/rapids/shuffle/BufferSendState.scala
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/RapidsGdsStore.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/RapidsGdsStore.scala
Outdated
Show resolved
Hide resolved
@revans2 added wording to the config to specify it's an experimental feature, and link to gds docs. |
sql-plugin/src/main/scala/com/nvidia/spark/rapids/RapidsGdsStore.scala
Outdated
Show resolved
Hide resolved
Signed-off-by: Rong Ou <[email protected]>
build |
build |
build |
1 similar comment
build |
* support spilling to GDS Signed-off-by: Rong Ou <[email protected]> * address review comments * review feedback Signed-off-by: Rong Ou <[email protected]>
* support spilling to GDS Signed-off-by: Rong Ou <[email protected]> * address review comments * review feedback Signed-off-by: Rong Ou <[email protected]>
…p ci] [bot] (NVIDIA#1313) * Update submodule cudf to 9c559c94fcaa1525d7c95faf94e5486fcd992ef1 Signed-off-by: spark-rapids automation <[email protected]> * Update submodule cudf to b7994bc16b1b1743b0743860b4f02ac4da8245d5 Signed-off-by: spark-rapids automation <[email protected]> * Update submodule cudf to 15cc5011902f1026e04662e725b880f48d38ba8d Signed-off-by: spark-rapids automation <[email protected]> --------- Signed-off-by: spark-rapids automation <[email protected]>
This adds an option to spill to disk directly via GDS. Default is off and shouldn't have any effect on existing behavior.
Initial test on my local desktop shows that it performs similarly to spilling to host memory, with a single Samsung 970 EVO Plus 2TB NVMe M.2 drive. Need to do more benchmarking in real deployment environments.
Depends on rapidsai/cudf#6940.