[DOC]: Modernize `cuda.parallel.experimental.reduce_into` example in the docstring #2307

leofang · 2024-08-28T05:47:26Z

Is this a duplicate?

I confirmed there appear to be no duplicate issues for this bug and that I agree to the Code of Conduct

Is this for new documentation, or an update to existing docs?

Update

Describe the incorrect/future/missing documentation

The existing docs are based on this test file:

cccl/python/cuda_parallel/tests/test_reduce_api.py

Lines 14 to 38 in 0a1cddb

    
    def test_device_reduce():
        # example-begin reduce-min
        def op(a, b):
            return a if a < b else b

        dtype = numpy.int32
        h_init = numpy.array([42], dtype)
        h_input = numpy.array([8, 6, 7, 5, 3, 0, 9])
        d_output = cuda.device_array(1, dtype)
        d_input = cuda.to_device(h_input)

        # Instantiate reduction for the given operator and initial value
        reduce_into = cudax.reduce_into(d_output, d_output, op, h_init)

        # Determine temporary device storage requirements
        temp_storage_size = reduce_into(None, d_input, d_output, h_init)

        # Allocate temporary storage
        d_temp_storage = cuda.device_array(temp_storage_size, dtype=numpy.uint8)

        # Run reduction
        reduce_into(d_temp_storage, d_input, d_output, h_init)

        expected_output = 0
        # example-end reduce-min

However, this is not the modern, canonical usage that we'd like to encourage. For example, CuPy ndarrays should be used instead of the primitive Numba device arrays. Supporting them might require some internal changes in how cuda.parallel fetches the pointer to the ndarray's buffer.
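One possible way to fetch a buffer pointer without depending on a specific array library is the CUDA Array Interface, which both Numba device arrays and CuPy ndarrays expose through the `__cuda_array_interface__` attribute. The sketch below is only an illustration: `device_pointer` and `FakeDeviceArray` are hypothetical names, and whether cuda.parallel should use this protocol internally is an assumption, not something this issue decides. The fake array class exists only so that the snippet runs without a GPU.

```python
from typing import Any


def device_pointer(array: Any) -> int:
    """Return the raw device address exposed by a CUDA array-like object.

    Hypothetical helper: it relies only on the CUDA Array Interface,
    so it does not need to know which library created the array.
    """
    interface = getattr(array, "__cuda_array_interface__", None)
    if interface is None:
        raise TypeError(
            f"{type(array).__name__} does not expose __cuda_array_interface__"
        )
    # The "data" entry is a (pointer, read_only) tuple.
    pointer, _read_only = interface["data"]
    return pointer


class FakeDeviceArray:
    """Hypothetical stand-in for a device array; holds no real GPU memory."""

    def __init__(self, pointer: int, size: int) -> None:
        self.__cuda_array_interface__ = {
            "shape": (size,),
            "typestr": "<i4",  # little-endian int32
            "data": (pointer, False),
            "version": 3,
        }


fake = FakeDeviceArray(pointer=0x7F0000000000, size=7)
print(hex(device_pointer(fake)))
```

With real arrays, the same helper would be called on a CuPy ndarray or a Numba device array in place of `FakeDeviceArray`; the address it returns is only meaningful while the owning array is alive.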

If this is a correction, please provide a link to the incorrect documentation. If this is a new documentation request, please link to where you have looked.

https://nvidia.github.io/cccl/cuda_parallel/#cuda.parallel.experimental.reduce_into


leofang added the doc (Documentation-related items) label Aug 28, 2024

leofang self-assigned this Aug 28, 2024

leofang linked a pull request Aug 30, 2024 that will close this issue

Ensure CuPy arrays can be used with cuda.parallel too #2335

