Support for OpenCL floatN and doubleN types #18

vchuravy · 2014-03-04T05:20:25Z

It would be nice if OpenCL.jl would be able to support the halfN, floatN and doubleN types that OpenCL provides on Julias side.

Currently it is possible to initialize a double4 buffer with

    test_buff = cl.Buffer(Float64, ctx, :rw, Ndim * Mdim * 4)

But working with that data on Julias side is a hassle.

jakebolewski · 2014-03-04T19:56:03Z

I agree that this would be nice to have. We could even emulate these types nicely with an array of immutable SIMD types that follow OpenCL's api, ex.

immutable Float4
    s0::Float32
    s1::Float32
    s2::Float32
    s3::Float32
end

However, it looks like Julia in the near future will get SIMD types so I'm hesitant to implement this now when we can take advantage of built in SIMD types in the future. Another caveat to this is buffer alignment. The OpenCL spec says that it is the user's job to make sure memory is aligned correctly when using host array pointers as memory buffers (use host pointer option for buffer construction). If we want to take advantage of SIMD types on CPU OpenCL platforms this needs to be correct. If I remember correctly Julia aligns all arrays to 16 byte boundaries. This would be correct for float4 opencl buffers but not for float8 which require 32 byte alignment. Right now there is no way of controlling alignment of Julia's arrays but this might change with the introduction of SIMD types to take advantage of SIMD align/store instructions which are often 2x faster than unaligned store/loads. If the contents are not aligned correctly I think it is up to the runtime to figure out what to do. It could use unaligned load/stores for SIMD types or copy the contents of the buffer to a new buffer with the correct alignment. This is less of a concern when using GPU's as unaligned buffers are copied anyway.

nstiurca · 2015-11-16T17:13:29Z

One possible use case for OpenCL's float2/float4 is to represent complex numbers or quaternions. Likewise, uchar3/uchar4 often represent RGB/RGBA pixels. We should consider if there is a clean way to map between eg Julia's Complex <--> half2/float2/double2, or Julia's Quaternion <--> float4, or mapping to Colors.RGB, etc.

nstiurca · 2015-11-16T17:15:43Z

Also, no reason to limit to float/double support. We should also support all integer types, as well as half (Float16) if the OpenCL device supports it.

juliohm closed this as completed Oct 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for OpenCL floatN and doubleN types #18

Support for OpenCL floatN and doubleN types #18

vchuravy commented Mar 4, 2014

jakebolewski commented Mar 4, 2014

nstiurca commented Nov 16, 2015

nstiurca commented Nov 16, 2015

Support for OpenCL floatN and doubleN types #18

Support for OpenCL floatN and doubleN types #18

Comments

vchuravy commented Mar 4, 2014

jakebolewski commented Mar 4, 2014

nstiurca commented Nov 16, 2015

nstiurca commented Nov 16, 2015