ccall should pass vectorized tuples by value instead of reference #16138

ArchRobison · 2016-04-30T17:38:57Z

While writing the docs for #15244, I discovered that ccall is passing tuples of VecElement by reference instead of value. Passing by value would be more consistent with the C ABI for SIMD types, e.g. pass a NTuple{8,VecElement{Float32}} as if it were a C AVX __m256.

I'll poke around ccall to see what I need to teach it. Below is the example I was playing with.

typealias m256 NTuple{8,VecElement{Float32}}

convert(::Type{VecElement{Float32}},x::Float32) = VecElement(x)

a = m256(ntuple(i->VecElement(sin(Float32(i))),8))
b = m256(ntuple(i->VecElement(cos(Float32(i))),8))
println(typeof(a))
println(typeof(b))

function call_dist(a::m256, b::m256)
    ccall((:dist, "libdist"), m256, (m256, m256), a, b)  # C prototype is __m256 dist( __m256 a, __m256 b ) 
end

@code_llvm call_dist(a,b)

The text was updated successfully, but these errors were encountered:

ArchRobison · 2016-04-30T18:49:10Z

Looks like the fix involves changing the Classification logic to understand the special tuples. Can someone explain the purpose of the two classes fields here in abi_x86_64.cpp?

struct Classification {
    bool isMemory;
    ArgClass classes[2];

yuyichao · 2016-04-30T20:33:10Z

Right. All the abi needs to be updated.

ArchRobison · 2016-04-30T20:38:53Z

What's the difference between classes[0] and classes[1]?

ArchRobison · 2016-05-25T02:09:23Z

Closed by #16258.

JeffBezanson added the compiler:codegen Generation of LLVM IR and native code label May 3, 2016

This was referenced May 8, 2016

Document VecElement. Support call-by-value of SIMD types on 64-bit x86. #16258

Merged

VecElement doesn't work with 1-tuples #16287

Closed

ArchRobison closed this as completed May 25, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ccall should pass vectorized tuples by value instead of reference #16138

ccall should pass vectorized tuples by value instead of reference #16138

ArchRobison commented Apr 30, 2016

ArchRobison commented Apr 30, 2016

yuyichao commented Apr 30, 2016 •

edited

Loading

ArchRobison commented Apr 30, 2016

ArchRobison commented May 25, 2016

ccall should pass vectorized tuples by value instead of reference #16138

ccall should pass vectorized tuples by value instead of reference #16138

Comments

ArchRobison commented Apr 30, 2016

ArchRobison commented Apr 30, 2016

yuyichao commented Apr 30, 2016 • edited Loading

ArchRobison commented Apr 30, 2016

ArchRobison commented May 25, 2016

yuyichao commented Apr 30, 2016 •

edited

Loading