
add nvptx vendor intrinsics and initial no_std support #191

Merged 3 commits into rust-lang:master on Nov 17, 2017

Conversation

gnzlbg
Contributor

@gnzlbg commented Nov 16, 2017

stdsimd currently does not compile as-is with #![no_std].

This PR adds a "nostd" crate feature that enforces #![no_std], and a CI test to cover this.

The only dependency stdsimd has on std is std::os::raw::c_void, which I've worked around by copy-pasting c_void into the x86 module.
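The workaround might look like the following sketch, mirroring how the standard library itself defines c_void (the variant names here are illustrative, not necessarily the ones used in the PR):

```rust
// Local stand-in for std::os::raw::c_void so the crate builds with
// #![no_std]. Like the standard library's definition, it is a one-byte
// enum whose variants are hidden from the docs, so it is only meant
// to be used behind raw pointers, never constructed by users.
#[repr(u8)]
pub enum c_void {
    #[doc(hidden)]
    __variant1,
    #[doc(hidden)]
    __variant2,
}
```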

Once we merge ARM run-time feature detection, it will also gain a dependency on std::fs. The plan is then to split stdsimd into a core component and a std component.


The nvptx intrinsics are always available independently of the architecture, but one can only call them from an nvptx kernel.
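For illustration, a kernel written against these intrinsics might look roughly like the sketch below. The module path and intrinsic names are assumptions based on this PR, and the code only compiles on nightly Rust for an nvptx target with the unstable abi_ptx feature, so it cannot run on the host:

```rust
#![feature(abi_ptx)]
#![no_std]

// Hypothetical: exact paths/names are illustrative.
use stdsimd::vendor::{_block_dim_x, _block_idx_x, _thread_idx_x};

#[no_mangle]
pub unsafe extern "ptx-kernel" fn fill(dst: *mut u32, value: u32) {
    // Global thread index; only meaningful when launched as a CUDA kernel.
    let i = (_block_idx_x() * _block_dim_x() + _thread_idx_x()) as isize;
    *dst.offset(i) = value;
}
```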

@BurntSushi
Member

This looks OK to me. r? @alexcrichton

@gnzlbg
Contributor Author

gnzlbg commented Nov 16, 2017

@alexcrichton I am not suggesting that we should stabilize nvptx intrinsics any time soon, but...

Currently compiling CUDA code using Rust requires three unstable features: abi_ptx, untagged_unions (isn't this stable already?), and platform_intrinsics.

So... if we were to ever stabilize this... that basically cuts it down to abi_ptx.

In the meantime, this allows those interested to implement and use these intrinsics on unstable Rust without having to use platform_intrinsics or hack into the compiler.

@japaric I have tested this by switching the dependency of your nvptx project's kernel crate to use stdsimd instead of nvptx-builtins, but if we were to ever expose these here for "serious" use, we would need to figure out a way to use cross to at least test that kernels compile properly.

@japaric
Member

japaric commented Nov 16, 2017

> we would need to figure out a way to use cross to at least test that kernels compile properly.

we don't really need cross to use the nvptx targets though. Emitting NVPTX doesn't require any system library or tool, and there's no binary distribution of core for these targets, so Xargo is enough -- provided that you have the target specification file around, since these targets are not built-in.

(semi on-topic: I'd like to see cargo build Just Work and directly generate NVPTX as the output artifact, but there's no such thing as a standalone NVPTX linker (there's no NVPTX object format either) -- you can use clang as a linker to link bitcode, but that's too big of a dependency. What we could do is something like what's being done for the new wasm target: set obj-is-bitcode to true, force LTO, and have LLVM do the linking under the hood.)

@gnzlbg
Contributor Author

gnzlbg commented Nov 16, 2017

> we don't really need cross to use the nvptx targets though.

Yes, the problem is not compiling, but running the tests.

So what I was thinking is using cross just for running the tests. Instead of using qemu-user we would use, e.g., ocelot, to run the PTX kernels on x86. I haven't used ocelot myself though, so I don't know how well this would work.


> (semi on-topic: I'd like to see cargo build to Just Work and directly generate NVPTX as the output artifact but there's no such thing as a standalone NVPTX linker (there's no NVPTX object format either)

Even less on-topic: I'd like to just be able to stamp a #[kernel] attribute on a Rust function or closure, compile my code with cargo build --kernel-targets="x86-avx2,nvptx-sm30", and be able to call the kernel transparently on the CPU and also launch it as a CUDA kernel.

But until then, a #[kernel_nvptx] attribute just for CUDA kernels would already be sweet enough.

@mattico

mattico commented Nov 16, 2017

Nit: Cargo features should be additive (couldn't find a better link, but Steve has mentioned this), so it'd be better to have a std feature that's enabled by default.
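An additive std feature would look roughly like this in Cargo.toml (a sketch of the pattern; whether to actually set default = ["std"] was still an open question on this PR):

```toml
[features]
# `std` is additive: on by default once `default = ["std"]` is set,
# and no_std users opt out with `default-features = false`.
default = ["std"]
std = []
```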

@gnzlbg
Contributor Author

gnzlbg commented Nov 16, 2017

@mattico thanks, looks better now :)

@gnzlbg merged commit 231405f into rust-lang:master Nov 17, 2017
@@ -32,3 +32,4 @@ cupid = "0.3"

[features]
strict = []
std = []

Should there be default = ["std"]?

Contributor Author


I thought about this, and I think we should settle it before the next release. I've opened #193.

@@ -128,6 +128,10 @@
cast_possible_truncation, cast_precision_loss,
shadow_reuse, cyclomatic_complexity, similar_names,
doc_markdown, many_single_char_names))]
#![cfg_attr(not(feature = "std"), no_std)]
Member


Typically this is an unconditional #![no_std] if a crate is no_std compatible, but I'll send a PR for this.

@alexcrichton
Member

Is there a URL at which we can point the documentation to as well? Something like Intel's source of truth for these intrinsic definitions?

@gnzlbg
Contributor Author

gnzlbg commented Nov 19, 2017

@alexcrichton probably the CUDA API docs are the closest thing to that. The problem is that CUDA considers these to be part of the CUDA language; we only see them as intrinsics here because CUDA Rust is not a thing.

So I am going to link those there and also link the docs of LLVM's NVPTX backend, but I will mention that these intrinsics are experimental and that if you don't know what they do, you probably shouldn't be using them.

I still think this is pretty exciting. If we get @japaric's https://github.com/japaric/cuda and nvptx libraries to work on top of this, we might one day offer CUDA on stable Rust "as a library" by just stabilizing these.
