Pull recent NVIDIA changes #127

ghost · 2018-02-16T23:58:05Z

No description provided.

Apply nodepchk to a loop/routine as a pragma nodepchk for simd clause or simd construct. Since the host function has information about nodepcheck, need to propagate the info to outlined function when it was created because we lose sptr of host function when we expand outlined function.

Pull 2018-02-08T12-20 Recent NVIDIA Changes

Disable 8-byte argument modifications in the runtime routine table for nonstandard intrinsics idate and jdate.

Test that a pointer to a contiguous array section is passed correctly for an assumed shape dummy argument.

If SDSC is not set, get the descriptor. If it is set, do not overwrite the descriptor.

Pull 2018-02-09T11-21 Recent NVIDIA Changes

Also add oop719 f90_correct test to Flang.

Pull 2018-02-09T14-19 Recent NVIDIA Changes

Add metadata to the loads and stores to tell LLVM that it is ok to vectorize the loop and ignore dependence analysis. (See !llvm.mem.parallel_loop_access for more info.) This particular annotation will be disabled if we detect that llvect has already vectorized the loop body.

## ... Target knl-64 instead of: ## ... Target knightslanding-64 "Unhid" the target processor "-tp knl", so now it is included in the list of supported target processors that is generated.

If pod is true then pod2 won't be set. But it is read when pos != pass_pos. Initialize pod2 so that it is well-defined when it is read.

Consider invocations like foo() Character (max_charlen), Intent (Out) :: token(max_data_vals) token(1:n) = adjustl(token(1:n) bar() Character (*), Intent (In) :: buffin buffall = adjustl(buffin) The former gets converted to a subroutine call form, whereas the latter remains as a func, the hash table would have an entry for adjustl as void type, whereas the second call would be expecting a return type of INT (actual result is passed as hidden argument). This conflict type leads to an error. The conversion to a subroutine call was enabled to prevent memory allocation for a temp array used to store the return value of the intrinsic, but the temp would never be used. This change uses a single temp (in place of an array) to store the return value of the call, thereby eliminating the need to convert it to a subroutine call. This change also fixes similar handling of 'adjustr' and 'trim'.

_mkshft sets the return value of type DT_INT which is incorrect. Use mk_prototype to correctly set the prototype of the function.

Fix typo, too.

Pull 2018-02-13T07-42 Recent NVIDIA Changes

gklimowicz and others added 16 commits February 8, 2018 12:05

Merge pull request #393 from ThePortlandGroup/nv_stage

51497d9

Pull 2018-02-08T12-20 Recent NVIDIA Changes

Fix non-standard intrinsic IDATE asserts at compile-time

e2d5bb5

Disable 8-byte argument modifications in the runtime routine table for nonstandard intrinsics idate and jdate.

Add test pp58 to f90_correct

d0d6570

Test that a pointer to a contiguous array section is passed correctly for an assumed shape dummy argument.

Fix to support fox_2 application

dae9953

If SDSC is not set, get the descriptor. If it is set, do not overwrite the descriptor.

Merge pull request #395 from ThePortlandGroup/nv_stage

f4b69c1

Pull 2018-02-09T11-21 Recent NVIDIA Changes

Remove incorrect code in handling assignment of allocatables members

56770b8

Also add oop719 f90_correct test to Flang.

Merge pull request #396 from ThePortlandGroup/nv_stage

c8988c1

Pull 2018-02-09T14-19 Recent NVIDIA Changes

Change references to "knightslanding" to shorter "knl"

6854244

## ... Target knl-64 instead of: ## ... Target knightslanding-64 "Unhid" the target processor "-tp knl", so now it is included in the list of supported target processors that is generated.

Fix use of uninitialized variable pod2

3b7fc76

If pod is true then pod2 won't be set. But it is read when pos != pass_pos. Initialize pod2 so that it is well-defined when it is read.

Fix return type of "ftn_i_kishft"

f38bbdf

_mkshft sets the return value of type DT_INT which is incorrect. Use mk_prototype to correctly set the prototype of the function.

Fix 32-bit implementation of LEADZ intrinsic on AMD targets

6153c56

Cleanup: Remove unused functions and fields in LLVM bridge

63b004a

Fix typo, too.

Merge pull request #400 from ThePortlandGroup/nv_stage

eb3fa40

Pull 2018-02-13T07-42 Recent NVIDIA Changes

ghost merged commit e160efe into isuruf:windows Feb 17, 2018

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pull recent NVIDIA changes #127

Pull recent NVIDIA changes #127

ghost commented Feb 16, 2018

Pull recent NVIDIA changes #127

Pull recent NVIDIA changes #127

Conversation

ghost commented Feb 16, 2018