Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimizations for broadcast #299

Merged
merged 5 commits into from
Jul 17, 2020
Merged

Optimizations for broadcast #299

merged 5 commits into from
Jul 17, 2020

Conversation

maleadt
Copy link
Member

@maleadt maleadt commented Jul 17, 2020

Fixes #298

@maleadt maleadt added cuda kernels Stuff about writing CUDA kernels. performance How fast can we go? labels Jul 17, 2020
@codecov
Copy link

codecov bot commented Jul 17, 2020

Codecov Report

Merging #299 into master will decrease coverage by 0.00%.
The diff coverage is 91.66%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #299      +/-   ##
==========================================
- Coverage   77.71%   77.70%   -0.01%     
==========================================
  Files         155      155              
  Lines        8874     8873       -1     
==========================================
- Hits         6896     6895       -1     
  Misses       1978     1978              
Impacted Files Coverage Δ
src/gpuarrays.jl 48.00% <83.33%> (-2.00%) ⬇️
lib/cudadrv/execution.jl 100.00% <100.00%> (ø)
lib/cudadrv/occupancy.jl 76.00% <100.00%> (ø)
src/compiler/execution.jl 91.59% <100.00%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8ce07c8...81290a0. Read the comment docs.

@maleadt maleadt merged commit f5d880f into master Jul 17, 2020
@maleadt maleadt deleted the tb/grid_stride branch July 17, 2020 10:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cuda kernels Stuff about writing CUDA kernels. performance How fast can we go?
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Performance: bias add
1 participant