A faster copyto_unaliased! #41434
Closed
Commits on Jul 1, 2021
Exclude the slow branch in Broadcast's copyto!
julia> a = randn(50,50); b = randn(50,50); # 2D case

# before
julia> @btime $a[1:end,1:end] .= $b;
  4.900 μs (0 allocations: 0 bytes)
julia> @btime $a[:,:] .= $b;
  721.642 ns (0 allocations: 0 bytes)

# after
julia> @btime $a[1:end,1:end] .= $b;
  355.238 ns (0 allocations: 0 bytes)
julia> @btime $a[:,:] .= $b;
  351.643 ns (0 allocations: 0 bytes)

julia> a = randn(50*50); b = randn(50*50); # 1D case

# before
julia> @btime $a .= $b;
  406.566 ns (0 allocations: 0 bytes)

# after
julia> @btime $a .= $b;
  270.607 ns (0 allocations: 0 bytes)
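The two 2D benchmarks differ only in how the destination is indexed. As a minimal sketch (my reading, not something the commit message states), the views behind `a[1:end,1:end]` and `a[:,:]` can be inspected with the public `IndexStyle` trait. On recent Julia releases the range-indexed view is expected to report `IndexCartesian()` and the colon-indexed view `IndexLinear()`, which would explain why the two forms took different copy paths before this change. The names `v_range` and `v_colon` are illustrative only.

```julia
# Sketch: compare the two destination views used in the 2D benchmark.
a = randn(50, 50)
b = randn(50, 50)

# `a[1:end, 1:end] .= b` writes into a view indexed by ranges,
# while `a[:, :] .= b` writes into a view indexed by colons.
v_range = view(a, 1:size(a, 1), 1:size(a, 2))
v_colon = view(a, :, :)

# Print the preferred indexing style each view advertises.
println(IndexStyle(v_range))
println(IndexStyle(v_colon))

# Whatever path is taken internally, both forms must give the same result.
a[1:end, 1:end] .= b
@assert a == b
fill!(a, 0.0)
a[:, :] .= b
@assert a == b
```

Timing the two forms as in the commit message additionally requires the BenchmarkTools package, which this sketch deliberately leaves out.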
Commits on Jul 2, 2021
Commits on Jul 5, 2021
Load SimdLoop earlier to avoid bootstrapping problems (I hope this works)
Merge pull request #2 from N5N3/revert-1-patch__
Focus on copyto_unaliased!; load SimdLoop earlier.
Add a branch to speed up linear-to-Cartesian copies when both arrays have the same length
Use a manually expanded SIMD kernel instead of the "illegal" `@simd`. Linear-to-Cartesian copy is even faster when length(dest) > length(src)
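To make the idea of a linear-to-Cartesian copy concrete, here is a minimal sketch under stated assumptions. It is not the PR's kernel: `copy_linear_to_cartesian!` is a hypothetical helper, it only handles a matrix destination and a vector source of equal length, and it takes "manually expanded" to mean writing the outer loop over columns and the inner loop over the first dimension by hand, so that the inner loop is a simple range loop that `@simd` can legally annotate.

```julia
# Hypothetical helper (not from the PR): copy a linearly indexed vector
# into a matrix that is addressed with Cartesian indices.
# Assumes `dest` and `src` do not alias, since `@simd` asserts that
# loop iterations are independent.
function copy_linear_to_cartesian!(dest::AbstractMatrix, src::AbstractVector)
    length(src) == length(dest) ||
        throw(DimensionMismatch("dest and src must have the same length"))
    rows, cols = axes(dest)
    r0 = first(rows)
    k = firstindex(src)          # linear position of the current column in src
    for j in cols
        # Inner loop over the first dimension: contiguous in src and a
        # plain range loop, which is the form `@simd` is designed for.
        @inbounds @simd for i in rows
            dest[i, j] = src[k + (i - r0)]
        end
        k += length(rows)
    end
    return dest
end

# Usage: fill a range-indexed (Cartesian) view from a flat vector.
a = zeros(6, 5)
src = collect(1.0:12.0)
copy_linear_to_cartesian!(view(a, 2:5, 2:4), src)
```

Splitting the loops this way keeps the source index a simple running offset per column, so no per-element conversion from a linear to a Cartesian index is needed.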
Commits on Jul 6, 2021
Commits on Jul 7, 2021
Fix for 0-d Cartesian AbstractArray. This version should be fast enough when the first dimension of the Cartesian array is larger than 16 (eltype Float64).
Merge pull request #6 from N5N3/patch2
Fix for 0-d Cartesian AbstractArray. This version should be fast enough when the first dimension of the Cartesian array is larger than 16 (eltype Float64).
Commits on Jul 9, 2021
Fix for other IndexStyle. Only use the manually expanded version when the size of the first dimension is >= 16
Commits on Jul 10, 2021
Merge pull request #8 from N5N3/patch2
Fix whitespace and typos; add tests.