Velocity SIMD CPU Runtime #891

Closed
wants to merge 66 commits into from
m4rs-mt (Owner) commented Nov 24, 2022

This PR adds the announced SIMD-based CPU runtime, implemented entirely in managed code. The new Velocity accelerator supports most GPU kernels (except those using dynamic shared memory) and can use SIMD hardware acceleration on modern CPUs, so your ILGPU kernels run efficiently on the CPU through the implemented automatic vectorization engine.

It supports the following hardware configurations:

  • 128-bit X64 SSE and ARM64 Neon instructions (also supports M1 Macs; see Mac M Series Support #769)
  • 256-bit X64 AVX instructions
  • 512-bit X64 AVX2 instructions (limited feature set; some functions will fall back to 256-bit registers)

Note that this PR is a preview of the current development state that will be extended with additional features in the near future.

m4rs-mt added the "feature" label (A new feature (or feature request)) Nov 24, 2022
m4rs-mt modified the milestones: v2.0, v1.4 Nov 24, 2022
m4rs-mt force-pushed the velocity branch 2 times, most recently from 35e7f72 to 7b0d4dd, December 3, 2022 03:05
m4rs-mt force-pushed the velocity branch 6 times, most recently from 872582e to 353fbfd, January 26, 2023 21:24
m4rs-mt force-pushed the velocity branch 2 times, most recently from 298e489 to 699fb91, March 2, 2023 19:30
m4rs-mt modified the milestones: v1.4, v1.5 Mar 16, 2023
m4rs-mt modified the milestones: v1.5, v2.0 Jul 21, 2023
m4rs-mt marked this pull request as draft August 23, 2023 10:13
m4rs-mt (Owner, Author) commented Aug 24, 2023

Closing this PR in favor of a newly designed Velocity backend.
