Enable some NEON optimizations on ARM32 that we only had on ARM64 before #18450

Merged
hrydgard merged 2 commits into master from neon-arm32
Nov 27, 2023

Conversation

hrydgard
Owner

Most of these ARM64 NEON paths either just worked on ARM32 or were easy to translate, so we might as well turn them on. This will usually be a minor win.

Additionally, it turns out that vld3q_f32 is perfect for implementing ConvertMatrix4x3To3x4Transposed.
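To illustrate why vld3q_f32 fits here, the following is a minimal sketch, not the code from this PR. It assumes the 4x3 input is stored as 4 rows of 3 floats and the output as 3 rows of 4 floats; the function name Transpose4x3To3x4 is hypothetical. vld3q_f32 loads 12 consecutive floats and de-interleaves them with a stride of 3, which under that layout assumption yields the three transposed rows directly. A scalar fallback is included so the sketch also builds without NEON.

```cpp
#include <cassert>
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#endif

// Hypothetical stand-in for illustration, not the project's actual function.
// in:  12 floats, a 4x3 matrix stored as 4 rows of 3 elements (assumed layout).
// out: 12 floats, the transpose stored as 3 rows of 4 elements.
static void Transpose4x3To3x4(float *out, const float *in) {
    assert(out != in);  // in-place use is not supported by this sketch
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    // vld3q_f32 reads in[0..11] and splits them by index modulo 3:
    // val[k] = { in[k], in[k + 3], in[k + 6], in[k + 9] },
    // which is column k of the input, i.e. row k of the transpose.
    float32x4x3_t cols = vld3q_f32(in);
    vst1q_f32(out + 0, cols.val[0]);
    vst1q_f32(out + 4, cols.val[1]);
    vst1q_f32(out + 8, cols.val[2]);
#else
    // Scalar fallback with the same result on targets without NEON.
    for (int col = 0; col < 3; col++) {
        for (int row = 0; row < 4; row++) {
            out[col * 4 + row] = in[row * 3 + col];
        }
    }
#endif
}
```

With an input of 0 through 11, the first output row holds elements 0, 3, 6, 9 of the input, the second 1, 4, 7, 10, and the third 2, 5, 8, 11. The single structured load replaces what would otherwise be a sequence of loads and shuffles.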

hrydgard added the Code Cleanup label (Cleanup to make future work easier. Needs to be done sometimes.) Nov 27, 2023
hrydgard added this to the v1.17.0 milestone Nov 27, 2023
hrydgard merged commit 8ad0ef6 into master Nov 27, 2023
18 checks passed
hrydgard deleted the neon-arm32 branch November 27, 2023 23:28