which blas #85
Comments
Thank you for sharing. These are interesting. It is unclear to me what "accelerate" refers to in the benchmarks. Maybe you can elaborate on what they wrote. Finally, isuruf recommended:
Veclib is from Accelerate, while vortex and zen are from OpenBLAS. However, this is beside the point for now: we would need to show something about PyTorch in particular before moving forward. This is just additional info, as I am trying to better understand this :) Again, my inspiration was my work on mxnet, where they seemed to have a clear hierarchy. I would like to understand whether that is just for packaging purposes (avoiding shipping yet another BLAS) or for actual performance reasons. One thing is really clear, though: MKL is the choice wherever possible, so you are making the perfect call here by forcing everything to use MKL. (Obviously MKL isn't available on osx-arm64, and that is the only narrow case we would be changing.)
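For reference, here is one way to check which BLAS a given PyTorch build is actually linked against. A minimal sketch, assuming a reasonably recent torch; `torch.__config__.show()` dumps the compile-time configuration, including the `BLAS_INFO` entry:

```python
import torch

# Print the compile-time build configuration; the BLAS_INFO line
# reports which BLAS the package was built against.
print(torch.__config__.show())

# True if this build can dispatch to MKL at runtime.
print(torch.backends.mkl.is_available())
```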
Just to expand on @isuruf's comment, I believe I understand where he is coming from on two fronts:
Regarding the first point above, this will require a lot more work and may not be possible after all: I think pytorch requires setting a BLAS explicitly, and it might not work if you just switch back and forth. Again, see #82. So our option would likely be to build with explicit CMake flag settings, I believe. Regarding the second point, that is an important and valid reason to avoid Accelerate altogether.
I'm not a BLAS expert, but compiling with netlib allows for the switch. If you compile against the underlying library directly, you cannot switch; typically you need a bit of trickery. For what it's worth, pytorch has been open to PRs that allow customization like this.
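To make the "switch" concrete: with a netlib-style build, the actual BLAS implementation is resolved when the library is loaded, so you can verify at runtime which one was picked up. A minimal sketch, assuming `numpy` and `threadpoolctl` are installed (threadpoolctl is not part of this thread, just a convenient inspection tool):

```python
import numpy  # noqa: F401 -- importing numpy loads its BLAS backend
from threadpoolctl import threadpool_info

# threadpool_info() lists the native libraries actually loaded;
# internal_api will be e.g. "mkl" or "openblas", depending on which
# implementation the generic BLAS interface resolved to.
for pool in threadpool_info():
    if pool.get("user_api") == "blas":
        print(pool["internal_api"], pool.get("filepath"))
```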
Good call. I will look around and open an issue upstream to ask about it.
Closing this, as we have two other issues addressing exactly the same problem.
I'm going to refer to the comment made in #83 (comment):
I would suggest that, for this kind of optimization, you demonstrate some real gains to be had by moving to Accelerate. Maybe you can provide a benchmark showing the improvement? It is pretty easy to build things yourself and upload them to your own channel; I strongly suggest that instead of waiting too long on the community for your own work anyway.
Once we've established the gains, we can work on integrating it here.
Originally posted by @hmaarrfk in #84 (comment)
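A benchmark of the kind suggested above could be as simple as timing a large matrix multiply under each BLAS build. A rough sketch, not a rigorous methodology; the matrix size (4096) and repetition count (10) are arbitrary choices:

```python
import time
import torch

n = 4096
a = torch.randn(n, n)
b = torch.randn(n, n)

# Warm up so one-time initialization is not counted.
for _ in range(3):
    a @ b

reps = 10
start = time.perf_counter()
for _ in range(reps):
    a @ b
elapsed = (time.perf_counter() - start) / reps

# A matrix multiply costs 2 * n**3 floating-point operations.
print(f"{n}x{n} gemm: {elapsed * 1e3:.1f} ms/iter, "
      f"{2 * n**3 / elapsed / 1e9:.1f} GFLOP/s")
```

Running the same script against builds linked to OpenBLAS, MKL, and Accelerate would give a first-order comparison.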