
parallel chunking routine and Python 2GB bottleneck for large statsmodels OLS #154

Open

turbach opened this issue Aug 2, 2019 · 0 comments

turbach (Collaborator) commented Aug 2, 2019

Appears to be a monkey-jar problem: the chunker ships jobs out to the pool that fit through Python's 2GB pickling bottleneck on the way in, but the statsmodels OLS fit inflates each chunk into a much larger results object, and the returns are too big to make it back through the bottleneck.
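The bottleneck is the size of the *pickled* return value, since multiprocessing serializes worker results before piping them back (older Pythons cap a single pipe payload at roughly 2GB). A minimal sketch of checking whether an object would squeeze back through, using only the stdlib (`TWO_GB` and the helper names are assumptions for illustration, not part of the codebase):

```python
import pickle

# Approximate single-payload ceiling for multiprocessing pipe returns
# on older Python versions (an assumption used for illustration).
TWO_GB = 2 * 1024 ** 3

def pickled_size(obj):
    """Serialized size in bytes, i.e. what has to fit back through the pipe."""
    return len(pickle.dumps(obj, protocol=pickle.HIGHEST_PROTOCOL))

def fits_through_pipe(obj, limit=TWO_GB):
    """True if the pickled object stays under the bottleneck."""
    return pickled_size(obj) < limit
```

Measuring the fitted results object this way, rather than the input chunk, is what exposes the inflation: the input can pass the check while its fitted return fails it.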

Possible workaround: run a small test fit to estimate the pickled size of the OLS return value, chunk so that each return stays under 2GB, and fall back to serial execution when no chunk size is small enough.
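The workaround above could be sketched roughly as follows. This is a hypothetical illustration, not the project's code: `fit_chunk` stands in for the real statsmodels OLS fit on one chunk, and `plan_chunks` probe-fits a few rows, extrapolates pickled-return bytes per row, and either sizes the chunks to fit under the limit or signals a serial fallback with `None`.

```python
import pickle

TWO_GB = 2 * 1024 ** 3

def fit_chunk(rows):
    # Hypothetical stand-in for the statsmodels OLS fit on one chunk;
    # the real RegressionResults object is far larger than the input rows.
    return {"rows": list(rows), "inflated": [0.0] * (len(rows) * 4)}

def plan_chunks(data, limit=TWO_GB, probe_rows=8):
    """Probe-fit a small chunk, extrapolate pickled return size per row,
    and split data into chunks whose returns fit through the bottleneck.
    Returns a list of chunks, or None to signal a serial fallback."""
    probe = data[:probe_rows]
    per_row = len(pickle.dumps(fit_chunk(probe))) / max(len(probe), 1)
    rows_per_chunk = int(limit // max(per_row, 1))
    if rows_per_chunk < 1:
        return None  # even one row's result is too big: run serially instead
    return [data[i:i + rows_per_chunk]
            for i in range(0, len(data), rows_per_chunk)]
```

The per-row extrapolation is only a rough estimate, so in practice a safety margin below the hard 2GB ceiling would be prudent before shipping chunks to the pool.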
