codegen optimizations for unions #21279

vtjnash · 2017-04-04T22:46:11Z

This seems to generate code that llvm is better at optimizing. Will definitely need to check nanosoldier though to see if this seems to trigger any regressions elsewhere.

Keno · 2017-04-04T22:50:57Z

Instcombine is quite expensive. Can you get away with instsimplify for your purpose?

vtjnash · 2017-04-05T01:15:06Z

Ah, I was assuming it was cheap since we call it very often. I don't actually need it.

ararslan · 2017-04-05T05:27:35Z

Looks like this broke the cfunction round-trip test on x86-64

SROA likes this form better Also, since many of these loop variables are loop-dependent, it helps to run the loop structure analysis passes twice

vtjnash · 2017-04-05T06:38:43Z

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2017-04-05T09:38:47Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

martinholters · 2017-04-05T10:59:04Z

Are the "simd" regressions real? I don't remember those as particularly noisy.

ararslan · 2017-04-05T16:44:31Z

We can always run again to see if the results are the same.

@nanosoldier runbenchmarks(ALL, vs=":master")

vtjnash · 2017-04-05T17:12:13Z

They seem to be "quasi-real", but the reason is hilarious. With the extra passes, LLVM is able to notice that the loop is computing the number 0 and can elid all of the SIMD computational work and replace it with a simple memset. So we end up profiling the quality of the system memset function 😆

ararslan · 2017-04-05T17:20:09Z

That is pretty amusing. Why would that cause a regression though? The system's memset could actually be slower than the full SIMD computations?

vtjnash · 2017-04-05T17:40:46Z

The system's memset could actually be slower than the full SIMD computations?

Yes. It is entirely reliant on the glibc version supporting the max vector width for the machine. The bottleneck is waiting for memory, making the SIMD calculations essentially "free".

nanosoldier · 2017-04-05T19:45:13Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

vtjnash added compiler:codegen Generation of LLVM IR and native code performance Must go faster labels Apr 4, 2017

vtjnash requested a review from Keno April 4, 2017 22:46

vtjnash force-pushed the jn/codegen-opt-unions branch from 900c69b to 42bcd70 Compare April 5, 2017 01:33

vtjnash added 2 commits April 5, 2017 02:28

help union-alloca variables emit better native code

9d641e4

SROA likes this form better Also, since many of these loop variables are loop-dependent, it helps to run the loop structure analysis passes twice

disable llvm output from PR #21276 test

58dd828

vtjnash force-pushed the jn/codegen-opt-unions branch from 42bcd70 to 58dd828 Compare April 5, 2017 06:31

ararslan mentioned this pull request Apr 5, 2017

codegen emit constant data as llvm constants #21277

Merged

vtjnash merged commit 11682d8 into master Apr 6, 2017

vtjnash deleted the jn/codegen-opt-unions branch April 6, 2017 14:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

codegen optimizations for unions #21279

codegen optimizations for unions #21279

vtjnash commented Apr 4, 2017

Keno commented Apr 4, 2017

vtjnash commented Apr 5, 2017

ararslan commented Apr 5, 2017

vtjnash commented Apr 5, 2017

nanosoldier commented Apr 5, 2017

martinholters commented Apr 5, 2017

ararslan commented Apr 5, 2017

vtjnash commented Apr 5, 2017

ararslan commented Apr 5, 2017

vtjnash commented Apr 5, 2017

nanosoldier commented Apr 5, 2017

codegen optimizations for unions #21279

codegen optimizations for unions #21279

Conversation

vtjnash commented Apr 4, 2017

Keno commented Apr 4, 2017

vtjnash commented Apr 5, 2017

ararslan commented Apr 5, 2017

vtjnash commented Apr 5, 2017

nanosoldier commented Apr 5, 2017

martinholters commented Apr 5, 2017

ararslan commented Apr 5, 2017

vtjnash commented Apr 5, 2017

ararslan commented Apr 5, 2017

vtjnash commented Apr 5, 2017

nanosoldier commented Apr 5, 2017