Replies: 31 comments 88 replies
-
Ah this was a newer method, not available in the release. |
Beta Was this translation helpful? Give feedback.
-
Your results on a cold run: time ./bin/abouchez /tmp/measurements-1_000_000_000.txt -t 30
{...}
real 0m9.134s
user 1m17.539s
sys 0m14.066s |
Beta Was this translation helpful? Give feedback.
-
Running the full automation:
|
Beta Was this translation helpful? Give feedback.
-
Nice! I am a bit worried about the Please try with 16 and 24 threads too. |
Beta Was this translation helpful? Give feedback.
-
Did you try with the Just curious, if thread affinity helps here. |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), In the spirit of cross platform, at least, would it be too much to ask if you can do the If it's too much I guess we can skip that. Would really appreciate it if you could make the effort. Cheers, |
Beta Was this translation helpful? Give feedback.
-
There are already asm version of To learn about cross-platform and cross-compiler, look at the mORMot source code, it is a much better place. |
Beta Was this translation helpful? Give feedback.
-
I tried to clean the comments about the asm, and updated the README thanks to your feedback. |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), Does this matter?
If so, where should I start investigating? Cheers, |
Beta Was this translation helpful? Give feedback.
-
This is an Lazarus parsing bug only. So, as I wrote above, I think this is a false positive. (to be honest, I never use lazbuild, I do prefer a manual .sh file directly calling fpc with the proper switches) |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), Latest PR. 16 Threads
24 threads
32 Threads
Cheers, |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), Was trying to run your entry with the 400 stations file( 8 threads ), thanks to @paweld, and I got this: $ ./bin/abouchez /tmp/measurements-400-1_000_000_000.txt -t 8
Fatal exception EAccessViolation raised with message
Access violation
Program will now abort Cheers, |
Beta Was this translation helpful? Give feedback.
-
Latest patch results. 3rd run with $ time ./bin/abouchez /tmp/measurements-400-1_000_000_000.txt -t 8
{...}
real 0m2.746s
user 0m16.317s
sys 0m1.391s 4th or 5th run ( I forget ) for the 400/8 with affinity: time ./bin/abouchez /tmp/measurements-400-1_000_000_000.txt -a -t 8
{...}
real 0m2.698s
user 0m16.381s
sys 0m1.315s and $ ./run_all.sh abouchez
******** Run ********
===== Arnaud Bouchez ======
-- SSD --
Benchmark 1: abouchez
Time (mean ± σ): 1.732 s ± 0.031 s [User: 33.319 s, System: 1.956 s]
Range (min … max): 1.700 s … 1.805 s 10 runs
=========== |
Beta Was this translation helpful? Give feedback.
-
I have added #56 With updated README file. Thanks a lot! |
Beta Was this translation helpful? Give feedback.
-
About the new
And see the results. |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), Hope you don't mind me doing a bit of formatting clean-up on your If you do, please chastise me a ton and I'll, maybe, promise not to do it again 😉 🫂 Cheers, |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), Your latest
Cheers, |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), I think I messed up!! And I must give you my most profound apologies!!! I think I forgot to compile you last PR or something on those lines, cuz now that I did and performed the runs, you're back at 1.9s and not 1.7s. I think this means that the Again, really sorry to have messed this up!! The weekly Saturday results are up. Will update them when you make a new PR reverting what you did lastly. Cheers, |
Beta Was this translation helpful? Give feedback.
-
It is weird that the last run did return numbers higher that what you found here. Could you try with |
Beta Was this translation helpful? Give feedback.
-
Another question: why some results are using lazarus-3.99, fpc-3.3.1 and some lazarus-3.0, fpc-3.2.2 ? My guess is that fpc-3.3.1 should generate better code, so it seems a bit unfair. |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), Latest $ ./run_all.sh abouchez
******** Run ********
===== Arnaud Bouchez ======
-- SSD --
Benchmark 1: abouchez
Time (mean ± σ): 1.766 s ± 0.031 s [User: 33.130 s, System: 2.374 s]
Range (min … max): 1.737 s … 1.849 s 10 runs |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse) I just got a feed back from Craig Chapman, on his 32 cores, 64 threads and 128GiB of
That blows the JavaBros completely out of the water !!! Cheers, |
Beta Was this translation helpful? Give feedback.
-
Very interesting video. |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), Last $ ./run_all.sh abouchez
******** Run ********
===== Arnaud Bouchez ======
-- SSD --
Benchmark 1: abouchez
Time (mean ± σ): 1.644 s ± 0.009 s [User: 30.553 s, System: 1.932 s]
Range (min … max): 1.631 s … 1.656 s 10 runs |
Beta Was this translation helpful? Give feedback.
-
I have committed yesterday some new versions, done during the weekend. On my computer they are noticeably faster. Please try with and without To be accurate, here are the actual numbers on my local machine, with the latest current version of our code:
Which are pretty good numbers, especially thanks to our If we force the full station name lookup (with the
The "pure pascal" version of the code is not so far away:
Interesting reading: @paweld @gcarreno Last but not least... |
Beta Was this translation helpful? Give feedback.
-
Hello Arnaud @synopse , If you have the time (and aside the great links you've posted in your readme), any recommended readings for such bit-manipulations and Delphi asm for beginners? |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), I need to run these:
On a quiet system. But I promise to give you proper results before next Saturday. Cheers, |
Beta Was this translation helpful? Give feedback.
-
Hello Arnaud, |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse), On a not so quiet system.
$ hyperfine -w 1 -r 5 -N -n 'abouchez-full-a' --export-json 'results/abouchez-full-a.json' './bin/abouchez /tmp/measurements-1_000_000_000.txt -a'
Benchmark 1: abouchez-full-a
Time (mean ± σ): 1.358 s ± 0.026 s [User: 18.044 s, System: 2.442 s]
Range (min … max): 1.328 s … 1.399 s 5 runs |
Beta Was this translation helpful? Give feedback.
-
Hello Arnaud (@synopse), |
Beta Was this translation helpful? Give feedback.
-
Hey Arnaud(@synopse),
I'm getting an error:
I installed the
tar.gz
from Release v2.2.How can I solve this?
Cheers,
Gus
Beta Was this translation helpful? Give feedback.
All reactions