Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA only starts single thread? #165

Open
grahamreeds opened this issue May 16, 2022 · 6 comments
Open

CUDA only starts single thread? #165

grahamreeds opened this issue May 16, 2022 · 6 comments

Comments

@grahamreeds
Copy link

Maybe I am reading this wrong but when I start xmrig with just CUDA it only starts a single thread?

D:\github.com\xmrig\build\Release>xmrig
 * ABOUT        XMRig/6.17.0 MSVC/2019
 * LIBS         libuv/1.38.0 OpenSSL/1.1.1g hwloc/2.7.0
 * HUGE PAGES   permission granted
 * 1GB PAGES    unavailable
 * CPU          Intel(R) Xeon(R) CPU E3-1505M v5 @ 2.80GHz (1) 64-bit AES VM
                L2:1.0 MB L3:8.0 MB 4C/8T NUMA:1
 * MEMORY       7.1/47.9 GB (15%)
                Top-Slot 1(left): 8 GB DDR4 @ 2133 MHz HMA81GS6CJR8N-VK
                Top-Slot 2(right): 8 GB DDR4 @ 2133 MHz HMA81GS6DJR8N-XN
                Bottom-Slot 1(left): 16 GB DDR4 @ 2133 MHz M474A2K43BB1-CRC
                Bottom-Slot 2(right): 16 GB DDR4 @ 2133 MHz M474A2K43BB1-CRC
 * MOTHERBOARD  HP - HP ZBook 15 G3
 * DONATE       1%
 * ASSEMBLY     auto:intel
 * POOL #1      gulf.moneroocean.stream:10128 algo auto
 * POOL #2      pool.minexmr.com:4444 algo auto
 * COMMANDS     hashrate, pause, resume, results, connection
 * CUDA         11.0/11.6/6.17.0
 * CUDA GPU     #0 01:00.0 Quadro M2000M 1137/2505 MHz smx:5 arch:50 mem:3415/4095 MB
[2022-05-16 15:58:31.578]  net      use pool gulf.moneroocean.stream:10128  199.247.0.216
[2022-05-16 15:58:31.578]  net      new job from gulf.moneroocean.stream:10128 diff 11369 algo rx/0 height 2624793 (10 tx)
[2022-05-16 15:58:31.580]  cpu      use argon2 implementation AVX2
[2022-05-16 15:58:31.582]  randomx  init dataset algo rx/0 (8 threads) seed 4dea06a49b31df67...
[2022-05-16 15:58:31.583]  randomx  allocated 2336 MB (2080+256) huge pages 100% 1168/1168 +JIT (1 ms)
[2022-05-16 15:58:33.271]  net      new job from gulf.moneroocean.stream:10128 diff 11369 algo rx/0 height 2624794 (9 tx)
[2022-05-16 15:58:36.267]  randomx  dataset ready (4683 ms)
[2022-05-16 15:58:36.268]  nvidia   use profile  rx  (1 thread) scratchpad 2048 KB
|  # | GPU |  BUS ID | INTENSITY | THREADS | BLOCKS | BF |  BS | MEMORY | NAME
|  0 |   0 | 01:00.0 |       320 |      32 |     10 |  8 |  25 |    640 | Quadro M2000M
[2022-05-16 15:58:36.459]  nvidia   READY threads 1/1 (186 ms)
[2022-05-16 15:58:49.423]  net      new job from gulf.moneroocean.stream:10128 diff 11369 algo rx/0 height 2624795 (9 tx)
[2022-05-16 15:58:51.572]  miner    speed 10s/60s/15m n/a n/a n/a H/s max n/a H/s
[2022-05-16 15:59:06.886]  miner    speed 10s/60s/15m 86.09 n/a n/a H/s max 86.42 H/s
[2022-05-16 15:59:22.195]  miner    speed 10s/60s/15m 86.22 n/a n/a H/s max 86.44 H/s
[2022-05-16 15:59:37.508]  miner    speed 10s/60s/15m 86.50 n/a n/a H/s max 86.62 H/s
[2022-05-16 15:59:52.824]  miner    speed 10s/60s/15m 86.49 86.39 n/a H/s max 86.80 H/s
[2022-05-16 16:00:08.134]  miner    speed 10s/60s/15m 86.44 86.44 n/a H/s max 86.80 H/s
[2022-05-16 16:00:13.460]  nvidia   accepted (1/0) diff 11369 (63 ms)

Am I reading this wrong?

@Spudz76
Copy link
Contributor

Spudz76 commented May 18, 2022

Yes.

It only needs to start one CPU thread to speak to the GPU, which is running 32 actual threads (line below).

@grahamreeds
Copy link
Author

Okay.

I wouldn't of expected CUDA performance to be so poor, even on my laptop. It is running about as fast as a raspberry pi!

@Spudz76
Copy link
Contributor

Spudz76 commented May 18, 2022

RandomX is intentionally nasty for GPUs. You normally would run two copies of xmrig (separate folders) and different configs. One with CPU only, GPUs off; and the other with CPU off and GPU(s) only. Because of how intensely different each sort of algorithm work on different devices. And then if you mine into MoneroOcean pool you can process other coins jobs and still get paid XMR. Even if you only point the GPU there, it will score a lot more using GPU-friendly algorithms. Although the M2000M is really old so probably not much more than the CPU scores at rx/0. But much better than 86H/s effective payout handling probably cn-gpu or something (Maxwell cores weren't great, but I run a bunch of even worse Fermi's).

@grahamreeds
Copy link
Author

grahamreeds commented May 19, 2022

So what algo to mine? I tried various algos but they all default to rx/0:

[2022-05-19 13:06:34.885]  net      no active pools, stop mining
 * POOL #1      gulf.moneroocean.stream:10128 algo rx/arq
 * POOL #2      pool.minexmr.com:4444 algo auto
[2022-05-19 13:06:34.951]  net      use pool gulf.moneroocean.stream:10128  199.247.0.216
[2022-05-19 13:06:34.952]  net      new job from gulf.moneroocean.stream:10128 diff 128001 algo rx/0 height 2626870 (38 tx)
[2022-05-19 13:06:45.431]  miner    speed 10s/60s/15m 62.81 n/a n/a H/s max 66.36 H/s
[2022-05-19 13:06:50.419]  config   "D:\github.com\xmrig\build\Release\config.json" was changed, reloading configuration
[2022-05-19 13:06:50.420]  net      no active pools, stop mining
 * POOL #1      gulf.moneroocean.stream:10128 algo rx/wow
 * POOL #2      pool.minexmr.com:4444 algo auto
[2022-05-19 13:06:50.484]  net      use pool gulf.moneroocean.stream:10128  199.247.0.216
[2022-05-19 13:06:50.485]  net      new job from gulf.moneroocean.stream:10128 diff 128001 algo rx/0 height 2626870 (38 tx)
[2022-05-19 13:07:00.763]  miner    speed 10s/60s/15m 64.25 n/a n/a H/s max 66.36 H/s
[2022-05-19 13:07:02.536]  config   "D:\github.com\xmrig\build\Release\config.json" was changed, reloading configuration
[2022-05-19 13:07:02.537]  net      no active pools, stop mining
 * POOL #1      gulf.moneroocean.stream:10128 algo cn/0
 * POOL #2      pool.minexmr.com:4444 algo auto
[2022-05-19 13:07:02.605]  net      use pool gulf.moneroocean.stream:10128  199.247.0.216
[2022-05-19 13:07:02.606]  net      new job from gulf.moneroocean.stream:10128 diff 128001 algo rx/0 height 2626870 (38 tx)

@Spudz76
Copy link
Contributor

Spudz76 commented May 19, 2022

MoneroOcean's extra features require their fork of everything.
MO-xmrig
MO-xmrig-cuda

Then it will benchmark all algorithms at the first run and algo-switch from there on.

@grahamreeds
Copy link
Author

I guess it isn't meant to be - the benchmark fails when it gets to kawpow.

I have logged a ticket: MoneroOcean#7

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants