Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NVidia Grace L3 cache size seems wrong #689

Open
JimCownie opened this issue Sep 11, 2024 · 2 comments
Open

NVidia Grace L3 cache size seems wrong #689

JimCownie opened this issue Sep 11, 2024 · 2 comments

Comments

@JimCownie
Copy link

What version of hwloc are you using?

2.10.0
lstopo says
% lstopo --version
lstopo 2.9.0

Which operating system and hardware are you running on?

% uname -a
Linux nid001040 5.14.21-150500.55.31_13.0.53-cray_shasta_c_64k #1 SMP Mon Dec 4 22:56:47 UTC 2023 (03d3f83) aarch64 aarch64 aarch64 GNU/Linux

% lstopo -
Machine (477GB total)
Package L#0
NUMANode L#0 (P#0 117GB)
L3 L#0 (114MB)
L2 L#0 (1024KB) + L1d L#0 (64KB) + L1i L#0 (64KB) + Core L#0 + PU L#0 (P#0)
L2 L#1 (1024KB) + L1d L#1 (64KB) + L1i L#1 (64KB) + Core L#1 + PU L#1 (P#1)
L2 L#2 (1024KB) + L1d L#2 (64KB) + L1i L#2 (64KB) + Core L#2 + PU L#2 (P#2)
L2 L#3 (1024KB) + L1d L#3 (64KB) + L1i L#3 (64KB) + Core L#3 + PU L#3 (P#3)
L2 L#4 (1024KB) + L1d L#4 (64KB) + L1i L#4 (64KB) + Core L#4 + PU L#4 (P#4)
L2 L#5 (1024KB) + L1d L#5 (64KB) + L1i L#5 (64KB) + Core L#5 + PU L#5 (P#5)
L2 L#6 (1024KB) + L1d L#6 (64KB) + L1i L#6 (64KB) + Core L#6 + PU L#6 (P#6)
L2 L#7 (1024KB) + L1d L#7 (64KB) + L1i L#7 (64KB) + Core L#7 + PU L#7 (P#7)
L2 L#8 (1024KB) + L1d L#8 (64KB) + L1i L#8 (64KB) + Core L#8 + PU L#8 (P#8)
L2 L#9 (1024KB) + L1d L#9 (64KB) + L1i L#9 (64KB) + Core L#9 + PU L#9 (P#9)
L2 L#10 (1024KB) + L1d L#10 (64KB) + L1i L#10 (64KB) + Core L#10 + PU L#10 (P#10)
L2 L#11 (1024KB) + L1d L#11 (64KB) + L1i L#11 (64KB) + Core L#11 + PU L#11 (P#11)
L2 L#12 (1024KB) + L1d L#12 (64KB) + L1i L#12 (64KB) + Core L#12 + PU L#12 (P#12)
L2 L#13 (1024KB) + L1d L#13 (64KB) + L1i L#13 (64KB) + Core L#13 + PU L#13 (P#13)
L2 L#14 (1024KB) + L1d L#14 (64KB) + L1i L#14 (64KB) + Core L#14 + PU L#14 (P#14)
L2 L#15 (1024KB) + L1d L#15 (64KB) + L1i L#15 (64KB) + Core L#15 + PU L#15 (P#15)
L2 L#16 (1024KB) + L1d L#16 (64KB) + L1i L#16 (64KB) + Core L#16 + PU L#16 (P#16)
L2 L#17 (1024KB) + L1d L#17 (64KB) + L1i L#17 (64KB) + Core L#17 + PU L#17 (P#17)
L2 L#18 (1024KB) + L1d L#18 (64KB) + L1i L#18 (64KB) + Core L#18 + PU L#18 (P#18)
L2 L#19 (1024KB) + L1d L#19 (64KB) + L1i L#19 (64KB) + Core L#19 + PU L#19 (P#19)
L2 L#20 (1024KB) + L1d L#20 (64KB) + L1i L#20 (64KB) + Core L#20 + PU L#20 (P#20)
L2 L#21 (1024KB) + L1d L#21 (64KB) + L1i L#21 (64KB) + Core L#21 + PU L#21 (P#21)
L2 L#22 (1024KB) + L1d L#22 (64KB) + L1i L#22 (64KB) + Core L#22 + PU L#22 (P#22)
L2 L#23 (1024KB) + L1d L#23 (64KB) + L1i L#23 (64KB) + Core L#23 + PU L#23 (P#23)
L2 L#24 (1024KB) + L1d L#24 (64KB) + L1i L#24 (64KB) + Core L#24 + PU L#24 (P#24)
L2 L#25 (1024KB) + L1d L#25 (64KB) + L1i L#25 (64KB) + Core L#25 + PU L#25 (P#25)
L2 L#26 (1024KB) + L1d L#26 (64KB) + L1i L#26 (64KB) + Core L#26 + PU L#26 (P#26)
L2 L#27 (1024KB) + L1d L#27 (64KB) + L1i L#27 (64KB) + Core L#27 + PU L#27 (P#27)
L2 L#28 (1024KB) + L1d L#28 (64KB) + L1i L#28 (64KB) + Core L#28 + PU L#28 (P#28)
L2 L#29 (1024KB) + L1d L#29 (64KB) + L1i L#29 (64KB) + Core L#29 + PU L#29 (P#29)
L2 L#30 (1024KB) + L1d L#30 (64KB) + L1i L#30 (64KB) + Core L#30 + PU L#30 (P#30)
L2 L#31 (1024KB) + L1d L#31 (64KB) + L1i L#31 (64KB) + Core L#31 + PU L#31 (P#31)
L2 L#32 (1024KB) + L1d L#32 (64KB) + L1i L#32 (64KB) + Core L#32 + PU L#32 (P#32)
L2 L#33 (1024KB) + L1d L#33 (64KB) + L1i L#33 (64KB) + Core L#33 + PU L#33 (P#33)
L2 L#34 (1024KB) + L1d L#34 (64KB) + L1i L#34 (64KB) + Core L#34 + PU L#34 (P#34)
L2 L#35 (1024KB) + L1d L#35 (64KB) + L1i L#35 (64KB) + Core L#35 + PU L#35 (P#35)
L2 L#36 (1024KB) + L1d L#36 (64KB) + L1i L#36 (64KB) + Core L#36 + PU L#36 (P#36)
L2 L#37 (1024KB) + L1d L#37 (64KB) + L1i L#37 (64KB) + Core L#37 + PU L#37 (P#37)
L2 L#38 (1024KB) + L1d L#38 (64KB) + L1i L#38 (64KB) + Core L#38 + PU L#38 (P#38)
L2 L#39 (1024KB) + L1d L#39 (64KB) + L1i L#39 (64KB) + Core L#39 + PU L#39 (P#39)
L2 L#40 (1024KB) + L1d L#40 (64KB) + L1i L#40 (64KB) + Core L#40 + PU L#40 (P#40)
L2 L#41 (1024KB) + L1d L#41 (64KB) + L1i L#41 (64KB) + Core L#41 + PU L#41 (P#41)
L2 L#42 (1024KB) + L1d L#42 (64KB) + L1i L#42 (64KB) + Core L#42 + PU L#42 (P#42)
L2 L#43 (1024KB) + L1d L#43 (64KB) + L1i L#43 (64KB) + Core L#43 + PU L#43 (P#43)
L2 L#44 (1024KB) + L1d L#44 (64KB) + L1i L#44 (64KB) + Core L#44 + PU L#44 (P#44)
L2 L#45 (1024KB) + L1d L#45 (64KB) + L1i L#45 (64KB) + Core L#45 + PU L#45 (P#45)
L2 L#46 (1024KB) + L1d L#46 (64KB) + L1i L#46 (64KB) + Core L#46 + PU L#46 (P#46)
L2 L#47 (1024KB) + L1d L#47 (64KB) + L1i L#47 (64KB) + Core L#47 + PU L#47 (P#47)
L2 L#48 (1024KB) + L1d L#48 (64KB) + L1i L#48 (64KB) + Core L#48 + PU L#48 (P#48)
L2 L#49 (1024KB) + L1d L#49 (64KB) + L1i L#49 (64KB) + Core L#49 + PU L#49 (P#49)
L2 L#50 (1024KB) + L1d L#50 (64KB) + L1i L#50 (64KB) + Core L#50 + PU L#50 (P#50)
L2 L#51 (1024KB) + L1d L#51 (64KB) + L1i L#51 (64KB) + Core L#51 + PU L#51 (P#51)
L2 L#52 (1024KB) + L1d L#52 (64KB) + L1i L#52 (64KB) + Core L#52 + PU L#52 (P#52)
L2 L#53 (1024KB) + L1d L#53 (64KB) + L1i L#53 (64KB) + Core L#53 + PU L#53 (P#53)
L2 L#54 (1024KB) + L1d L#54 (64KB) + L1i L#54 (64KB) + Core L#54 + PU L#54 (P#54)
L2 L#55 (1024KB) + L1d L#55 (64KB) + L1i L#55 (64KB) + Core L#55 + PU L#55 (P#55)
L2 L#56 (1024KB) + L1d L#56 (64KB) + L1i L#56 (64KB) + Core L#56 + PU L#56 (P#56)
L2 L#57 (1024KB) + L1d L#57 (64KB) + L1i L#57 (64KB) + Core L#57 + PU L#57 (P#57)
L2 L#58 (1024KB) + L1d L#58 (64KB) + L1i L#58 (64KB) + Core L#58 + PU L#58 (P#58)
L2 L#59 (1024KB) + L1d L#59 (64KB) + L1i L#59 (64KB) + Core L#59 + PU L#59 (P#59)
L2 L#60 (1024KB) + L1d L#60 (64KB) + L1i L#60 (64KB) + Core L#60 + PU L#60 (P#60)
L2 L#61 (1024KB) + L1d L#61 (64KB) + L1i L#61 (64KB) + Core L#61 + PU L#61 (P#61)
L2 L#62 (1024KB) + L1d L#62 (64KB) + L1i L#62 (64KB) + Core L#62 + PU L#62 (P#62)
L2 L#63 (1024KB) + L1d L#63 (64KB) + L1i L#63 (64KB) + Core L#63 + PU L#63 (P#63)
L2 L#64 (1024KB) + L1d L#64 (64KB) + L1i L#64 (64KB) + Core L#64 + PU L#64 (P#64)
L2 L#65 (1024KB) + L1d L#65 (64KB) + L1i L#65 (64KB) + Core L#65 + PU L#65 (P#65)
L2 L#66 (1024KB) + L1d L#66 (64KB) + L1i L#66 (64KB) + Core L#66 + PU L#66 (P#66)
L2 L#67 (1024KB) + L1d L#67 (64KB) + L1i L#67 (64KB) + Core L#67 + PU L#67 (P#67)
L2 L#68 (1024KB) + L1d L#68 (64KB) + L1i L#68 (64KB) + Core L#68 + PU L#68 (P#68)
L2 L#69 (1024KB) + L1d L#69 (64KB) + L1i L#69 (64KB) + Core L#69 + PU L#69 (P#69)
L2 L#70 (1024KB) + L1d L#70 (64KB) + L1i L#70 (64KB) + Core L#70 + PU L#70 (P#70)
L2 L#71 (1024KB) + L1d L#71 (64KB) + L1i L#71 (64KB) + Core L#71 + PU L#71 (P#71)
HostBridge
PCIBridge
PCI 0002:01:00.0 (Ethernet)
Net "hsn0"
HostBridge
PCIBridge
PCI 0005:01:00.0 (Ethernet)
Net "nmn0"
HostBridge
PCIBridge
PCI 0007:01:00.0 (NVMExp)
Block(Disk) "nvme0n1"
HostBridge
PCIBridge
PCI 0009:01:00.0 (3D)
Package L#1
NUMANode L#1 (P#1 120GB)
L3 L#1 (114MB)
L2 L#72 (1024KB) + L1d L#72 (64KB) + L1i L#72 (64KB) + Core L#72 + PU L#72 (P#72)
L2 L#73 (1024KB) + L1d L#73 (64KB) + L1i L#73 (64KB) + Core L#73 + PU L#73 (P#73)
L2 L#74 (1024KB) + L1d L#74 (64KB) + L1i L#74 (64KB) + Core L#74 + PU L#74 (P#74)
L2 L#75 (1024KB) + L1d L#75 (64KB) + L1i L#75 (64KB) + Core L#75 + PU L#75 (P#75)
L2 L#76 (1024KB) + L1d L#76 (64KB) + L1i L#76 (64KB) + Core L#76 + PU L#76 (P#76)
L2 L#77 (1024KB) + L1d L#77 (64KB) + L1i L#77 (64KB) + Core L#77 + PU L#77 (P#77)
L2 L#78 (1024KB) + L1d L#78 (64KB) + L1i L#78 (64KB) + Core L#78 + PU L#78 (P#78)
L2 L#79 (1024KB) + L1d L#79 (64KB) + L1i L#79 (64KB) + Core L#79 + PU L#79 (P#79)
L2 L#80 (1024KB) + L1d L#80 (64KB) + L1i L#80 (64KB) + Core L#80 + PU L#80 (P#80)
L2 L#81 (1024KB) + L1d L#81 (64KB) + L1i L#81 (64KB) + Core L#81 + PU L#81 (P#81)
L2 L#82 (1024KB) + L1d L#82 (64KB) + L1i L#82 (64KB) + Core L#82 + PU L#82 (P#82)
L2 L#83 (1024KB) + L1d L#83 (64KB) + L1i L#83 (64KB) + Core L#83 + PU L#83 (P#83)
L2 L#84 (1024KB) + L1d L#84 (64KB) + L1i L#84 (64KB) + Core L#84 + PU L#84 (P#84)
L2 L#85 (1024KB) + L1d L#85 (64KB) + L1i L#85 (64KB) + Core L#85 + PU L#85 (P#85)
L2 L#86 (1024KB) + L1d L#86 (64KB) + L1i L#86 (64KB) + Core L#86 + PU L#86 (P#86)
L2 L#87 (1024KB) + L1d L#87 (64KB) + L1i L#87 (64KB) + Core L#87 + PU L#87 (P#87)
L2 L#88 (1024KB) + L1d L#88 (64KB) + L1i L#88 (64KB) + Core L#88 + PU L#88 (P#88)
L2 L#89 (1024KB) + L1d L#89 (64KB) + L1i L#89 (64KB) + Core L#89 + PU L#89 (P#89)
L2 L#90 (1024KB) + L1d L#90 (64KB) + L1i L#90 (64KB) + Core L#90 + PU L#90 (P#90)
L2 L#91 (1024KB) + L1d L#91 (64KB) + L1i L#91 (64KB) + Core L#91 + PU L#91 (P#91)
L2 L#92 (1024KB) + L1d L#92 (64KB) + L1i L#92 (64KB) + Core L#92 + PU L#92 (P#92)
L2 L#93 (1024KB) + L1d L#93 (64KB) + L1i L#93 (64KB) + Core L#93 + PU L#93 (P#93)
L2 L#94 (1024KB) + L1d L#94 (64KB) + L1i L#94 (64KB) + Core L#94 + PU L#94 (P#94)
L2 L#95 (1024KB) + L1d L#95 (64KB) + L1i L#95 (64KB) + Core L#95 + PU L#95 (P#95)
L2 L#96 (1024KB) + L1d L#96 (64KB) + L1i L#96 (64KB) + Core L#96 + PU L#96 (P#96)
L2 L#97 (1024KB) + L1d L#97 (64KB) + L1i L#97 (64KB) + Core L#97 + PU L#97 (P#97)
L2 L#98 (1024KB) + L1d L#98 (64KB) + L1i L#98 (64KB) + Core L#98 + PU L#98 (P#98)
L2 L#99 (1024KB) + L1d L#99 (64KB) + L1i L#99 (64KB) + Core L#99 + PU L#99 (P#99)
L2 L#100 (1024KB) + L1d L#100 (64KB) + L1i L#100 (64KB) + Core L#100 + PU L#100 (P#100)
L2 L#101 (1024KB) + L1d L#101 (64KB) + L1i L#101 (64KB) + Core L#101 + PU L#101 (P#101)
L2 L#102 (1024KB) + L1d L#102 (64KB) + L1i L#102 (64KB) + Core L#102 + PU L#102 (P#102)
L2 L#103 (1024KB) + L1d L#103 (64KB) + L1i L#103 (64KB) + Core L#103 + PU L#103 (P#103)
L2 L#104 (1024KB) + L1d L#104 (64KB) + L1i L#104 (64KB) + Core L#104 + PU L#104 (P#104)
L2 L#105 (1024KB) + L1d L#105 (64KB) + L1i L#105 (64KB) + Core L#105 + PU L#105 (P#105)
L2 L#106 (1024KB) + L1d L#106 (64KB) + L1i L#106 (64KB) + Core L#106 + PU L#106 (P#106)
L2 L#107 (1024KB) + L1d L#107 (64KB) + L1i L#107 (64KB) + Core L#107 + PU L#107 (P#107)
L2 L#108 (1024KB) + L1d L#108 (64KB) + L1i L#108 (64KB) + Core L#108 + PU L#108 (P#108)
L2 L#109 (1024KB) + L1d L#109 (64KB) + L1i L#109 (64KB) + Core L#109 + PU L#109 (P#109)
L2 L#110 (1024KB) + L1d L#110 (64KB) + L1i L#110 (64KB) + Core L#110 + PU L#110 (P#110)
L2 L#111 (1024KB) + L1d L#111 (64KB) + L1i L#111 (64KB) + Core L#111 + PU L#111 (P#111)
L2 L#112 (1024KB) + L1d L#112 (64KB) + L1i L#112 (64KB) + Core L#112 + PU L#112 (P#112)
L2 L#113 (1024KB) + L1d L#113 (64KB) + L1i L#113 (64KB) + Core L#113 + PU L#113 (P#113)
L2 L#114 (1024KB) + L1d L#114 (64KB) + L1i L#114 (64KB) + Core L#114 + PU L#114 (P#114)
L2 L#115 (1024KB) + L1d L#115 (64KB) + L1i L#115 (64KB) + Core L#115 + PU L#115 (P#115)
L2 L#116 (1024KB) + L1d L#116 (64KB) + L1i L#116 (64KB) + Core L#116 + PU L#116 (P#116)
L2 L#117 (1024KB) + L1d L#117 (64KB) + L1i L#117 (64KB) + Core L#117 + PU L#117 (P#117)
L2 L#118 (1024KB) + L1d L#118 (64KB) + L1i L#118 (64KB) + Core L#118 + PU L#118 (P#118)
L2 L#119 (1024KB) + L1d L#119 (64KB) + L1i L#119 (64KB) + Core L#119 + PU L#119 (P#119)
L2 L#120 (1024KB) + L1d L#120 (64KB) + L1i L#120 (64KB) + Core L#120 + PU L#120 (P#120)
L2 L#121 (1024KB) + L1d L#121 (64KB) + L1i L#121 (64KB) + Core L#121 + PU L#121 (P#121)
L2 L#122 (1024KB) + L1d L#122 (64KB) + L1i L#122 (64KB) + Core L#122 + PU L#122 (P#122)
L2 L#123 (1024KB) + L1d L#123 (64KB) + L1i L#123 (64KB) + Core L#123 + PU L#123 (P#123)
L2 L#124 (1024KB) + L1d L#124 (64KB) + L1i L#124 (64KB) + Core L#124 + PU L#124 (P#124)
L2 L#125 (1024KB) + L1d L#125 (64KB) + L1i L#125 (64KB) + Core L#125 + PU L#125 (P#125)
L2 L#126 (1024KB) + L1d L#126 (64KB) + L1i L#126 (64KB) + Core L#126 + PU L#126 (P#126)
L2 L#127 (1024KB) + L1d L#127 (64KB) + L1i L#127 (64KB) + Core L#127 + PU L#127 (P#127)
L2 L#128 (1024KB) + L1d L#128 (64KB) + L1i L#128 (64KB) + Core L#128 + PU L#128 (P#128)
L2 L#129 (1024KB) + L1d L#129 (64KB) + L1i L#129 (64KB) + Core L#129 + PU L#129 (P#129)
L2 L#130 (1024KB) + L1d L#130 (64KB) + L1i L#130 (64KB) + Core L#130 + PU L#130 (P#130)
L2 L#131 (1024KB) + L1d L#131 (64KB) + L1i L#131 (64KB) + Core L#131 + PU L#131 (P#131)
L2 L#132 (1024KB) + L1d L#132 (64KB) + L1i L#132 (64KB) + Core L#132 + PU L#132 (P#132)
L2 L#133 (1024KB) + L1d L#133 (64KB) + L1i L#133 (64KB) + Core L#133 + PU L#133 (P#133)
L2 L#134 (1024KB) + L1d L#134 (64KB) + L1i L#134 (64KB) + Core L#134 + PU L#134 (P#134)
L2 L#135 (1024KB) + L1d L#135 (64KB) + L1i L#135 (64KB) + Core L#135 + PU L#135 (P#135)
L2 L#136 (1024KB) + L1d L#136 (64KB) + L1i L#136 (64KB) + Core L#136 + PU L#136 (P#136)
L2 L#137 (1024KB) + L1d L#137 (64KB) + L1i L#137 (64KB) + Core L#137 + PU L#137 (P#137)
L2 L#138 (1024KB) + L1d L#138 (64KB) + L1i L#138 (64KB) + Core L#138 + PU L#138 (P#138)
L2 L#139 (1024KB) + L1d L#139 (64KB) + L1i L#139 (64KB) + Core L#139 + PU L#139 (P#139)
L2 L#140 (1024KB) + L1d L#140 (64KB) + L1i L#140 (64KB) + Core L#140 + PU L#140 (P#140)
L2 L#141 (1024KB) + L1d L#141 (64KB) + L1i L#141 (64KB) + Core L#141 + PU L#141 (P#141)
L2 L#142 (1024KB) + L1d L#142 (64KB) + L1i L#142 (64KB) + Core L#142 + PU L#142 (P#142)
L2 L#143 (1024KB) + L1d L#143 (64KB) + L1i L#143 (64KB) + Core L#143 + PU L#143 (P#143)
HostBridge
PCIBridge
PCI 0010:01:00.0 (Ethernet)
Net "hsn2"
HostBridge
PCIBridge
PCI 0019:01:00.0 (3D)
Package L#2
NUMANode L#2 (P#2 120GB)
L3 L#2 (114MB)
L2 L#144 (1024KB) + L1d L#144 (64KB) + L1i L#144 (64KB) + Core L#144 + PU L#144 (P#144)
L2 L#145 (1024KB) + L1d L#145 (64KB) + L1i L#145 (64KB) + Core L#145 + PU L#145 (P#145)
L2 L#146 (1024KB) + L1d L#146 (64KB) + L1i L#146 (64KB) + Core L#146 + PU L#146 (P#146)
L2 L#147 (1024KB) + L1d L#147 (64KB) + L1i L#147 (64KB) + Core L#147 + PU L#147 (P#147)
L2 L#148 (1024KB) + L1d L#148 (64KB) + L1i L#148 (64KB) + Core L#148 + PU L#148 (P#148)
L2 L#149 (1024KB) + L1d L#149 (64KB) + L1i L#149 (64KB) + Core L#149 + PU L#149 (P#149)
L2 L#150 (1024KB) + L1d L#150 (64KB) + L1i L#150 (64KB) + Core L#150 + PU L#150 (P#150)
L2 L#151 (1024KB) + L1d L#151 (64KB) + L1i L#151 (64KB) + Core L#151 + PU L#151 (P#151)
L2 L#152 (1024KB) + L1d L#152 (64KB) + L1i L#152 (64KB) + Core L#152 + PU L#152 (P#152)
L2 L#153 (1024KB) + L1d L#153 (64KB) + L1i L#153 (64KB) + Core L#153 + PU L#153 (P#153)
L2 L#154 (1024KB) + L1d L#154 (64KB) + L1i L#154 (64KB) + Core L#154 + PU L#154 (P#154)
L2 L#155 (1024KB) + L1d L#155 (64KB) + L1i L#155 (64KB) + Core L#155 + PU L#155 (P#155)
L2 L#156 (1024KB) + L1d L#156 (64KB) + L1i L#156 (64KB) + Core L#156 + PU L#156 (P#156)
L2 L#157 (1024KB) + L1d L#157 (64KB) + L1i L#157 (64KB) + Core L#157 + PU L#157 (P#157)
L2 L#158 (1024KB) + L1d L#158 (64KB) + L1i L#158 (64KB) + Core L#158 + PU L#158 (P#158)
L2 L#159 (1024KB) + L1d L#159 (64KB) + L1i L#159 (64KB) + Core L#159 + PU L#159 (P#159)
L2 L#160 (1024KB) + L1d L#160 (64KB) + L1i L#160 (64KB) + Core L#160 + PU L#160 (P#160)
L2 L#161 (1024KB) + L1d L#161 (64KB) + L1i L#161 (64KB) + Core L#161 + PU L#161 (P#161)
L2 L#162 (1024KB) + L1d L#162 (64KB) + L1i L#162 (64KB) + Core L#162 + PU L#162 (P#162)
L2 L#163 (1024KB) + L1d L#163 (64KB) + L1i L#163 (64KB) + Core L#163 + PU L#163 (P#163)
L2 L#164 (1024KB) + L1d L#164 (64KB) + L1i L#164 (64KB) + Core L#164 + PU L#164 (P#164)
L2 L#165 (1024KB) + L1d L#165 (64KB) + L1i L#165 (64KB) + Core L#165 + PU L#165 (P#165)
L2 L#166 (1024KB) + L1d L#166 (64KB) + L1i L#166 (64KB) + Core L#166 + PU L#166 (P#166)
L2 L#167 (1024KB) + L1d L#167 (64KB) + L1i L#167 (64KB) + Core L#167 + PU L#167 (P#167)
L2 L#168 (1024KB) + L1d L#168 (64KB) + L1i L#168 (64KB) + Core L#168 + PU L#168 (P#168)
L2 L#169 (1024KB) + L1d L#169 (64KB) + L1i L#169 (64KB) + Core L#169 + PU L#169 (P#169)
L2 L#170 (1024KB) + L1d L#170 (64KB) + L1i L#170 (64KB) + Core L#170 + PU L#170 (P#170)
L2 L#171 (1024KB) + L1d L#171 (64KB) + L1i L#171 (64KB) + Core L#171 + PU L#171 (P#171)
L2 L#172 (1024KB) + L1d L#172 (64KB) + L1i L#172 (64KB) + Core L#172 + PU L#172 (P#172)
L2 L#173 (1024KB) + L1d L#173 (64KB) + L1i L#173 (64KB) + Core L#173 + PU L#173 (P#173)
L2 L#174 (1024KB) + L1d L#174 (64KB) + L1i L#174 (64KB) + Core L#174 + PU L#174 (P#174)
L2 L#175 (1024KB) + L1d L#175 (64KB) + L1i L#175 (64KB) + Core L#175 + PU L#175 (P#175)
L2 L#176 (1024KB) + L1d L#176 (64KB) + L1i L#176 (64KB) + Core L#176 + PU L#176 (P#176)
L2 L#177 (1024KB) + L1d L#177 (64KB) + L1i L#177 (64KB) + Core L#177 + PU L#177 (P#177)
L2 L#178 (1024KB) + L1d L#178 (64KB) + L1i L#178 (64KB) + Core L#178 + PU L#178 (P#178)
L2 L#179 (1024KB) + L1d L#179 (64KB) + L1i L#179 (64KB) + Core L#179 + PU L#179 (P#179)
L2 L#180 (1024KB) + L1d L#180 (64KB) + L1i L#180 (64KB) + Core L#180 + PU L#180 (P#180)
L2 L#181 (1024KB) + L1d L#181 (64KB) + L1i L#181 (64KB) + Core L#181 + PU L#181 (P#181)
L2 L#182 (1024KB) + L1d L#182 (64KB) + L1i L#182 (64KB) + Core L#182 + PU L#182 (P#182)
L2 L#183 (1024KB) + L1d L#183 (64KB) + L1i L#183 (64KB) + Core L#183 + PU L#183 (P#183)
L2 L#184 (1024KB) + L1d L#184 (64KB) + L1i L#184 (64KB) + Core L#184 + PU L#184 (P#184)
L2 L#185 (1024KB) + L1d L#185 (64KB) + L1i L#185 (64KB) + Core L#185 + PU L#185 (P#185)
L2 L#186 (1024KB) + L1d L#186 (64KB) + L1i L#186 (64KB) + Core L#186 + PU L#186 (P#186)
L2 L#187 (1024KB) + L1d L#187 (64KB) + L1i L#187 (64KB) + Core L#187 + PU L#187 (P#187)
L2 L#188 (1024KB) + L1d L#188 (64KB) + L1i L#188 (64KB) + Core L#188 + PU L#188 (P#188)
L2 L#189 (1024KB) + L1d L#189 (64KB) + L1i L#189 (64KB) + Core L#189 + PU L#189 (P#189)
L2 L#190 (1024KB) + L1d L#190 (64KB) + L1i L#190 (64KB) + Core L#190 + PU L#190 (P#190)
L2 L#191 (1024KB) + L1d L#191 (64KB) + L1i L#191 (64KB) + Core L#191 + PU L#191 (P#191)
L2 L#192 (1024KB) + L1d L#192 (64KB) + L1i L#192 (64KB) + Core L#192 + PU L#192 (P#192)
L2 L#193 (1024KB) + L1d L#193 (64KB) + L1i L#193 (64KB) + Core L#193 + PU L#193 (P#193)
L2 L#194 (1024KB) + L1d L#194 (64KB) + L1i L#194 (64KB) + Core L#194 + PU L#194 (P#194)
L2 L#195 (1024KB) + L1d L#195 (64KB) + L1i L#195 (64KB) + Core L#195 + PU L#195 (P#195)
L2 L#196 (1024KB) + L1d L#196 (64KB) + L1i L#196 (64KB) + Core L#196 + PU L#196 (P#196)
L2 L#197 (1024KB) + L1d L#197 (64KB) + L1i L#197 (64KB) + Core L#197 + PU L#197 (P#197)
L2 L#198 (1024KB) + L1d L#198 (64KB) + L1i L#198 (64KB) + Core L#198 + PU L#198 (P#198)
L2 L#199 (1024KB) + L1d L#199 (64KB) + L1i L#199 (64KB) + Core L#199 + PU L#199 (P#199)
L2 L#200 (1024KB) + L1d L#200 (64KB) + L1i L#200 (64KB) + Core L#200 + PU L#200 (P#200)
L2 L#201 (1024KB) + L1d L#201 (64KB) + L1i L#201 (64KB) + Core L#201 + PU L#201 (P#201)
L2 L#202 (1024KB) + L1d L#202 (64KB) + L1i L#202 (64KB) + Core L#202 + PU L#202 (P#202)
L2 L#203 (1024KB) + L1d L#203 (64KB) + L1i L#203 (64KB) + Core L#203 + PU L#203 (P#203)
L2 L#204 (1024KB) + L1d L#204 (64KB) + L1i L#204 (64KB) + Core L#204 + PU L#204 (P#204)
L2 L#205 (1024KB) + L1d L#205 (64KB) + L1i L#205 (64KB) + Core L#205 + PU L#205 (P#205)
L2 L#206 (1024KB) + L1d L#206 (64KB) + L1i L#206 (64KB) + Core L#206 + PU L#206 (P#206)
L2 L#207 (1024KB) + L1d L#207 (64KB) + L1i L#207 (64KB) + Core L#207 + PU L#207 (P#207)
L2 L#208 (1024KB) + L1d L#208 (64KB) + L1i L#208 (64KB) + Core L#208 + PU L#208 (P#208)
L2 L#209 (1024KB) + L1d L#209 (64KB) + L1i L#209 (64KB) + Core L#209 + PU L#209 (P#209)
L2 L#210 (1024KB) + L1d L#210 (64KB) + L1i L#210 (64KB) + Core L#210 + PU L#210 (P#210)
L2 L#211 (1024KB) + L1d L#211 (64KB) + L1i L#211 (64KB) + Core L#211 + PU L#211 (P#211)
L2 L#212 (1024KB) + L1d L#212 (64KB) + L1i L#212 (64KB) + Core L#212 + PU L#212 (P#212)
L2 L#213 (1024KB) + L1d L#213 (64KB) + L1i L#213 (64KB) + Core L#213 + PU L#213 (P#213)
L2 L#214 (1024KB) + L1d L#214 (64KB) + L1i L#214 (64KB) + Core L#214 + PU L#214 (P#214)
L2 L#215 (1024KB) + L1d L#215 (64KB) + L1i L#215 (64KB) + Core L#215 + PU L#215 (P#215)
HostBridge
PCIBridge
PCI 0020:01:00.0 (Ethernet)
Net "hsn3"
HostBridge
PCIBridge
PCI 0029:01:00.0 (3D)
Package L#3
NUMANode L#3 (P#3 120GB)
L3 L#3 (114MB)
L2 L#216 (1024KB) + L1d L#216 (64KB) + L1i L#216 (64KB) + Core L#216 + PU L#216 (P#216)
L2 L#217 (1024KB) + L1d L#217 (64KB) + L1i L#217 (64KB) + Core L#217 + PU L#217 (P#217)
L2 L#218 (1024KB) + L1d L#218 (64KB) + L1i L#218 (64KB) + Core L#218 + PU L#218 (P#218)
L2 L#219 (1024KB) + L1d L#219 (64KB) + L1i L#219 (64KB) + Core L#219 + PU L#219 (P#219)
L2 L#220 (1024KB) + L1d L#220 (64KB) + L1i L#220 (64KB) + Core L#220 + PU L#220 (P#220)
L2 L#221 (1024KB) + L1d L#221 (64KB) + L1i L#221 (64KB) + Core L#221 + PU L#221 (P#221)
L2 L#222 (1024KB) + L1d L#222 (64KB) + L1i L#222 (64KB) + Core L#222 + PU L#222 (P#222)
L2 L#223 (1024KB) + L1d L#223 (64KB) + L1i L#223 (64KB) + Core L#223 + PU L#223 (P#223)
L2 L#224 (1024KB) + L1d L#224 (64KB) + L1i L#224 (64KB) + Core L#224 + PU L#224 (P#224)
L2 L#225 (1024KB) + L1d L#225 (64KB) + L1i L#225 (64KB) + Core L#225 + PU L#225 (P#225)
L2 L#226 (1024KB) + L1d L#226 (64KB) + L1i L#226 (64KB) + Core L#226 + PU L#226 (P#226)
L2 L#227 (1024KB) + L1d L#227 (64KB) + L1i L#227 (64KB) + Core L#227 + PU L#227 (P#227)
L2 L#228 (1024KB) + L1d L#228 (64KB) + L1i L#228 (64KB) + Core L#228 + PU L#228 (P#228)
L2 L#229 (1024KB) + L1d L#229 (64KB) + L1i L#229 (64KB) + Core L#229 + PU L#229 (P#229)
L2 L#230 (1024KB) + L1d L#230 (64KB) + L1i L#230 (64KB) + Core L#230 + PU L#230 (P#230)
L2 L#231 (1024KB) + L1d L#231 (64KB) + L1i L#231 (64KB) + Core L#231 + PU L#231 (P#231)
L2 L#232 (1024KB) + L1d L#232 (64KB) + L1i L#232 (64KB) + Core L#232 + PU L#232 (P#232)
L2 L#233 (1024KB) + L1d L#233 (64KB) + L1i L#233 (64KB) + Core L#233 + PU L#233 (P#233)
L2 L#234 (1024KB) + L1d L#234 (64KB) + L1i L#234 (64KB) + Core L#234 + PU L#234 (P#234)
L2 L#235 (1024KB) + L1d L#235 (64KB) + L1i L#235 (64KB) + Core L#235 + PU L#235 (P#235)
L2 L#236 (1024KB) + L1d L#236 (64KB) + L1i L#236 (64KB) + Core L#236 + PU L#236 (P#236)
L2 L#237 (1024KB) + L1d L#237 (64KB) + L1i L#237 (64KB) + Core L#237 + PU L#237 (P#237)
L2 L#238 (1024KB) + L1d L#238 (64KB) + L1i L#238 (64KB) + Core L#238 + PU L#238 (P#238)
L2 L#239 (1024KB) + L1d L#239 (64KB) + L1i L#239 (64KB) + Core L#239 + PU L#239 (P#239)
L2 L#240 (1024KB) + L1d L#240 (64KB) + L1i L#240 (64KB) + Core L#240 + PU L#240 (P#240)
L2 L#241 (1024KB) + L1d L#241 (64KB) + L1i L#241 (64KB) + Core L#241 + PU L#241 (P#241)
L2 L#242 (1024KB) + L1d L#242 (64KB) + L1i L#242 (64KB) + Core L#242 + PU L#242 (P#242)
L2 L#243 (1024KB) + L1d L#243 (64KB) + L1i L#243 (64KB) + Core L#243 + PU L#243 (P#243)
L2 L#244 (1024KB) + L1d L#244 (64KB) + L1i L#244 (64KB) + Core L#244 + PU L#244 (P#244)
L2 L#245 (1024KB) + L1d L#245 (64KB) + L1i L#245 (64KB) + Core L#245 + PU L#245 (P#245)
L2 L#246 (1024KB) + L1d L#246 (64KB) + L1i L#246 (64KB) + Core L#246 + PU L#246 (P#246)
L2 L#247 (1024KB) + L1d L#247 (64KB) + L1i L#247 (64KB) + Core L#247 + PU L#247 (P#247)
L2 L#248 (1024KB) + L1d L#248 (64KB) + L1i L#248 (64KB) + Core L#248 + PU L#248 (P#248)
L2 L#249 (1024KB) + L1d L#249 (64KB) + L1i L#249 (64KB) + Core L#249 + PU L#249 (P#249)
L2 L#250 (1024KB) + L1d L#250 (64KB) + L1i L#250 (64KB) + Core L#250 + PU L#250 (P#250)
L2 L#251 (1024KB) + L1d L#251 (64KB) + L1i L#251 (64KB) + Core L#251 + PU L#251 (P#251)
L2 L#252 (1024KB) + L1d L#252 (64KB) + L1i L#252 (64KB) + Core L#252 + PU L#252 (P#252)
L2 L#253 (1024KB) + L1d L#253 (64KB) + L1i L#253 (64KB) + Core L#253 + PU L#253 (P#253)
L2 L#254 (1024KB) + L1d L#254 (64KB) + L1i L#254 (64KB) + Core L#254 + PU L#254 (P#254)
L2 L#255 (1024KB) + L1d L#255 (64KB) + L1i L#255 (64KB) + Core L#255 + PU L#255 (P#255)
L2 L#256 (1024KB) + L1d L#256 (64KB) + L1i L#256 (64KB) + Core L#256 + PU L#256 (P#256)
L2 L#257 (1024KB) + L1d L#257 (64KB) + L1i L#257 (64KB) + Core L#257 + PU L#257 (P#257)
L2 L#258 (1024KB) + L1d L#258 (64KB) + L1i L#258 (64KB) + Core L#258 + PU L#258 (P#258)
L2 L#259 (1024KB) + L1d L#259 (64KB) + L1i L#259 (64KB) + Core L#259 + PU L#259 (P#259)
L2 L#260 (1024KB) + L1d L#260 (64KB) + L1i L#260 (64KB) + Core L#260 + PU L#260 (P#260)
L2 L#261 (1024KB) + L1d L#261 (64KB) + L1i L#261 (64KB) + Core L#261 + PU L#261 (P#261)
L2 L#262 (1024KB) + L1d L#262 (64KB) + L1i L#262 (64KB) + Core L#262 + PU L#262 (P#262)
L2 L#263 (1024KB) + L1d L#263 (64KB) + L1i L#263 (64KB) + Core L#263 + PU L#263 (P#263)
L2 L#264 (1024KB) + L1d L#264 (64KB) + L1i L#264 (64KB) + Core L#264 + PU L#264 (P#264)
L2 L#265 (1024KB) + L1d L#265 (64KB) + L1i L#265 (64KB) + Core L#265 + PU L#265 (P#265)
L2 L#266 (1024KB) + L1d L#266 (64KB) + L1i L#266 (64KB) + Core L#266 + PU L#266 (P#266)
L2 L#267 (1024KB) + L1d L#267 (64KB) + L1i L#267 (64KB) + Core L#267 + PU L#267 (P#267)
L2 L#268 (1024KB) + L1d L#268 (64KB) + L1i L#268 (64KB) + Core L#268 + PU L#268 (P#268)
L2 L#269 (1024KB) + L1d L#269 (64KB) + L1i L#269 (64KB) + Core L#269 + PU L#269 (P#269)
L2 L#270 (1024KB) + L1d L#270 (64KB) + L1i L#270 (64KB) + Core L#270 + PU L#270 (P#270)
L2 L#271 (1024KB) + L1d L#271 (64KB) + L1i L#271 (64KB) + Core L#271 + PU L#271 (P#271)
L2 L#272 (1024KB) + L1d L#272 (64KB) + L1i L#272 (64KB) + Core L#272 + PU L#272 (P#272)
L2 L#273 (1024KB) + L1d L#273 (64KB) + L1i L#273 (64KB) + Core L#273 + PU L#273 (P#273)
L2 L#274 (1024KB) + L1d L#274 (64KB) + L1i L#274 (64KB) + Core L#274 + PU L#274 (P#274)
L2 L#275 (1024KB) + L1d L#275 (64KB) + L1i L#275 (64KB) + Core L#275 + PU L#275 (P#275)
L2 L#276 (1024KB) + L1d L#276 (64KB) + L1i L#276 (64KB) + Core L#276 + PU L#276 (P#276)
L2 L#277 (1024KB) + L1d L#277 (64KB) + L1i L#277 (64KB) + Core L#277 + PU L#277 (P#277)
L2 L#278 (1024KB) + L1d L#278 (64KB) + L1i L#278 (64KB) + Core L#278 + PU L#278 (P#278)
L2 L#279 (1024KB) + L1d L#279 (64KB) + L1i L#279 (64KB) + Core L#279 + PU L#279 (P#279)
L2 L#280 (1024KB) + L1d L#280 (64KB) + L1i L#280 (64KB) + Core L#280 + PU L#280 (P#280)
L2 L#281 (1024KB) + L1d L#281 (64KB) + L1i L#281 (64KB) + Core L#281 + PU L#281 (P#281)
L2 L#282 (1024KB) + L1d L#282 (64KB) + L1i L#282 (64KB) + Core L#282 + PU L#282 (P#282)
L2 L#283 (1024KB) + L1d L#283 (64KB) + L1i L#283 (64KB) + Core L#283 + PU L#283 (P#283)
L2 L#284 (1024KB) + L1d L#284 (64KB) + L1i L#284 (64KB) + Core L#284 + PU L#284 (P#284)
L2 L#285 (1024KB) + L1d L#285 (64KB) + L1i L#285 (64KB) + Core L#285 + PU L#285 (P#285)
L2 L#286 (1024KB) + L1d L#286 (64KB) + L1i L#286 (64KB) + Core L#286 + PU L#286 (P#286)
L2 L#287 (1024KB) + L1d L#287 (64KB) + L1i L#287 (64KB) + Core L#287 + PU L#287 (P#287)
HostBridge
PCIBridge
PCI 0030:01:00.0 (Ethernet)
Net "hsn1"
HostBridge
PCIBridge
PCI 0039:01:00.0 (3D)
%

  • Describe the machine and the processors it contains, as well as any memory and/or peripherals that matter to your issue
    This is an Nvidia "Grace"/"Hopper" machine (Isambard-AI)

Details of the problem

lstopo shows the L3$ as being 114MiB/package, whereas NVidia's documentation (here) says that there is 117MiB/package.

Additional information

If you need me to run experiments on the machine, as you may not have access to one, please let me know. That should be possible.

@bgoglin
Copy link
Contributor

bgoglin commented Sep 11, 2024

Hello Jim. I would guess it's a 1024 vs 1000 (114x1024 is close to 117x1000), somebody didn't read the specs carefully when filling hardware tables. That's what's exposed in the firmware and then by Linux, the bug wouldn't be on our side. It's somehow like people claiming they have 192GB of RAM while the BIOS actually only gives 185GB to the OS, we usually just ignore the issue.

What bothers me more is how this L3 is shared. The wording in the doc makes me wonder whether it's actually 234 per dual-package superchip instead of 117 per package. Somebody once reported that their experiment results indeed point to shared per dual-package. B we couldn't get a precise answer by NVIDIA. However, this would have to be fixed in their firmware as well (or worked around in some kernel driver).

@JimCownie
Copy link
Author

JimCownie commented Sep 12, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants