Support hugepages allocation in containers #2589
Labels
comp:agent
Related to Agent component
comp:manager
Related to Manager component
urgency:3
Must be finished within a certain time frame.
Milestone
Some HPC applications may want to use hugepages (2 MiB / 1 GiB page sizes) to reduce TLB cache pressure.
In container runtimes, there are several examples to support hugepages:
Some references on hugepages:
We need to explicitly enable hugepages on part of our testing infra and implement the option, like:
or,
The first option (resource-opt) does not prevent the overlapped usage but just allow the hugepage access from containers with limits.
The second option (resource-slot) treats hugepages as an accounted resource that cannot be shared between different containers. For consistency with MIG slots (
cuda.mig-5g
, ...), I've removed the trailingi
(binary suffix) in the resource slot names.The text was updated successfully, but these errors were encountered: