Newest 'numa' Questions

1 vote

1 answer

52 views

How to enable NUMA nodes in Docker container

When I run a Docker container on my system (ARM MacOS, with Docker Desktop), I see that /sys/devices/system/node is not present: docker run -it ubuntu:24.04 # ls /sys/devices/system clockevents ...

Daniel Porteous

6,483

asked Nov 11 at 19:14

1 vote

0 answers

75 views

Why is the core-to-core-latency performance of EPYC 4 so poor in NUMA2 mode?

I test the EYPC 9564 CPU (dual socket), the core-to-core latency of the second socket is very high, even greater than the latency for inter-socket communication. As shown for AMD EPYC 7R13, 48 Cores, ...

wang fuqiang

81

asked Apr 25 at 2:34

1 vote

2 answers

102 views

Is it possible to somehow mix static and dynamic loop scheduling?

I am using a machine with 2 Xeon CPUs having 16 cores each. There are 2 NUMA domains, one for each CPU. I have intensive computation, that also use a lot of memory, and everything is multithreaded. ...

PierU

2,737

asked Nov 12, 2024 at 23:42

1 vote

0 answers

104 views

numactl: Is it possible to use cpu and memory from different numa nodes?

The numactl man page says: --membind=nodes, -m nodes Only allocate memory from nodes. Allocation will fail when there is not enough memory available on these nodes. nodes may be specified as noted ...

smwikipedia

65k

asked Oct 29, 2024 at 8:12

1 vote

0 answers

104 views

Is it possible to load Linux kernel code to a specific NUMA node when booting?

I'm doing a benchmark in Linux kernel. I want to make sure all the benchmarked kernel code is stored in the same NUMA node as the CPU that runs the code. I implement a system call to trigger the ...

sk_buff

101

asked May 15, 2024 at 1:38

1 vote

0 answers

101 views

how to simulate NUMA in gem5?

gem5 how to build NUMA architecture? I know gem5 supports analog NUMA architecture. but I did not find the relevant information under the official library, I didn't find the configuration information ...

hhh1

11

asked Mar 9, 2024 at 17:09

0 votes

0 answers

38 views

Is there a NUMA-like mechanism for a DRAM?

Can I create, for example, two buffers on adjacent memory chips? On chips that are physically closer to the CPU. Or is it implied that ram physical addresses are always adjacent in space? You could ...

Lem

173

asked Nov 1, 2023 at 18:35

1 vote

0 answers

278 views

How to calculate the theoretical max UPI bandwidth of a Linux dual-socket machine?

I want to calculate the theoretical UPI bandwidth of a dual-socket machine running Linux system in order to estimate the max remote memory access bandwidth. Theoretically, UPI bandwidth = UPI speed (...

Frontier_Setter

809

asked May 8, 2023 at 5:15

1 vote

1 answer

269 views

Return value of struct bitmask *numa_get_membind

I bind memory to run program on node 1. I insert some print code in the program to check current binded node. I found a function from numa.h: struct bitmask *numa_get_membind But I couldn't know how ...

김시은

11

asked Apr 15, 2023 at 20:18

0 votes

1 answer

512 views

NUMA memory allocation with hwloc

I'm trying to do NUMA aware memory allocation with hwloc and get somewhat strange behavior. My goal is to allocate blocks of memory on different NUMA nodes as i need this for a project. To verify that ...

Daniel

1

asked Mar 23, 2023 at 18:25

1 vote

1 answer

12k views

Error message in TensorFlow: "could not open file to read NUMA node" and missing directory in /sys/bus/pci/devices

I'm using TensorFlow in my project, and every time I run my code, I get the following error message: 2023-02-23 13:17:55.003041: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:967]...

nazim elhadi

29

asked Feb 23, 2023 at 5:36

0 votes

2 answers

98 views

What is the "TBD Release Iron" and what are the modifications?

Some Win32 API function documentation (for example this and this) contains the following note: Starting with TBD Release Iron, the behavior of this and other NUMA functions has been modified to ...

Maris B.

2,497

asked Nov 25, 2022 at 9:35

0 votes

1 answer

99 views

How granular can multithreaded memory-write access be?

I've read about how NUMA works and that memory is pulled in from RAM through L2 and L1 caches. And that there are only two ways to share data: read access from n (n>=0) threads read-write access ...

office-account

9

asked Oct 7, 2022 at 10:47

1 vote

1 answer

1k views

Why Linux distributes threads among NUMA nodes almost equally?

I'm running an application with multiple threads and it seems Linux is distributing threads among NUMA nodes almost equally. Say my application spawns 4 threads and my machine has 4 sockets. I observe ...

Mohammad Siavashi

1,292

asked Sep 10, 2022 at 8:36

1 vote

2 answers

584 views

How to migrate array to a new NUMA node in C?

I have allocated an array in C as follows: void *mem = mmap(NULL, 8192, PROT_READ | PROT_WRITE, MAP_PRIVATE | MAP_ANONYMOUS | MAP_POPULATE, -1, 0); Imagine this array is initialized and now I need to ...

Mohammad Siavashi

1,292

asked Aug 15, 2022 at 20:09

0 votes

1 answer

1k views

Is it possible to find out which NUMA system memory bank the current thread belongs to?

I'm writing a NUMA-aware algorithm and need this information for optimal memory keeping. It would be nice if you know a solution for JVM(for example using oshi), but I can't find it even for C/C++

Dave11ar

420

asked Jul 18, 2022 at 19:19

1 vote

2 answers

230 views

Understanding the speed up of openmp program across NUMA nodes

I came across this behavior of speed up and I am finding it hard to explain. Following is the background: Program Invocation of Gaussian Elimination method to solve linear equation within a loop to ...

Sriram G

11

asked Jun 26, 2022 at 6:38

3 votes

1 answer

1k views

Explanation for why effective DRAM bandwidth reduces upon adding CPUs

This question is a spin-off of the one posted here: Measuring bandwidth on a ccNUMA system I've written a micro-benchmark for the memory bandwidth on a ccNUMA system with 2x Intel(R) Xeon(R) Platinum ...

Nitin Malapally

648

asked May 13, 2022 at 12:21

0 votes

1 answer

400 views

How to test the problem size scaling performance of code

I'm running a simple kernel which adds two streams of double-precision complex-values. I've parallelized it using OpenMP with custom scheduling: the slice_indices container contains different indices ...

Nitin Malapally

648

asked May 4, 2022 at 11:41

1 vote

0 answers

100 views

What is the order of memory allocation when demand exceeds single numa node

With a 4 numa node linux server(128G each), I was trying to allocate 300G memory by kmalloc_node(2) to specify the allocation start node. Could any great master tell me what is the order of allocation ...

L.H

11

asked Mar 29, 2022 at 2:07

0 votes

1 answer

870 views

What is the meaning of size for the numactl --hardware output

Does anyone know the exact meaning of "node size" for "numactl --hardware" output. I'm asking because I expected this memory value to be fixed but it changes slightly on some of ...

Farouk Khawaja

1

asked Mar 18, 2022 at 15:17

2 votes

1 answer

277 views

How can I realize data local spawning or scheduling of tasks in OpenMP on NUMA CPUs?

I have this simple self-contained example of a very rudimentary 2 dimensional stencil application using OpenMP tasks on dynamic arrays to represent an issue that I am having on a problem that is less ...

user151387

103

asked Feb 27, 2022 at 10:42

1 vote

0 answers

570 views

How to PInvoke UpdateProcThreadAttribute with PROC_THREAD_ATTRIBUTE_PREFERRED_NODE attribute

I'm trying to PInvoke UpdateProcThreadAttribute() with PROC_THREAD_ATTRIBUTE_PREFERRED_NODE attribute, so that I could launch a process on a specific NUMA node. I'm working on Windows Server 2019. I ...

Dan Sagher

21

asked Jan 31, 2022 at 11:11

1 vote

0 answers

500 views

Can page faults be triggered by NUMA access?

I have some multithreaded code where the threads spend a significant amount time in the page fault handler of the kernel (Linux 5.4). But this only happens on a two Socket NUMA machine, but not on on ...

benjamin-lieser

1,888

asked Jan 17, 2022 at 10:43

0 votes

0 answers

232 views

Is there a mlock issue when allocate 1G hugepages return so slow?

Issue: I meet an issue of the 'mlock()' API. the first load is fast when lock the memory from '"/sys/devices/system/node/node0"', but it is too slow on node1, about take more than 3s to ...

Charles

1

asked Nov 25, 2021 at 12:01

Collectives™ on Stack Overflow

How to enable NUMA nodes in Docker container

Why is the core-to-core-latency performance of EPYC 4 so poor in NUMA2 mode?

Is it possible to somehow mix static and dynamic loop scheduling?

numactl: Is it possible to use cpu and memory from different numa nodes?

Is it possible to load Linux kernel code to a specific NUMA node when booting?

how to simulate NUMA in gem5?

Is there a NUMA-like mechanism for a DRAM?

How to calculate the theoretical max UPI bandwidth of a Linux dual-socket machine?

Return value of struct bitmask *numa_get_membind

NUMA memory allocation with hwloc

Error message in TensorFlow: "could not open file to read NUMA node" and missing directory in /sys/bus/pci/devices

What is the "TBD Release Iron" and what are the modifications?

How granular can multithreaded memory-write access be?

Why Linux distributes threads among NUMA nodes almost equally?

How to migrate array to a new NUMA node in C?

Is it possible to find out which NUMA system memory bank the current thread belongs to?

Understanding the speed up of openmp program across NUMA nodes

Explanation for why effective DRAM bandwidth reduces upon adding CPUs

How to test the problem size scaling performance of code

What is the order of memory allocation when demand exceeds single numa node

What is the meaning of size for the numactl --hardware output

How can I realize data local spawning or scheduling of tasks in OpenMP on NUMA CPUs?

How to PInvoke UpdateProcThreadAttribute with PROC_THREAD_ATTRIBUTE_PREFERRED_NODE attribute

Can page faults be triggered by NUMA access?

Is there a mlock issue when allocate 1G hugepages return so slow?

Hot Network Questions