pcm-latency reports small latency #518
Unanswered
majid0jalili
asked this question in
Q&A
Replies: 1 comment
-
There are a few differences. The DDR latency reported by pcm-latency is measured at the memory controller. The main pcm utility has an LLCRDMISSLAT metric, which is similar to the CHA miss latency you mention.
-
Hi,
I was trying to measure the DRAM latency with pcm-latency, and I noticed that the number it reports is a bit off.
For example, I ran GUPS and STREAM; below is the output for one iteration of each (other samples are roughly similar):
GUPS:
Average latency per socket
L1 Cache Miss Latency(ns) [Adding 5 clocks for L1 Miss]
Socket0: 4.97
Socket1: 5.10
DDR read Latency(ns)
Socket0: 27.27
Socket1: 27.22
STREAM:
Average latency per socket
L1 Cache Miss Latency(ns) [Adding 5 clocks for L1 Miss]
Socket0: 21.26
Socket1: 21.40
DDR read Latency(ns)
Socket0: 118.13
Socket1: 106.93
There is no way this can be true: a fully random access pattern cannot have such low latency. I dug further and used perf stat to see whether this could be resolved. I noticed that perf stat uses three hardware counters to measure latency:
C1 = cha/event=0x36\,umask=0x21\,config=0x40433/
C2 = cha/event=0x35\,umask=0x21\,config=0x40433/
C3 = cha_0/event=0x0/
It then computes
C1/C2/C3*1billion
to report the DRAM latency. I took this from the kernel website.
I tried this on two different machines, SKX and Cascade Lake, and observed the same behavior.