-
Notifications
You must be signed in to change notification settings - Fork 20
snap PCM task gives wrong values #38
Comments
@keren42: I noticed similar behavior on my machine running ubuntu 16.04 with a xeon (though not the exact model you have). I'm not overly familiar with the plugin/pcm but Ithink that maybe @IzabellaRaulin knows more than me and can help us out. |
hello @keren42, which version of snap-plugin-collector-pcm do you use? Is is the latest one - version 10? |
Hi @IzabellaRaulin , we have 3 servers, only two of them have the problem I described above. On those two we have updated to version 10 and problem remains. The task manifest: {"version":1,"schedule":{"type":"simple","interval":"5s"},"workflow":{"collect":{"metrics":{"/intel/pcm/AFREQ":{},"/intel/pcm/C0res%":{},"/intel/pcm/C1res%":{},"/intel/pcm/C2res%":{},"/intel/pcm/C3res%":{},"/intel/pcm/EXEC":{},"/intel/pcm/FREQ":{},"/intel/pcm/INST":{},"/intel/pcm/INSTnom":{},"/intel/pcm/INSTnom%":{},"/intel/pcm/IPC":{},"/intel/pcm/L2HIT":{},"/intel/pcm/L2MISS":{},"/intel/pcm/L3HIT":{},"/intel/pcm/L3MISS":{},"/intel/pcm/PhysIPC":{},"/intel/pcm/PhysIPC%":{},"/intel/pcm/READ":{},"/intel/pcm/WRITE":{},"/intel/pcm/RMB":{},"/intel/pcm/LMB":{}},"config":{},"process":null,"publish":[{"plugin_name":"rabbitmq","config":{"uri":"10.0.0.216:5672","exchange_name":"snap","routing_key":"ben1","exchange_type":"direct","durable":false}}]}}} |
@Keren, it looks like an issue related with So, the question is why the
Also, could you provide:
|
I can reproduce @keren42 issue (using PCM 201702 release). After some research, I was able to get readings by disabling perf support in PCM Makefile and then rebuilding PCM binaries: # rely on Linux perf support (user needs CAP_SYS_ADMIN privileges), comment out to disable
ifneq ($(wildcard /usr/include/linux/perf_event.h),)
CXXFLAGS += -DPCM_USE_PERF
endif try to comment out these lines. Before:
After:
Maybe it is some issue with privileges (as mentioned in Makefile comment), but tried to change privileges using |
@Keren, any updates? Does the disabling of perf support in PCM resolve your issue? |
@IzabellaRaulin, yes, pcm task watch in all of the servers now display the data, we are not entirely sure what made the difference. Do we need to execute the modprobe msr command after each time we reset the server? |
@keren42 you can add msr module to load at boot editing file /etc/modules on debian based OS, or /etc/modules.conf on CentOS. This module is needed to read performance counters. |
using ubuntu 16.04, xeon CPU E5-2699 v4.
snap release 1.0
When running pcm task we get the following output from task watch:
Watching Task (6e582608-b41e-4d0a-9a3a-109c259d8581):
NAMESPACE DATA TIMESTAMP
^Cntel/pcm/AFREQ -1 2017-02-21 17:02:20.411681592 +0200 IST
/intel/pcm/C0res% 0 2017-02-21 17:02:20.411688011 +0200 IST
/intel/pcm/C1res% 71 2017-02-21 17:02:20.411663986 +0200 IST
/intel/pcm/C2res% 0 2017-02-21 17:02:20.411675044 +0200 IST
/intel/pcm/C3res% 0 2017-02-21 17:02:20.411682865 +0200 IST
/intel/pcm/EXEC 0 2017-02-21 17:02:20.411660247 +0200 IST
/intel/pcm/FREQ 0 2017-02-21 17:02:20.411665382 +0200 IST
/intel/pcm/INST 0 2017-02-21 17:02:20.411662151 +0200 IST
/intel/pcm/INSTnom 0 2017-02-21 17:02:20.411666862 +0200 IST
/intel/pcm/INSTnom% 0 2017-02-21 17:02:20.411676387 +0200 IST
/intel/pcm/IPC -1 2017-02-21 17:02:20.411668292 +0200 IST
/intel/pcm/L2HIT 1 2017-02-21 17:02:20.411677697 +0200 IST
/intel/pcm/L2MISS 0 2017-02-21 17:02:20.411684095 +0200 IST
/intel/pcm/L3HIT 1 2017-02-21 17:02:20.411669659 +0200 IST
/intel/pcm/L3MISS 0 2017-02-21 17:02:20.411679016 +0200 IST
/intel/pcm/LMB 5556 2017-02-21 17:02:20.411670942 +0200 IST
/intel/pcm/PhysIPC -1 2017-02-21 17:02:20.411685391 +0200 IST
/intel/pcm/PhysIPC% -25 2017-02-21 17:02:20.411686744 +0200 IST
/intel/pcm/READ 0.0595 2017-02-21 17:02:20.411672321 +0200 IST
/intel/pcm/RMB 0 2017-02-21 17:02:20.411673628 +0200 IST
/intel/pcm/WRITE 0.0578 2017-02-21 17:02:20.411680269 +0200 IST
All metrics except READ,WRITE,LMB and C1res% remain with the above constant values,
while the pcm application's output shows that all the above metrics do have changing values which from some reason are not received from the task.
snap daemon is running in root mode and the task is in user mode.
What can be the reason for the inconsistency between the values?
Thanks.
The text was updated successfully, but these errors were encountered: