-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
difference to values in IB monitor #1
Comments
Hi Michael, Sorry for the delay. I though I responded to you over the weekend but it must have never been sent. I haven't had any reason to update xltop in some time so I haven't been actively maintaining it. The difference you seem may be explained by the difference in sampling intervals or by that fact that xltop uses a moving average whereas perfquery -r will give you counter deltas. What values are you using for the tick, window, and interval in your xltop-master.conf? Best, John |
Hi John, tick = 2 The benchmark runs pretty long (> 5mins) and shows the same values all the time. I have 4 servers, showing 7,5,5, and 3 GB/s while the live IB and the live OST monitor (little scripts I wrote) show both 5 GB/s on all servers all the time. Regards, Michael Dr.-Ing. Michael Kluge Technische Universität Dresden Contact: Am 17.09.2013 um 20:06 schrieb John Hammond:
|
Thanks Michael. I'll take a look at the code. In the mean time, could you try again with (tick, window, interval) = (1, 5, 5) and (5, 5, 5)? |
Hi John, did that. I took a couple of screenshots (attached). The upper part of The oss2_write_phase* screenshots are very interesting. The screenshots Regards, Michael --- 8< ----------------------------------------------- while [ 1 ] ; do On 18.09.2013 13:27, John Hammond wrote:
Dr.-Ing. Michael Kluge Technische Universität Dresden Contact: |
Hi John,
do you still maintain xltop? I installed it on a Lustre 2.1.3 cluster and have in parallel a small script running that queries the IB port. I see a large difference between throughput reported by the IB monitor and "xltop u s". The sum of the throughput on all IB port of all oss servers matches the sum of the throughput for all servers as reported by xltop. The numbers are just differently distributed. As the IB monitor only uses "perfquery -r" once a second, I believe this data more than xltop. Do you have any idea how to debug this?
Regards, Michael
The text was updated successfully, but these errors were encountered: