| Summary: | Performance results are not getting generated | ||
|---|---|---|---|
| Product: | [Eclipse Project] Platform | Reporter: | Satyam Kandula <satyam.kandula> |
| Component: | Releng | Assignee: | Kim Moir <kim.moir> |
| Status: | RESOLVED FIXED | QA Contact: | |
| Severity: | normal | ||
| Priority: | P3 | CC: | daniel_megert, deepakazad, kim.moir, markus.kell.r |
| Version: | 3.7 | ||
| Target Milestone: | --- | ||
| Hardware: | PC | ||
| OS: | Windows XP | ||
| Whiteboard: | |||
|
Description
Satyam Kandula
The performance results are now available for I20110424-2000, as is the log for the second Windows performance machine. For some reason, the tests are completing but not giving up their network connection. I had to kill the rsh connection manually so the build could proceed to a state where the results could be generated. As for the tests failing while generating the jacoco report, I have reopened bug 342785.

(In reply to comment #1)
> The performance results are now available for I20110424-2000, as is the log
> for the second Windows performance machine. For some reason, the tests are
> completing but not giving up their network connection. I had to kill the rsh
> connection manually so the build could proceed to a state where the results
> could be generated. As for the tests failing while generating the jacoco
> report, I have reopened bug 342785.

Kim, thanks for taking care of the results for I20110424-2000. I do see the performance results for I20110428-0848 but not for N20110430-2000. Do you think the problem is intermittent, happening on some builds and not on others?

The rsh connections to the test machines keep hanging and preventing the performance results from being generated. The results for http://download.eclipse.rog/eclipse/downloads/drops/N20110430-2000/performance/performance.php are now available.

The results are missing for I20110504-0800 :(

They are being generated now. Again, the rsh connections to the Windows machines stay open after the tests have completed, delaying the results generation. I'm still trying to find the root cause of the problem.

I talked to our sysadmin, and there haven't been any changes to the machines lately. He also said that there aren't any issues with long-running network connections on the machines. Since I'm at a loss as to how to solve this, I'll implement something like this to kill rsh processes that are over a day old:
ps -eo uid,pid,etime,cmd | egrep '^ *507' | grep rsh | egrep ' ([1-9]+-)?([0-9]{2}:?){3}' | awk '{print $2}'| xargs -I{} kill {}
For reference: http://stackoverflow.com/questions/6134/how-do-you-find-the-age-of-a-long-running-linux-process

I added
30 6 * * * ps -eo uid,pid,etime,cmd | egrep '^ *507' | grep rsh | egrep '([1-9]+-)?([0-9]{2}:?){3}' | awk '{print $2}'| xargs kill -9
to the crontab to kill defunct rsh processes that are delaying the performance results generation.
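
For anyone reusing this filter: the day prefix in the egrep pattern is optional, so as written it appears to match any rsh process whose elapsed time includes an hours field, not only those over a day old. Below is a minimal sketch of a stricter variant, assuming the procps `etime` format `[[DD-]hh:]mm:ss`. The function name `old_rsh_pids`, the 86400-second default and the exact match on the command name are illustrative assumptions, not what was installed on the build machine.

```shell
#!/bin/sh
# Hypothetical helper (not the installed cron job): reads
# `ps -eo uid,pid,etime,cmd` output on stdin and prints the PIDs of rsh
# processes owned by the given uid that are older than a limit in seconds.
# Usage: old_rsh_pids UID [LIMIT_SECONDS]   (limit defaults to one day)
old_rsh_pids() {
    awk -v uid="$1" -v limit="${2:-86400}" '
        # $1 = uid, $2 = pid, $3 = etime, $4 = first word of the command line
        $1 == uid && $4 ~ /(^|\/)rsh$/ {
            # etime is [[DD-]hh:]mm:ss, so split on both "-" and ":"
            n = split($3, t, /[-:]/)
            if (n == 4)      secs = t[1] * 86400 + t[2] * 3600 + t[3] * 60 + t[4]
            else if (n == 3) secs = t[1] * 3600 + t[2] * 60 + t[3]
            else             secs = t[1] * 60 + t[2]
            if (secs > limit) print $2
        }'
}

# Example invocation (assumes GNU xargs for -r, which skips kill when
# no PIDs are found):
#   ps -eo uid,pid,etime,cmd | old_rsh_pids 507 | xargs -r kill
```

Computing the age in seconds avoids depending on how the elapsed-time column happens to be padded. Sending a plain `kill` first and reserving `kill -9` for processes that ignore it gives rsh a chance to shut down cleanly.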
Also, the results for I20110510-0800 are now being generated. Hopefully tomorrow they will be available more quickly.
I think this can be closed. |