
[back to "Measuring your Domino server's reliability"]

MTBF Server Statistics (sidebar)
Here's an example of a Server Statistic document for a period of one year.

CN=Brubeck/O=Iris for Last Year
Sample was taken: 04/10/98 05:00:11 AM Server build number: 4.1.5
Start Date            End Date              # of Crashes  # of Shutdowns  # of Startups  %Time Up  %Time Down
03/18/97 11:25:20 AM  04/10/98 04:56:32 AM  17            34              48             93.46%    1.10%

                                  Days  Hours  Minutes  Seconds  From                  To
Range Covering                    387   16     31       12       03/18/97 11:25:20 AM  04/10/98 04:56:32 AM
Average Time Up Between Failures  20    3      6        57       0                     0
Geometric Mean Between Failures   14    3      41       22       0                     0
Maximum Time Up                   36    23     55       45       07/23/97 04:22:11 PM  08/29/97 04:17:56 PM
Minimum Time Up                   0     0      0        43       02/09/98 11:30:59 AM  02/09/98 11:31:42 AM
Average Time Up                   7     9      28       16       0                     0
Total Time Up                     362   8      5        18       0                     0
Maximum Time Down                 0     23     20       50       07/18/97 12:05:20 PM  07/19/97 11:26:10 AM
Minimum Time Down                 0     0      0        6        02/09/98 11:30:53 AM  02/09/98 11:30:59 AM
Average Time Down                 0     2      0        29       0                     0
Total Time Down                   4     6      24       43       0                     0
Server Last Up                    9     13     55       1        03/31/98 02:01:31 PM  04/10/98 04:56:32 AM

The name of the server shown above is Brubeck. MTBF calculated these statistics on 4/10/98 at 5:00:11 a.m., as shown in the "Sample was taken" field. Brubeck runs build 4.1.5.

The Start Date column shows how far back in time MTBF went when compiling the statistics. You might request statistics for the last year, but if MTBF has logged information only for the last few months, this column shows the date of the earliest log entry instead. The End Date column shows the most recent log entry that MTBF found when calculating statistics for the period requested. Brubeck has been running for a little over a year.

The other columns show the number of crashes, shutdowns, and startups, and the percentage of time the server was up and down, between the start date and the end date. You can see that Brubeck had 17 crashes, 34 shutdowns, and 48 startups during this time. Brubeck was up and running 93.46% of the time and down 1.10% of the time.
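
As a rough check on the percentage columns, the sketch below recomputes them from the Total Time Up, Total Time Down, and Range Covering rows of the sample. It assumes each percentage is simply the total divided by the range covered; the article does not state the formula, and the helper names `to_seconds` and `percent_of_range` are made up for this illustration.

```python
def to_seconds(days, hours, minutes, seconds):
    """Convert one Days/Hours/Minutes/Seconds row into a number of seconds."""
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


def percent_of_range(part_seconds, range_seconds):
    """Assumed formula: share of the covered range, rounded to two decimals."""
    return round(100.0 * part_seconds / range_seconds, 2)


# Values copied from the sample Server Statistic document for Brubeck.
range_covering = to_seconds(387, 16, 31, 12)
total_up = to_seconds(362, 8, 5, 18)
total_down = to_seconds(4, 6, 24, 43)

pct_up = percent_of_range(total_up, range_covering)
pct_down = percent_of_range(total_down, range_covering)

print(f"%Time Up:   {pct_up:.2f}%")    # 93.46%
print(f"%Time Down: {pct_down:.2f}%")  # 1.10%
```

Both results match the %Time Up and %Time Down values in the sample, which suggests the assumed formula is at least consistent with what MTBF reports.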

You can also look down the rows and find other statistics on Brubeck:

  • Range Covering: This row shows a breakdown of the time between the Start Date and the End Date.
  • Average Time Up Between Failures: This row shows a breakdown of the average time the server was up and running between failures, from the Start Date to the End Date. This number may not be a good way to measure server performance, because quickly restarting your servers after every crash could make it deceptively high.
  • Geometric Mean Between Failures: This row shows a breakdown of the geometric mean of the time the server was up and running between failures, from the Start Date to the End Date. For example, if we had 11 measurements sorted from the least time up to the most time up, this value would be the uptime of the sixth measurement, since it falls right in the middle. (A middle value chosen this way is what statisticians usually call the median; a geometric mean in the strict sense is the nth root of the product of n values.)
  • Maximum Time Up: This row shows a breakdown of the maximum time the server was up and running between the Start Date and the End Date.
  • Minimum Time Up: This row shows a breakdown of the minimum time the server was up and running between the Start Date and the End Date (including server shutdowns).
  • Average Time Up: This row shows a breakdown of the average time the server was up and running between the Start Date and the End Date (including server shutdowns).
  • Total Time Up: This row shows a breakdown of the total amount of time the server was up and running between the Start Date and the End Date (including server shutdowns).
  • Maximum Time Down: This row shows a breakdown of the maximum time the server was down between the Start Date and the End Date (including server shutdowns).
  • Minimum Time Down: This row shows a breakdown of the minimum time the server was down between the Start Date and the End Date (including server shutdowns).
  • Average Time Down: This row shows a breakdown of the average time the server was down between the Start Date and the End Date (including server shutdowns).
  • Total Time Down: This row shows a breakdown of the total amount of time the server was down between the Start Date and the End Date (including server shutdowns).
  • Server Last Up: This row shows a breakdown of how long the server had been running when MTBF calculated these statistics.
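
The Geometric Mean Between Failures entry describes picking the middle of 11 sorted measurements. The sketch below uses 11 made-up uptime values (in days, not taken from Brubeck's log) to show that this middle-value rule gives the median, and to compare it with the nth-root definition of a geometric mean. It says nothing about which of the two MTBF computes internally.

```python
import math
import statistics

# Hypothetical uptimes in days; 11 values so that the sixth is the middle one.
uptimes_in_days = [10, 1, 26, 3, 16, 2, 8, 15, 4, 12, 5]

# The rule from the text: sort, then take the sixth of the 11 measurements.
ordered = sorted(uptimes_in_days)
middle_value = ordered[len(ordered) // 2]

# The standard library's median gives the same result for an odd-sized sample.
median_value = statistics.median(uptimes_in_days)

# Strict geometric mean: the nth root of the product of n values,
# computed through logarithms to avoid a very large intermediate product.
log_sum = sum(math.log(value) for value in uptimes_in_days)
geometric_mean = math.exp(log_sum / len(uptimes_in_days))

arithmetic_mean = sum(uptimes_in_days) / len(uptimes_in_days)

print("middle (sixth) value:", middle_value)
print("median:", median_value)
print("geometric mean:", round(geometric_mean, 2))
print("arithmetic mean:", round(arithmetic_mean, 2))
```

For values that are not all equal, the geometric mean is always below the arithmetic mean, so it is less affected by a few very long uptimes than a plain average is.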

Viewing adjusted server statistics
The second section of charts in a Server Statistic document contains adjusted server statistics. These charts have the same rows and columns as the set of server statistics charts we just examined. The only difference is that if a Server Crash document is marked resolved, MTBF does not factor that crash into these statistics. For example, if there is a power failure and your server is not on a UPS, MTBF generates a Server Crash document. You could mark that crash as resolved, since it was explainable and not Notes-related.

At Iris, we use this section of charts after we fix a bug that caused multiple server crashes. We then mark all the Server Crash documents for those crashes as resolved. The next time MTBF generates statistics, it no longer takes the resolved crashes into consideration when calculating the adjusted server statistics. We can then get a good idea of our average uptime with the bug fixed. This lets us know how close a build is to a release. The following charts show adjusted statistics:

Start Date            End Date              # of Crashes  # of Shutdowns  # of Startups  %Time Up  %Time Down
03/18/97 11:25:20 AM  04/10/98 04:56:32 AM  15            34              46             93.49%    1.07%

                                  Days  Hours  Minutes  Seconds  From                  To
Average Time Up Between Failures  22    15     41       22       0                     0
Geometric Mean Between Failures   14    3      41       22       0                     0
Maximum Time Up                   36    23     55       45       07/23/97 04:22:11 PM  08/29/97 04:17:56 PM
Minimum Time Up                   0     0      0        43       02/09/98 11:30:59 AM  02/09/98 11:31:42 AM
Average Time Up                   7     17     5        9        0                     0
Total Time Up                     362   11     2        6        0                     0
Maximum Time Down                 0     23     20       50       07/18/97 12:05:20 PM  07/19/97 11:26:10 AM
Minimum Time Down                 0     0      0        6        02/09/98 11:30:53 AM  02/09/98 11:30:59 AM
Average Time Down                 0     2      1        47       0                     0
Total Time Down                   4     3      27       55       0                     0
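
To see how resolving crashes moves the averages, the sketch below recomputes Average Time Up Between Failures for both sets of charts. The divisor, number of crashes plus one, is not documented in this article; it is an assumption inferred only because it reproduces the sample figures. The function names are hypothetical.

```python
def to_seconds(days, hours, minutes, seconds):
    """Convert one Days/Hours/Minutes/Seconds row into a number of seconds."""
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


def to_dhms(total_seconds):
    """Split a number of seconds back into (days, hours, minutes, seconds)."""
    minutes, seconds = divmod(total_seconds, 60)
    hours, minutes = divmod(minutes, 60)
    days, hours = divmod(hours, 24)
    return days, hours, minutes, seconds


def average_up_between_failures(total_up_seconds, crashes):
    # ASSUMPTION: Total Time Up divided by (crashes + 1), fractions dropped.
    # This is inferred from the sample numbers, not taken from MTBF itself.
    return to_dhms(total_up_seconds // (crashes + 1))


# Unadjusted charts: Total Time Up 362d 8h 5m 18s, 17 crashes.
unadjusted = average_up_between_failures(to_seconds(362, 8, 5, 18), 17)

# Adjusted charts: Total Time Up 362d 11h 2m 6s, 15 crashes
# (two Server Crash documents are marked as fixed in the sample).
adjusted = average_up_between_failures(to_seconds(362, 11, 2, 6), 15)

print("unadjusted:", unadjusted)  # (20, 3, 6, 57)
print("adjusted:  ", adjusted)    # (22, 15, 41, 22)
```

Both tuples match the Average Time Up Between Failures rows shown above. Excluding two resolved crashes raises the average by roughly two and a half days, which is the kind of shift the adjusted charts are meant to expose.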

Viewing expanded crash information
The third section of charts in the Server Statistic document contains expanded crash information. It lists as many as 20 previous crashes and shows, for each one, the date you started the server and the date the server crashed. You can click the button in the Crash column to open the Server Crash document for that entry. You can click the button in the Fixed column to update the build and fixed status information for that entry.

Crash  Build     Days  Hours  Minutes  Seconds  From                  To                    Fixed
1      4.1.5     0     15     13       32       03/30/98 06:55:32 PM  03/31/98 10:09:04 AM  No
2      4.1.5     0     0      45       24       03/30/98 06:04:33 PM  03/30/98 06:49:57 PM  No
3      4.1.5     3     2      8        56       03/24/98 02:54:29 PM  03/27/98 05:03:25 PM  No
4      0         32    21     40       0        02/19/98 04:47:49 PM  03/24/98 02:27:49 PM  No
*5     0         *0    *0     *3       *19      02/09/98 11:27:34 AM  02/09/98 11:30:53 AM  No
6      0         16    19     42       56       01/23/98 03:44:38 PM  02/09/98 11:27:34 AM  No
*7     0         *0    *0     *41      *29      12/22/97 06:42:54 PM  12/22/97 07:24:23 PM  No
8      0         12    18     15       47       12/09/97 03:36:39 PM  12/22/97 09:52:26 AM  No
9      4.1.5     2     21     28       49       12/06/97 05:14:27 PM  12/09/97 02:43:16 PM  No
10     4.1.5     5     13     13       41       11/30/97 01:32:30 PM  12/06/97 02:46:11 AM  No
11     4.1.5     4     8      38       13       11/20/97 09:56:26 AM  11/24/97 06:34:39 PM  No
12     0         15    13     6        57       10/22/97 02:04:55 PM  11/07/97 02:11:52 AM  No
13     4.1.5     26    3      28       7        09/26/97 09:51:07 AM  10/22/97 01:19:14 PM  No
*14    4.1.4     *10   *20    *33      *27      06/23/97 09:49:17 AM  07/04/97 06:22:44 AM  No
15     4.1.4     1     22     6        5        06/21/97 10:50:27 AM  06/23/97 08:56:32 AM  Yes
16     4.1.4     15    20     32       21       06/05/97 10:13:53 AM  06/21/97 06:46:14 AM  Yes
17     V4.1.4_7  8     6      37       50       05/18/97 05:31:41 PM  05/27/97 12:09:31 AM  No
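
The Days, Hours, Minutes, and Seconds columns can be reproduced from the From and To timestamps. The sketch below is a minimal illustration, assuming the timestamps use a month/day/two-digit-year, 12-hour format and that no time zone information is available; `uptime_before_crash` is a hypothetical helper, not part of MTBF.

```python
from datetime import datetime

# Assumed layout of the From/To columns, e.g. "03/30/98 06:55:32 PM".
TIMESTAMP_FORMAT = "%m/%d/%y %I:%M:%S %p"


def uptime_before_crash(started, crashed):
    """Return (days, hours, minutes, seconds) between two table timestamps."""
    start = datetime.strptime(started, TIMESTAMP_FORMAT)
    end = datetime.strptime(crashed, TIMESTAMP_FORMAT)
    delta = end - start  # naive subtraction: no daylight-saving adjustment
    hours, remainder = divmod(delta.seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return delta.days, hours, minutes, seconds


# Rows 1 and 3 of the expanded crash information.
crash_1 = uptime_before_crash("03/30/98 06:55:32 PM", "03/31/98 10:09:04 AM")
crash_3 = uptime_before_crash("03/24/98 02:54:29 PM", "03/27/98 05:03:25 PM")

print(crash_1)  # (0, 15, 13, 32)
print(crash_3)  # (3, 2, 8, 56)
```

Because the subtraction is naive, it will not match every row. For row 12, which spans late October 1997, it yields one hour less than the table shows. That would be consistent with a daylight-saving change during the interval, though the article does not say how MTBF handles it.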