Partial outage: Shared Computing Cluster batch job, BU Works test environment – TechWeb : Blog Archive – Boston University

Incident Discovery Time: 04/12/2022, 03:04pm

Services affected: Server Infrastructure

Description of Impact

The Massachusetts Green High Performance Computing Center (MGHPCC) overheated briefly today. A few servers were shut down.

Current Status

SCC compute nodes are back online, and all clients have been notified. Our data center operations team will visit Holyoke to investigate the other servers.

Next Update: 07:30pm

Previous Update

Incident Discovery Time: 03:04pm, 04/12/2022

Services affected: Server Infrastructure

Description of Impact

The Massachusetts Green High Performance Computing Center (MGHPCC) overheated briefly today. A few servers were shut down.

Current Status

IS&T teams have remediated the cooling problem and are now waiting for temperatures to drop enough to bring servers back online.

Additional Information

Batch computing jobs on the Shared Computing Cluster and test environments for BU Works Basic were affected. IS&T teams continue to assess the impact and scope.

Next Update: 07:30pm

Previous Update

Incident Discovery Time: 03:04 PM on 04/12/2022

Services affected: Shared Computing Cluster batch jobs, BU Works testing environment

Description of Impact

The Massachusetts Green High Performance Computing Center has experienced an air conditioning problem. The room became so hot that servers were forced to shut down. SCC login nodes and the filesystem were not affected. However, some batch jobs or BU Works services might be unavailable.

Current Status

IS&T teams are investigating and working to bring servers back online.

Next Update: 5:30pm.
