Datacenter cooling failure for CHI@UC

Resolved
Posted by Michael Sherman on August 01, 2022
Outage start: Monday, August 01, 2022 4:34 p.m.
Expected end: Monday, August 01, 2022 9 p.m.

Update 9 p.m.: The chillers have been repaired, and we've received the all-clear from ANL staff. CHI@UC is now back to normal operation.


The datacenter hosting CHI@UC has experienced a failure in its cooling system. To reduce load on the remaining cooling capacity, we are blocking new node reservations at UC until the failure is resolved (time to be determined).

If your experiments can tolerate the interruption, we recommend that you snapshot your nodes and power them off.

Although temperatures have stabilized for the moment, if they start to climb again we may need to hard power off nodes for safety. Snapshotting and powering off your nodes yourself will ensure they are left in a consistent state.

We'll keep you posted.