← Go back to Status of the Grex HPC system

Unplanned power outage in HPCC, Grex down

April 23, 2025 at 9:10 AM UTC

Compute nodes Login nodes Network Lustre /project NFS /home OpenOnDemand portal ComputeCanada CVMFS

Resolved after 3h 30m of downtime. April 23, 2025 at 12:40 PM UTC

Power to the HPCC Centre restored

Manitoba Hydro had restored power to Campus, and Grex is back online. All running and queued jobs were lost during the outage. We have used the opportunity to update SLURM scheduler to the current major version 24.11 . All Grex subsystems (compute, storage, login nodes and Web portal) are operational.

If you have questions or concerns, please do not hesitate to contact us at support@tech.alliancecan.ca , mentioning Grex in the subject line.

A power outage happened in HPCC Centre

A power outage in Grex’s datacentre happened , with a complete loss of power at about 9:10 AM Winnipeg time. The system is down. The reason for the outage is a problem at Manitoba Hydro, our electricity provider.

https://account.hydro.mb.ca/Portal/outeroutage.aspx

We are waiting for the power to be restored. Thank you for your patience!

If you have questions or concerns, please do not hesitate to contact us at support@tech.alliancecan.ca , mentioning Grex in the subject line.

Last updated: April 23, 2025 at 5:48 PM UTC