← Go back to Status of the Grex HPC system

Unplanned SLURM outage due to a problematic update

January 21, 2025 at 11:00 AM UTC

Compute nodes

Resolved after 5h 5m of downtime January 21, 2025 at 4:05 PM UTC

SLURM scheduler update done

We have finished updating SLURM scheduler on Grex. Running jobs should not have been affected, but couple of new jobs have failed to start and need to be resubmitted. Sorry about the inconvenience!

In case you experience problems with the updated schedler , please do not hesitate to contact support@tech.alliancecan.ca, mentioning Grex in the subject line.

SLURM scheduler outage

Due to a glitch during rolling SLURM scheduler update, new compute jobs are failing to start on Grex. We are working on fixing the issue. Thank you for your patience!

Last updated: January 21, 2025 at 4:07 PM UTC