RWTH High Performance Computing (HPC)

Mehr Informationen zu dem Service finden Sie in unserem Dokumentationsportal.

Accounting data since 06.01.2026 Missing

Teilstörung
Dienstag, 06.01.2026 13:56 - Unbekannt

As part of the accounting data fixes, we are regenerating the missing accounting data since 06.01.2026

15.01.2026 13:56

Störung jupyterhub

Teilstörung
Donnerstag, 08.01.2026 10:55 - Donnerstag, 08.01.2026 15:22

Aufgrund von Slurm-Problemen ist jupyterhub aktuell nicht erreichbar. An einer Lösung wird gearbeitet.

08.01.2026 11:00

Slurm is not sending emails.

Teilstörung
Mittwoch, 07.01.2026 17:06 - Freitag, 09.01.2026 10:16

We are encountering an issue where Slurm does not send emails at the end of jobs. We are working on workarounds.

08.01.2026 17:07
Updates
Issues with IPV6 and sendmail were fixed. Emails should be sending correctly.
09.01.2026 10:17

Coordinator roles in Slurm missing

Teilstörung
Montag, 05.01.2026 20:21 - Dienstag, 13.01.2026 20:21

Coordinator roles in Slurm are currently missing and we are working on restoring them.

13.01.2026 20:21

Slurm Job Submission failing

Teilstörung
Montag, 15.12.2025 18:56 - Montag, 12.01.2026 15:46

We are aware if issues starting Slurm jobs on certain accounts and sacct. We are working on a solution.

15.12.2025 18:56

Correction of Erroneous Job Billings

Teilstörung
Freitag, 12.12.2025 06:00 - Montag, 12.01.2026 15:37

A configuration error was made in the job accounting of the HPC cluster on December 12th, which resulted in incorrect billing values being generated. This issue has been resolved as of December 30th, and all jobs submitted from this point onward will be billed correctly. We are currently working on correcting the erroneous billing for affected jobs.

30.12.2025 12:49
Updates
Values have been corrected. Changes should be visible in short
12.01.2026 15:37

Full Downtime Maintenance

Wartung
Dienstag, 06.01.2026 08:00 - Mittwoch, 07.01.2026 17:00

During the maintenance we will upgrade the Slurm scheduler to the newest version.
We will also work on resolving recent issues with sacct and Slurm Accounts not being available at Job submission.
Finally, we will setup a tuned and improved job priority configuration for the submission queues, with the goal to improve QoS based on user
feedback we have received.

To achieve this, all jobs still pending in the queue by the time the maintenance starts will be removed by us. These will have to be resubmitted by users. We ask users to please take measures to track and resubmit the jobs if there is a need for it.

18.12.2025 13:56
Updates
The Upgrade of Slurm and required accounting fixes is taking longer than expected and will extend the Maintenance period by one extra day.
06.01.2026 16:42