Oscar Upgrade to RHEL9 #62

Open
yangliu2009 opened this issue Jan 9, 2024 · 0 comments

In January, CCV will conduct scheduled Oscar maintenance to upgrade the operating system and the module system. To ease the transition, CCV has set up a mini cluster running the upgraded OS and module system so that users can test their programs and jobs.

  • Users can connect to the mini RHEL/9.2 cluster by running the command ssh -X login009 from one of the current login nodes (login007 or login008). For more details, refer to this page.

  • Users can drop in to this Zoom meeting on Mondays and Wednesdays between 3 and 4 pm, before winter break, for help with the new cluster.

  • We encourage users to test their programs and jobs on this cluster and to reach out to us ([email protected]) with any questions, issues, or feedback.

Please see below for details on the scheduled maintenance.

Maintenance Window:

  • Note: Below is the estimated window. The exact dates and times are still tentative.
  • Start: 1/9/2024 5:00 am EST
  • End: 1/12/2024 5:00 pm EST

Maintenance Description:

  • The OS will be upgraded from RHEL/7.2 to RHEL/9.2. For details, refer to this documentation page.

  • The number of CPU cores for priority-GPU accounts will be increased. See the documentation page linked above.

  • The Oscar module system will be migrated to Lmod. For a comprehensive list of old and new modules, refer to this documentation page.

Expected Impact:

  • All Oscar services will be unavailable during the downtime.

  • Jobs that cannot complete before the maintenance window begins will not start. For these jobs, myq and squeue will report the reason (ReqNodeNotAvail, Reserved for maintenance).
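
The condition in the second point can be sketched in a few lines of Python. This is a simplified illustration, not the scheduler's actual logic: it assumes a job is held whenever its start time plus its requested time limit would run past the start of the maintenance window. The function name `fits_before_maintenance` is hypothetical, and the start time is the tentative estimate listed under Maintenance Window.

```python
from datetime import datetime, timedelta, timezone

# EST is a fixed UTC-5 offset.
EST = timezone(timedelta(hours=-5))

# Estimated (still tentative) start of the maintenance window.
MAINTENANCE_START = datetime(2024, 1, 9, 5, 0, tzinfo=EST)


def fits_before_maintenance(start_time, time_limit, window_start=MAINTENANCE_START):
    """Return True if a job starting at start_time with the requested
    time limit would end no later than the start of the maintenance window."""
    return start_time + time_limit <= window_start


# A job with a 24-hour limit starting one day before the window just fits;
# the same job with a 48-hour limit would not.
one_day_before = MAINTENANCE_START - timedelta(days=1)
print(fits_before_maintenance(one_day_before, timedelta(hours=24)))
print(fits_before_maintenance(one_day_before, timedelta(hours=48)))
```

Under this assumption, requesting a shorter time limit is one way to let a short job run before the downtime.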

Instructions for Users:

  • Users will need to resubmit jobs after the maintenance.

  • Users will need to update their job submission scripts, because Oscar module names and versions will be different in the new module system.

  • Locally installed packages and environments may not work after the maintenance because of the OS upgrade. We recommend that users reinstall and test their installed packages on the RHEL/9.2 mini cluster before the downtime.
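
As a starting point for updating job scripts, the following sketch lists the modules a script requests so they can be checked against the list of old and new modules. It is a minimal illustration, not a CCV tool: the helper name `modules_loaded` is hypothetical, the module names in the sample are placeholders, and it only recognizes plain `module load` lines.

```python
import re

# Matches lines such as "module load name/version other/version  # comment".
MODULE_LINE = re.compile(r"^\s*module\s+load\s+(.+?)\s*(?:#.*)?$")


def modules_loaded(script_text):
    """Return the module names requested by `module load` lines, in order."""
    found = []
    for line in script_text.splitlines():
        match = MODULE_LINE.match(line)
        if match:
            # One `module load` line may name several modules.
            found.extend(match.group(1).split())
    return found


# Placeholder job script; the module names are made up for illustration.
sample = """#!/bin/bash
#SBATCH -t 01:00:00
module load example/1.0 other/2.0  # hypothetical names
module load third/3.1
./run.sh
"""
print(modules_loaded(sample))  # ['example/1.0', 'other/2.0', 'third/3.1']
```

Each name in the output is a candidate for renaming in the new module system.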
