The new home storage subsystem will be down for ~1 hour for a software upgrade. We hope to have it back in production later this morning.
As always, we apologize for the trouble -- our goal is to make OSCER resources better!
---
Henry Neeman ([log in to unmask]) Director, OU Supercomputing Center for Education & Research (OSCER) Associate Professor, Gallogly College of Engineering Adjunct Associate Professor, School of Computer Science OU Information Technology The University of Oklahoma
the upgrade is complete, and the system looks stable again, and better performant than before.
Please let us know if you are still experiencing any issues.
Thanks,
Horst
On 5/16/25 9:04 AM, Neeman, Henry J. wrote: > > > OSCER users, > > > The new home storage subsystem will be down for ~1 hour > for a software upgrade. We hope to have it back in production > later this morning. > > As always, we apologize for the trouble -- our goal is to make > OSCER resources better! > > --- > > Henry Neeman
OSCER Users, The scheduled maintenance has now completed. If you should find an issue with any OSCER system, please email [log in to unmask]<mailto:[log in to unmask]>.
Thanks!
Dave
From: "Akin, David S." <[log in to unmask]> Date: Wednesday, May 14, 2025 at 8:06 AM To: OSCER users <[log in to unmask]> Cc: "[log in to unmask]" <[log in to unmask]> Subject: Our scheduled maintenance is now starting
OSCER users,
Our scheduled maintenance today is starting. We should be back by midnight CST. This will affect the following systems:
OSCER scheduled maintenance outage Wed May 14 8am-midnight
This coming Wednesday (May 14) 8:00am-midnight, OSCER will hold a scheduled maintenance outage, for the following (in priority order):
(1) OURdisk: Upgrade the software to apply an important bug fix.
(2) Supercomputer: Increase the reliability of one of our /home storage subsystems.
(3) OneOklahoma Friction Free Network and supercomputer network: Upgrade Ethernet switches' operating system software, to improve network reliability for all OSCER systems.
All OSCER systems will be down this coming Wed (May 14) 8:00am-midnight Central Time for scheduled maintenance.
*IMPORTANT IMPORTANT IMPORTANT IMPORTANT!!!*
Before the scheduled maintenance outage starts this coming Wed (May 14) at 8:00am:
Jobs that wouldn't finish before the scheduled maintenance outage starts won't be able to start at all, until after the scheduled maintenance outage has ended (planned for Wed May 14 midnight CT).
OSCER users, Our Ceph storage cluster called OURdisk#1 is down for now. We'll come back and relay any updates as they become available. Thanks for your patience.
OURdisk #1 is available again. Several users on Schooner's login nodes were able to run processes that denied access to OURdisk on the login nodes themselves. We ask for your cooperation in running computing processes on a compute node only. Which will keep the supercomputer available for all research teams. We have a rule ...