OSCER users


Henry Neeman
Mon, 24 Apr 2023 19:22:04 -0500
OSCER users,


Unfortunately, OURdisk remains offline, and will still be
down at least through tomorrow (Tue Apr 25).

Our team worked all day today, including extensively with
our external Ceph expert consultant.

Although our team took all the recommended steps, we've
continued to encounter an error that is preventing
OURdisk from returning to service.

For tomorrow, we have three separate plans that we're
going to work on, and we're hoping that at least one of
them provides a better outcome.

Note that the problem is in the Ceph monitor subsystems,
*NOT* in user data storage, so we aren't expecting any
loss of files.

We're very cognizant of how disruptive this outage is to
mission-critical research progress, and we apologize for
this situation.



On Mon, 24 Apr 2023, Henry Neeman wrote:

>OSCER users,
>
>We hope to have OURdisk in production later today. Our team
>has been working on it since yesterday morning.
>
>We think we've isolated the issue to the network driver layer,
>and we're collaborating closely with our external Ceph expert
>consultant to resolve the issue.
>
>We also hope to have a more permanent fix in place soon,
>most likely during the next maintenance outage, which is
>likely to be mid-May, in order to avoid dissertation and
>thesis deadlines.
>
>But, we can't know for sure until we've returned OURdisk to
>
>Again, we apologize for the trouble.
>
>On Sun, 23 Apr 2023, Henry Neeman wrote:
>
>OSCER users,
>
>OURdisk is down. Our team is working to return it to service,
>but it's most likely that'll take until tomorrow (Mon Apr 24).
>
>We apologize for the trouble.
>Henry Neeman
>Director, OU Supercomputing Center for Education & Research (OSCER)
>Associate Professor, Gallogly College of Engineering
>Adjunct Associate Professor, School of Computer Science
>OU Information Technology
>The University of Oklahoma
>Engineering Lab 212, 200 Felgar St, Norman OK 73019
>405-325-5386 (office), 405-325-5486 (fax), 405-245-3823 (cell)