On Friday, 4/23/21 at 8:00 AM, two of the ten servers that host the ls15 Lustre scratch filesystem will be rebooted. This reboot will return these servers to a fault-tolerant state where one server can take over for the other in the event of a hardware failure. This reboot may cause a momentary hang-up for jobs accessing files on the ls15 scratch filesystem. If you have any questions, please contact us at https://contact.icer.msu.edu/.
We will be performing rolling reboots of gateways and development nodes during the week of April 12th. These reboots are required to update the client side of our high performance file system. Reboots will occur overnight and servers are expected to be back online before morning. Servers will be rebooted according to the following schedule:
April 12th at 4:00 AM: gateway-00, gateway-03
April 13th at 4:00 AM: globus-02, rdpgw-01, dev-intel14, dev-intel14-k20
April 14th at 4:00 AM: openondemand-00, dev-intel16, dev-intel16-k80
April 15th at 4:00 AM: dev-amd20, dev-amd20-v100
Dev-intel18, gateway-01, and gateway-02 are already updated and do not require a reboot. If you have any questions, please contact us at https://contact.icer.msu.edu.