WHAT HAPPENED?
Twenty-one research VMs lost network connectivity. The VMs themselves stayed up, and any processes that did not depend on outside connectivity kept running.
Affected servers (VMs):
chaos
chexmix
chipc-slurmctl
chipc-login
claws
compbio
killerbee3
Maestro-minion2
Maestro-minion3
marvin
mrsnp
optimus
rec
rg-fpga-dev-3
rg-login.crnch
sat1
sat3
slurm-db
sqlcheck
tda
theadvisor
WHEN DID IT HAPPEN?
March 3, 2020, from approximately 5:00 PM to 7:30 PM
WHY DID IT HAPPEN?
While network ports were being configured for new servers joining the VM cluster, the link between the pair of switches that provide networking to the cluster was mistakenly disabled. Once the configuration error was corrected, networking was restored to the affected VMs.
WHO WAS AFFECTED?
Users trying to connect to the research VMs listed above during the outage window found their servers unreachable.
WHAT DO YOU NEED TO DO?
No user action is required.
WHO SHOULD YOU CONTACT FOR QUESTIONS?
Feel free to contact the TSO Help Desk (CCB 212, 404.894.7065, helpdesk@cc.gatech.edu).