Incident Report: 20200511: Difference between revisions

No edit summary
 
(3 intermediate revisions by 2 users not shown)
Line 4: Line 4:
   |impact=High (Dead air for around 45 minutes)
   |impact=High (Dead air for around 45 minutes)
   |start=2020-05-11 05:00
   |start=2020-05-11 05:00
   |end=202-05-11 05:46
   |end=2020-05-11 05:46
   |mitigation=Reduce dependency on uplink
   |mitigation=Reduce dependency on uplink
   |leader=Connor Sanders (CS)
   |leader=Connor Sanders (CS)
   |others=Isaac Lowe (IL)
   |others=Isaac Lowe (IL), Marks Polakovs (MP)
}}
}}


Line 22: Line 22:
* Reduce dependency on upstream services
* Reduce dependency on upstream services
:* Investigate a local caching DNS resolver?
:* Investigate a local caching DNS resolver?
* Ask ITS kindly to tell us when they take down our campus uplink
:* '''MP - done-ish, running unbound on uryfw0 and many (but not all boxes use it)'''
* Ask ITS kindly to make it reboot at xx:30 instead of xx:00
* Ask ITS nicely to tell us when they take down our campus uplink
:* '''MP - done'''
* Ask ITS nicely to make it reboot at xx:30 instead of xx:00
:* '''MP - done'''
* Improve documentation and logging of the new WebStudio services, to make future troubleshooting easier
* Improve documentation and logging of the new WebStudio services, to make future troubleshooting easier
* Figure out why Dearie-Me didn't fire - possibly needs a recalibrate
* Figure out why Dearie-Me didn't fire - possibly needs a recalibrate
Line 32: Line 35:
                   HH:MM:SS
                   HH:MM:SS
   Dead air start:  05:02:06.500
   Dead air start:  05:02:06.500
   Dead air ends:   05:45:42.000
   Dead air end:   05:45:42.000
   TOTAL:          00:43:35.500
   TOTAL:          00:43:35.500


[[Category:Incident Reports]]
[[Category:Incident Reports]]