|
||||||||||
|
||||||||||
10 Jul 2013, 21:38 (Ref:3276331) | #1 | ||
Admin
Veteran
Join Date: Mar 2002
Posts: 12,063
|
Major Outage 10-July-2013
Today Tenths suffered its longest unplanned outage ever.
At the moment I've not attempted a root cause analysis as the focus has been on getting Tenths back and running. At approximately 2:30pm we experienced a major server failure that could only be resolved by re-booting the server. This caused a large database corruption and also a software corruption. Unfortunately as I was in a meeting for the "day job" (yes, Tenths is a hobby for me too!) and was unable to give the problem my full attention. Data was restored from the backup replication server however re-starting the forum software also caused the backup to corrupt. Using the replication server as the main database server ALSO caused the backup to corrupt. At approximately 7pm the decision was made to abandon attempting to repair the primary server and switch to our secondary server in the US. This involved shortening the DNS cache time from 1 hour to 30 minutes in order to allow a quicker change over when the site was brought back online. This migration work also involved checking and configuring the secondary server to act as "primary". This work was completed around 8:30pm and a further hour was spent testing functionality from here at Ten-Tenths Towers! DNS migration took place at approximately 10pm and by the time of this post we are seeing traffic levels increase as the DNS change propagates. |
||
|
10 Jul 2013, 22:25 (Ref:3276349) | #2 | ||
#WhatAreHashTags
Veteran
Join Date: Oct 2003
Posts: 2,526
|
Having had to manage similar problems before I retired, I feel your pain! Well done
|
||
__________________
John Smith Clerk of the Course and MSA Steward Race Director for 360MRC |
10 Jul 2013, 22:43 (Ref:3276355) | #3 | ||
Veteran
Join Date: Oct 2009
Posts: 10,703
|
Great job! I'm happy that we're back on track
|
||
__________________
Nitropteron - Fly fast or get crushed! by NaBUrean Prodooktionz naburu38.itch.io |
11 Jul 2013, 08:43 (Ref:3276490) | #4 | ||
Veteran
Join Date: Dec 2002
Posts: 3,364
|
Thanks to Grant and Terri. Sounds like your disaster planning has worked well.
Regards Jim |
||
__________________
Life is not safe, just choose where you want to take the risks. |
12 Jul 2013, 07:00 (Ref:3276890) | #5 | ||
Veteran
Join Date: Nov 2006
Posts: 9,446
|
Come off it Grant we all know it's just a case off flicking a swich !!!
|
||
__________________
Balls of steel (knob of butter) They're Asking For Larkins. ( Proper beer) not you're Eurofizz crap. Hace más calor en España. Me han conocido a hablar un montón cojones! Send any cheques and cash to PO box 1 Lagos Nigeria Africa ! |
12 Jul 2013, 14:14 (Ref:3277000) | #6 | ||
Veteran
Join Date: Aug 2004
Posts: 3,834
|
Nice to have you back.
Have you traced what was causing ALL the backups to fail??? |
||
__________________
Tim Yorath Ecurie Llanfairpwllgwyngyllgogerychwyrndrobwllllantysiliogogogoch Fan of "the sacred monster Christophe Bouchut"... |
12 Jul 2013, 14:57 (Ref:3277016) | #7 | |
Veteran
Join Date: Feb 2010
Posts: 602
|
Thank you for all that you do. 10-Tenths is far and away the best motorsport forum.
|
|
|
12 Jul 2013, 14:58 (Ref:3277017) | #8 | ||
Admin
Veteran
Join Date: Mar 2002
Posts: 12,063
|
Backups were all good, what was happening was that when a copy from the "replication" site (which is the live/realtime backup) was loaded on to the primary server it would be pretty instantly corrupted. The same data was fine on the replication server. Pointing the primary web server to use the replication server also caused the data to be corrupted, so it was at that point the decision was made to switch the whole site to run on the secondary server.
|
||
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Pickup Truck Racing at Lydden 13th -14th July 2013 | Stephen H | National & Club Racing | 3 | 9 Jul 2013 19:17 |
AMOC at Brands Hatch Grand Prix circuit - July 2013 | Brian A | Historic Racing Today | 18 | 9 Nov 2012 17:48 |