Avoiding downtime
Early this morning, Goplan was down. In fact, this was the second time Goplan was down this month, tarnishing the otherwise impeccable uptime record we were on. We’ve been down a total of 9 hours since January 1st 2012, which results in 99.985% uptime for this year, all because of these two individual recent incidents - we can (and want to) do better.
We’re never happy when downtime happens, and every time we experience an incident such as today we take measures to make sure it doesn’t happen again. The biggest lesson from today was that we need to respond quicker to these incidents. So we improved our alerting and auto-scaling systems to be more aggressive: we’ll know sooner that something’s up, and we’ll consequently fix it sooner too. We want to be transparent about this too: we setup a public page at Pingdom where you can keep an eye on our uptime as much as we do.
We feel your pain whenever downtime occurs. We’ll keep fighting to keep those 99.98% back to 100. We’re sorry about today. Thanks for being with us through this.