I appreciate the move to incident.io and the faster update times when there is an outage, but the old status page had some really useful charts for debugging issues, particularly for tracking when latency spikes occurred that could explain failed API calls and other weird user experiences.
@fede.bubble or others, is there any plan to bring those charts back for the new status page?
Thanks for raising this. I’m definitely tracking it and looking into ways to expose the metrics again. The tricky part is that there’s no quick solution, so it takes engineering resources.
We previously showed 4 metrics (listed below with their sources):
“Externally measured latency” - Pingdom
“Median end-to-end page load” - custom metric on AppOptics
“Median system api latency” - custom metric on AppOptics
“Successful system api requests” - custom metric on AppOptics
The custom metrics were stored in AppOptics, which you can see is EOL this month here. That required us to switch to a different observability vendor (Observe, which is actually much better for our use case).
So the custom metrics on the old statuspage would have broken anyway, even if we hadn’t switched to the new statuspage.
We are investigating whether there’s an easy way to expose the Pingdom metric while we think about a more custom solution for the other metrics.
FYI, right now is one of those times when it would be helpful to see the external latency. I’m getting elevated errors on inbound APIs due to application timeouts. I suspect latency spikes are the cause, but it’s hard to diagnose without those charts.