The “Request Count by Status” and “Request Count by API x Status” charts in the “Monitoring → Datastore” category do not properly report all datastore errors.
The graph by status reports only 3 “non-ok” requests and the graph by API x Status reports 19, while the error report shows a little over 1300 over the same 4-week period.
In my opinion it doesn’t have to be one or the other, though. There could be a dropdown that lets you select whether the values are averaged or summed, so people can choose the way they prefer (I’m sure some are used to the average and would like to keep it).
We added the ability to zoom in and view the metric at a minutely level, so you can see the exact count of errors on the charts. This is releasing later today.
We will also think about aggregation rules a bit more.
Great quick release! I’ll definitely try to check at the hourly level, but this really doesn’t give much margin to see whether it works properly or not. I’ll try to update this post if I happen to stumble on an hour range that contained internal errors.
But I still can’t consider this fixed when the issue still happens:
You can see that, even allowing for averaging, the numbers don’t make any sense. The chart correctly reported 2 internal errors at 5PM but, somehow, none of the 11 at 5AM went through, not a single one.
I 100% believe something is still broken, either with average or something else.
I agree with you; some estimation is done when calculating the average. I will bring it to the team. In the meantime, please drill down to minutely granularity to see the exact counts.
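To illustrate how an averaged chart can hide errors that a summed chart would show, here is a minimal Python sketch. It is only an assumption about the general mechanism, not the dashboard's actual aggregation logic: the `aggregate` helper, the 60-minute bucket, and the sample data (11 errors spread across one hour) are all hypothetical.

```python
def aggregate(per_minute_counts, bucket_size, how):
    """Group per-minute counts into fixed-size buckets, then sum or average each bucket."""
    buckets = [
        per_minute_counts[i:i + bucket_size]
        for i in range(0, len(per_minute_counts), bucket_size)
    ]
    if how == "sum":
        return [sum(b) for b in buckets]
    if how == "avg":
        return [sum(b) / len(b) for b in buckets]
    raise ValueError(f"unknown aggregation: {how}")


# Hypothetical hour of data: one error every 5 minutes, 11 errors in total.
minute_counts = [0] * 60
for minute in range(0, 55, 5):
    minute_counts[minute] = 1

hourly_sum = aggregate(minute_counts, 60, "sum")   # total errors in the hour
hourly_avg = aggregate(minute_counts, 60, "avg")   # mean errors per minute
rounded_avg = [round(v) for v in hourly_avg]       # what an integer display would show

print(hourly_sum, hourly_avg, rounded_avg)
```

With this sample data the summed bucket keeps all 11 errors, while the averaged bucket is well below 1 and rounds to 0 if it is displayed as a whole number. That would be one possible reason sparse errors disappear at coarser granularity, though it does not by itself explain why 2 errors in one hour showed up and 11 in another did not.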
Sorry, but shouldn’t they show the same data? It seems kind of weird that some errors in the error dashboard are not reported in the datastore dashboard. If anything, I would expect it the other way around: the datastore dashboard should be more precise and more accurate than the error reports when it comes to datastore errors, or at the very least the same.
From my perspective this is misleading for any creator looking at the datastore dashboard, and would be even more misleading for Roblox engineers tracking datastore error rates, since it would show fewer errors than are actually happening.
I understand that they’re 2 different systems, but I just don’t understand how the difference can happen?