We also maintain a list of Known Product Issues separate from this site here.
Between April 3, 2023 4:55 PM PDT, and April 4, 2023 12:47 AM PDT, some users may have experienced difficulties while working in Box. During this time, customers may have experienced delays when using Box Relay. We would like to take this opportunity to provide an analysis of this issue and the corrective actions we have taken in response.
Analysis
The issue occurred due to one of our services in a specific region experiencing high latency when communicating with a backend cloud database. This caused the service in that region to run in a degraded state. We were able to resolve the issue by routing all of the upstream traffic to another region where the service was healthy. In addition, we are working with the cloud service provider to prevent similar issues from occurring in the future.
The existing logs and metrics we had during the incident did not reveal any indication as to what might have caused the degradation. So, following the recommendation from our cloud provider, we temporarily enabled extensive cloud logging for that region, diverted some traffic and monitored the service for degradation. The service was not degraded during this time. While the extensive logging has since been disabled, we have continued to closely monitor the service in that region and have not observed any anomalies or further signs of degradation. We will continue to monitor for any anomalies.
Corrective Actions
The following corrective actions have been completed or are planned:
Our existing metrics promptly alert the on-call engineer when such issues occur. We have additionally made the following changes to our runbook that the on-call follows:
We are continuously working to improve Box and want to make sure we are delivering the best product and user experience we can. We hope we have provided some clarity here and we would be happy to answer any questions you may still have regarding this matter.
Sincerely,
The Box Team