For more information about our Incident Response and Communications, please read this support article.

We also maintain a list of Known Product Issues separate from this site here.

[Minor] Issues with Multiple Box Services

Incident Report for Box

Postmortem

We recently addressed issues affecting the Box webapp and public API. We would like to take this opportunity to explain these issues further, along with the steps we have taken to keep them from happening in the future.

Between 4:29 PM PDT and 7:14 PM PDT on April 13, 2025, some users may have experienced slowness or occasional errors when interacting with some features in the Box webapp or public API, including Logins, Uploads/Downloads, and Notes. The issue occurred as a result of CPU performance degradation in multiple instances of our relational data access service in a single availability zone. We resolved the issue by performing a rolling restart of the affected instances. In addition, we are working to improve our remediation processes for cases where a single availability zone is affected, in order to prevent similar issues from occurring in the future.

Analysis

During the incident, we detected that several instances of our relational data access service in a single availability zone were experiencing higher-than-expected latencies. Because Box webapp and public API requests depend on this service, the additional latency affected both the Box webapp and the public API. We resolved the issue by performing a rolling restart of the affected instances. However, the rolling restart took longer than desired, and latencies were affected for its full duration. To remediate a similar issue more quickly in the future, we have identified corrective actions that use tooling to divert traffic away from an impacted availability zone.
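
For illustration only, the sketch below shows why a rolling restart can take a long time: instances are restarted in sequential batches so that most capacity stays online, and the total duration grows with the number of batches. The function names, the `max_unavailable` parameter, and the per-batch restart time are hypothetical and do not describe Box's actual tooling.

```python
from typing import List


def plan_rolling_restart(instances: List[str], max_unavailable: int) -> List[List[str]]:
    """Split instances into consecutive batches of at most max_unavailable.

    Hypothetical helper: each batch is restarted only after the previous
    batch is healthy again, so at most max_unavailable instances are
    offline at any moment.
    """
    if max_unavailable < 1:
        raise ValueError("max_unavailable must be at least 1")
    return [
        instances[i:i + max_unavailable]
        for i in range(0, len(instances), max_unavailable)
    ]


def estimated_duration_s(num_instances: int, max_unavailable: int, restart_s: float) -> float:
    """Estimate total restart time, assuming batches run one after another
    and every batch takes restart_s seconds (an assumed, uniform figure)."""
    if max_unavailable < 1:
        raise ValueError("max_unavailable must be at least 1")
    batches = -(-num_instances // max_unavailable)  # ceiling division
    return batches * restart_s
```

Under these assumptions, larger batches shorten the restart but take more capacity offline at once, which is one reason diverting traffic away from the affected zone first can be the faster remedy.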

Corrective Actions

Box has initiated the following corrective actions:

  • Improve observability into issues that affect a single availability zone
  • Improve processes for using tooling to safely and quickly divert traffic away from an impacted availability zone
  • Decrease the time required to restart relational data access service instances
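
As a rough illustration of the first two actions, the sketch below flags an availability zone whose median latency is well above that of its peers, then assigns routing weights that send no traffic to flagged zones. The zone names, the `factor` threshold, and both functions are assumptions made for this example; they are not Box's observability or traffic-management tooling.

```python
from statistics import median
from typing import Dict, List


def impacted_zones(latency_ms: Dict[str, List[float]], factor: float = 2.0) -> List[str]:
    """Return zones whose median latency exceeds factor times the median
    of the other zones' medians (a simple, assumed heuristic)."""
    zone_medians = {zone: median(samples) for zone, samples in latency_ms.items() if samples}
    flagged = []
    for zone, value in zone_medians.items():
        others = [m for z, m in zone_medians.items() if z != zone]
        if others and value > factor * median(others):
            flagged.append(zone)
    return sorted(flagged)


def traffic_weights(zones: List[str], impacted: List[str]) -> Dict[str, float]:
    """Spread traffic evenly across healthy zones and give impacted zones 0."""
    healthy = [z for z in zones if z not in impacted]
    if not healthy:
        # Safety guard: never divert traffic away from every zone at once.
        return {z: 1.0 / len(zones) for z in zones}
    return {z: (1.0 / len(healthy) if z in healthy else 0.0) for z in zones}


samples = {
    "az-a": [10.0, 12.0, 11.0],
    "az-b": [11.0, 13.0, 12.0],
    "az-c": [40.0, 55.0, 60.0],  # hypothetical degraded zone
}
flagged = impacted_zones(samples)
weights = traffic_weights(sorted(samples), flagged)
```

With the sample data above, only `az-c` is flagged, and traffic is split evenly between `az-a` and `az-b`. Comparing each zone against its peers, rather than against a fixed threshold, is one way to make a single-zone problem stand out even when overall latency shifts.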

We are continuously working to improve Box and want to make sure we are delivering the best product and user experience we can. We hope we have provided some clarity here, and we would be happy to answer any questions you may still have regarding this matter.

Sincerely,

The Box Team

Posted Apr 23, 2025 - 17:18 PDT

Resolved

This incident has been resolved.
Posted Apr 13, 2025 - 19:30 PDT

Update

After further monitoring, this incident is now considered resolved. All services have been restored to full functionality. If you continue to experience any issues, please contact Box Support at https://support.box.com.
Posted Apr 13, 2025 - 19:30 PDT

Monitoring

A fix has been implemented and we are monitoring the results.
Posted Apr 13, 2025 - 19:11 PDT

Update

We are continuing to work on a fix for this issue.
Posted Apr 13, 2025 - 18:38 PDT

Identified

The issue has been identified and a fix is being implemented.
Posted Apr 13, 2025 - 17:43 PDT

Investigating

We are investigating an ongoing issue affecting the Box API, uploads, downloads, logins, and Box Notes. We will provide more information as soon as it is available.
Posted Apr 13, 2025 - 17:24 PDT

This incident affected: Box Platform / API (Content API, Uploads/Downloads), Box Web Application (Login/SSO, Uploads/Downloads), and Box Notes (Web Application).