Handle many platform-related compute failures gracefully
Created by: malomarrec
Problem
Customers running large batch changes will have platform-related workspace execution failure (the larger the batch change, the higher the chance this happens). When that happens, the customer needs to spend time hitting "retry" for all failed workspaces. This is time consuming and slows down iterating on large batch changes.