

I understand the failure rates for the GPUs is huge. The duty cycles tend to be high, with power and cooling issues.
(Actually the power issues are wild and can destroy power distribution and generation equipment. “Power Stabilization for AI Training Datacenters” 21 Aug 2025 https://arxiv.org/pdf/2508.14318)


Me too!
And I quickly found out why everyone else had left. (The job after was good though.)