The Inference Reckoning: Why Tech Firms are Leaving Pure Public Cloud for Hybrid AI Architectures