A small fraction of queries that are slow, expensive or cold-started will drive most of the user-facing latency that matters.
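To make the arithmetic behind that claim concrete, here is a minimal sketch using an invented workload: 1,000 queries, of which 2% are slow. The latency figures and the `percentile` helper are assumptions for illustration only, not measurements from any real system.

```python
def percentile(sorted_values, p):
    """Nearest-rank percentile of an already sorted list (p is an integer 1..100)."""
    n = len(sorted_values)
    rank = (p * n + 99) // 100  # integer ceiling of p * n / 100
    return sorted_values[max(rank, 1) - 1]

# Hypothetical workload: 980 fast queries at 50 ms, 20 slow ones at 2000 ms.
FAST_MS, SLOW_MS = 50, 2000
latencies_ms = sorted([FAST_MS] * 980 + [SLOW_MS] * 20)

p50 = percentile(latencies_ms, 50)   # what the typical query sees
p99 = percentile(latencies_ms, 99)   # what the unlucky query sees
slow_share = (20 * SLOW_MS) / sum(latencies_ms)  # share of total time spent in slow queries

print(f"p50 = {p50} ms, p99 = {p99} ms, slow share of total time = {slow_share:.1%}")
```

Under these assumed numbers the median stays at the fast-path latency while the 99th percentile is set entirely by the slow 2%, which also accounts for a disproportionate share of total query time.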
Real software is not a collection of separate front-end, back-end, and infrastructure components; the pieces must work together seamlessly.