Accelerating AI Service Delivery with a Single LLM Endpoint