Efficiently Serving LLMs