Multi-GPU LLM Inference