What is Inference? Inference is the process in which a trained (or fine-tuned) LLM makes a prediction for a given input. EOS: end of sequence Previous Benchmarking LLMs Next Using an LLM