The main goal of llama.cpp is to enable LLM inference with minimal setup and state-of-the-art performance on a wide variety of hardware - locally and in the cloud.