is a Python package that makes it easy for developers to create machine learning apps powered by llama.cpp models using Gradio. You'll need a GGUF model file for llama.cpp. The easiest way is to use a ...