Categories
Artificial Intelligence Large Language Model (LLM)

LLM VRAM usage with llama.cpp

While trying to get some of Qwen’s latest models up and running on my AMD iGPU I encountered some crashes. The errors were misleading, but in the end it turned out to be out of memory errors, so I started to think about how much memory different components of a LLM use. The data in […]

Categories
Artificial Intelligence Container Docker GPU Graphics Card Large Language Model (LLM) Linux Ubuntu

Preparing ComfyUI (for open-webui usage)

While experimenting with open-webui I was looking for options to use local LLM resources for image creation. Besides the commercial models, openwebui offers two alternatives : Automatic1111 and ComfyUI. As ComfyUI is mentioned in several other places I decided to have a look at it. Installation In order to install ComfyUI I was looking for […]