Selecting GPUs and compute backends #428
Comments
Hi, the backend for Alpaca is Ollama. As far as I know, Ollama is only compatible with ROCm and CUDA (no Vulkan / OpenCL). Alpaca does provide some options in the preferences dialog to modify which GPU is being used, for example CUDA_VISIBLE_DEVICES and HIP_VISIBLE_DEVICES.
I just noticed Ollama doesn't use HIP_VISIBLE_DEVICES anymore; I'm going to fix that.
Alright, that's fixed. I could make a GPU / CPU selector in the preferences dialog, but it could take a while to get it working correctly. For now, using those overrides is the best way of configuring Ollama.
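For reference, here is a minimal sketch of what using those overrides can look like if you launch the Ollama server yourself; the device index is just an example, and Alpaca's preferences dialog sets the same variables for you:

```python
import os
import subprocess

# Sketch only: restrict a manually launched Ollama server to a single GPU by
# setting the visibility override before starting it. "0" is just an example
# index; check your own device ordering first.
env = os.environ.copy()
env["CUDA_VISIBLE_DEVICES"] = "0"   # NVIDIA; use HIP_VISIBLE_DEVICES for AMD/ROCm
subprocess.Popen(["ollama", "serve"], env=env)
```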
I haven't used the program for a while, but I started using it again about a week ago and noticed it doesn't use my Nvidia RTX mobile Max-Q GPU anymore. I have tried everything the guide told me to. I have CUDA installed and it works for my other ollama-cuda setup. We need a better way to diagnose problems when it fails to utilize the GPU.
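Not a fix, but a rough way to check where a model actually landed, assuming the `ollama` CLI (and `nvidia-smi` on NVIDIA systems) is on your PATH:

```python
import subprocess

# `ollama ps` lists loaded models and reports whether they run on GPU or CPU.
print(subprocess.run(["ollama", "ps"], capture_output=True, text=True).stdout)

# On NVIDIA systems, nvidia-smi shows whether the ollama process holds GPU memory.
print(subprocess.run(["nvidia-smi"], capture_output=True, text=True).stdout)
```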
There is a PR to add Vulkan support to Ollama. With that, most multi-GPU setups would also have multiple usable backends, since most GPUs, even ARM ones, support Vulkan. Edit: I just looked at the PR more thoroughly, and it seems the Ollama team members are simply ignoring it...
Is your feature request related to a problem? Please describe.
These days a good number of computers come with multiple GPUs, and each GPU usually exposes multiple APIs to offload computation to. For instance, most GPUs support both OpenCL and Vulkan for compute. Intel Arc GPUs also have XMX cores specifically designed for matrix operations. Furthermore, many laptops and workstations come with more than a single GPU.
Describe the solution you'd like
It would be nice to have a mechanism for selecting the GPU or CPU, and, if it's a GPU, which one when several are present. There should be a global setting, and users should be able to override it per model. Sensible defaults should apply, such as always trying to use a GPU if one is available.
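A rough sketch of how such a setting could be modelled; every name here is purely illustrative and nothing like it exists in Alpaca or Ollama today:

```python
from dataclasses import dataclass, field

@dataclass
class ComputeTarget:
    backend: str = "auto"            # e.g. "cuda", "rocm", "vulkan", "cpu", or "auto"
    device_index: int | None = None  # which GPU when several are present

@dataclass
class ComputeSettings:
    global_default: ComputeTarget = field(default_factory=ComputeTarget)
    per_model: dict[str, ComputeTarget] = field(default_factory=dict)

    def target_for(self, model: str) -> ComputeTarget:
        # Per-model override wins; otherwise fall back to the global default,
        # which itself defaults to "auto" (use a GPU if one is available).
        return self.per_model.get(model, self.global_default)

# Illustrative usage with made-up model names:
settings = ComputeSettings(per_model={"llama3": ComputeTarget("cuda", 1)})
print(settings.target_for("llama3"))   # per-model override: CUDA GPU 1
print(settings.target_for("mistral"))  # falls back to the global "auto" default
```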
Describe alternatives you've considered
Not much of an alternative comes to mind.
Additional context
In addition to GPUs, some PCs, and even cheap SBCs, come with an NPU or TPU. This should be kept in mind.