GPUStack allows admins to download and manage model files.
GPUStack currently supports models from Hugging Face, ModelScope, and local paths. To add model files, navigate to the Model Files page.
Add Model File button and select Hugging Face from the dropdown.Qwen/Qwen3-0.6B.Available Files.Local Directory to download the model to a custom path instead of the GPUStack cache directory.Save button.Add Model File button and select ModelScope from the dropdown.Qwen/Qwen3-0.6B.Available Files.Local Directory to download the model to a custom path instead of the GPUStack cache directory.Save button.You can add models from a local path. The path can be a directory (e.g., a Hugging Face model folder) or a file (e.g., a GGUF model) located on the worker.
Add Model File button and select Local Path from the dropdown.Model Path.Save button.If a model file download fails — or gets stuck at a very low download speed — you can retry it:
Model Files page.Operations column and select Retry Download.Models can be deployed from model files. Since the model is stored on a specific worker, GPUStack will add a worker selector using the worker-name key to ensure proper scheduling.
!!! tip
If you want a model to fail over across nodes, make sure all nodes in the cluster can access the model files from the same path, and manually remove the `worker-name` label from the worker selector.
Model Files page.Deploy button in the Operations column.Name, Backend, Backend Version, Replicas, and other deployment parameters.Save button.Model Files page.Operations column and select Delete.Also delete the file from disk option.Delete button to confirm.