Model File Management

GPUStack allows admins to download and manage model files.

Add Model File

GPUStack currently supports models from Hugging Face, ModelScope, and local paths. To add model files, navigate to the Model Files page.

Add a Hugging Face Model

Click the Add Model File button and select Hugging Face from the dropdown.
Use the search bar in the top left to find a model by name, e.g., Qwen/Qwen3-0.6B.
(Optional) For GGUF models, select the desired quantization format from Available Files.
Select the target worker to download the model file.
(Optional) Specify a Local Directory to download the model to a custom path instead of the GPUStack cache directory.
Click the Save button.

Add a ModelScope Model

Click the Add Model File button and select ModelScope from the dropdown.
Use the search bar in the top left to find a model by name, e.g., Qwen/Qwen3-0.6B.
(Optional) For GGUF models, select the desired quantization format from Available Files.
Select the target worker to download the model file.
(Optional) Specify a Local Directory to download the model to a custom path instead of the GPUStack cache directory.
Click the Save button.

Add a Local Path Model

You can add models from a local path. The path can be a directory (e.g., a Hugging Face model folder) or a file (e.g., a GGUF model) located on the worker.

Click the Add Model File button and select Local Path from the dropdown.
Enter the Model Path.
Select the target worker.
Click the Save button.

Retry Download

If a model file download fails — or gets stuck at a very low download speed — you can retry it:

Navigate to the Model Files page.
Locate the model file.
Click the ellipsis button in the Operations column and select Retry Download.
GPUStack will attempt to download the model file again from the specified source.

Deploy Model

Models can be deployed from model files. Since the model is stored on a specific worker, GPUStack will add a worker selector using the worker-name key to ensure proper scheduling.

!!! tip

If you want a model to fail over across nodes, make sure all nodes in the cluster can access the model files from the same path, and manually remove the `worker-name` label from the worker selector.

Navigate to the Model Files page.
Find the model file you want to deploy.
Click the Deploy button in the Operations column.
Review or adjust the Name, Backend, Backend Version, Replicas, and other deployment parameters.
Click the Save button.

Delete Model File

Navigate to the Model Files page.
Find the model file you want to delete.
Click the ellipsis button in the Operations column and select Delete.
(Optional) Check the Also delete the file from disk option.
Click the Delete button to confirm.

model-file-management.md 2.9 KB Байнгын холболт Түүх Анхны өгөгдөл