Dify can integrate with MaaS-Base to leverage locally deployed LLMs, embeddings, reranking, image generation, Speech-to-Text and Text-to-Speech capabilities.
Deployments page and click on Deploy Model to deploy the models you need. Here are some example models:API Access Info to see how to integrate with this model.Hover over the user avatar and navigate to the API Keys page, then click on New API Key.
Fill in the name, then click Save.
Copy the API key and save it for later use.
PLUGINS, select Install from Marketplace, search for the MaaS-Base plugin, and choose to install it.Settings > Model Provider > GPUStack, then select Add Model and fill in:Model Type: Select the model type based on the model.
Model Name: The name must match the model name deployed on MaaS-Base.
Server URL: http://your-gpustack-url, do not use localhost, as it refers to the container’s internal network. If you’re using a custom port, make sure to include it. Also, ensure the URL is accessible from inside the Dify container (you can test this with curl).
API Key: Input the API key you copied from previous steps.
Click Save to add the model:
Add other models as needed, then select the added models in the System Model Settings and save:
You can now use the models in the Studio and Knowledge, here is a simple case:
Knowledge to create a knowledge, and upload your documents:Studio, add the previously created knowledge, select the chat model, and interact with it:qwen2.5-vl-3b-instruct, remove the previously added knowledge base, enable Vision, and upload an image in the chat to activate multimodal input: