Deploying private models through HuggingFace Repos
How to create a dedicated deployment for a private HuggingFace model
In order to deploy a Parasail dedicated endpoint for a private model from HuggingFace, you need to follow these steps:
Make your model private in HuggingFace.
Go to the settings page of your model.
At the top of the page, click on the button
Make private
.After the model becomes private, you should see the following:
Create a HuggingFace access token with finegrained permissions that only allows access to your private model.
On the
Access Tokens
page, click on the buttonCreate new token
.Leave token type as
Fine-grained
and write a name for your access token.Scroll down to the section
Repositories permissions
.In the textbox
Search for repos
, type the name of your model (e.g.meta-llama/Llama-3.2-1B
).Leave the permission to be
Read access to contents of selected repos
.Click on the button
Create token
at the bottom of the page.Copy the access token that is generated.
Create a dedicated deployment in Parasail.
Click on the button
Create Dedicated Model
on the Parasail dashboard.Copy your model name (e.g.
meta-llama/Llama-3.2-1B
) into the textboxHuggingFace ID / URL
Copy your access token to the textbox
HuggingFace Token
.Click on the button
Deploy
.
Last updated