Cerebras Inference
Cerebras Inference uses specialized silicon to provide fast inference for the Llama 3.1 8B and Llama 3.1 70B models.
- Create an account in the Cerebras portal.
- Create and copy the API key for use in Continue.
- Update your Continue config file:
config.json
{
  "models": [
    {
      "title": "Cerebras Llama 3.1 70B",
      "provider": "cerebras",
      "model": "llama3.1-70b",
      "apiKey": "YOUR_API_KEY"
    }
  ]
}
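
If you already have other models configured, the Cerebras entry needs to be merged into the existing "models" list rather than replacing it. The following is a minimal Python sketch of that merge. The helper name `add_cerebras_model` and the in-memory starting config are hypothetical and used only for illustration; the field names and values are the ones shown in the config above.

```python
import json
from copy import deepcopy


def add_cerebras_model(config, api_key, model="llama3.1-70b",
                       title="Cerebras Llama 3.1 70B"):
    """Return a copy of `config` with a Cerebras entry under "models".

    Hypothetical helper for illustration; it is not part of Continue.
    """
    updated = deepcopy(config)
    entry = {
        "title": title,
        "provider": "cerebras",
        "model": model,
        "apiKey": api_key,
    }
    # Drop any earlier entry with the same title so reruns do not duplicate it.
    models = [m for m in updated.get("models", []) if m.get("title") != title]
    models.append(entry)
    updated["models"] = models
    return updated


if __name__ == "__main__":
    # Assumed starting point: in practice this dict would be loaded from
    # your Continue config file, whose location depends on your setup.
    existing = {"models": []}
    print(json.dumps(add_cerebras_model(existing, "YOUR_API_KEY"), indent=2))
```

The function returns a new dict rather than editing the input, so the original config stays unchanged until you choose to write the result back to disk.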