---
title: Local LLMs
subtitle: Run Skyvern with Ollama or any OpenAI-compatible endpoint
slug: integrations/local-llms
---

Skyvern supports local LLMs for self-hosted deployments. Use Ollama directly, or route through LiteLLM to connect any model provider.

---

## What you'll need

- A self-hosted Skyvern deployment
- Ollama installed locally, or an OpenAI-compatible endpoint

---

## Option A: Ollama

Use Ollama to run open-source models locally.

### Step 1: Start Ollama

```bash
ollama pull llama3.1
ollama serve
```

The API runs at `http://localhost:11434`.

### Step 2: Configure Skyvern

Add to your `.env` file:

```bash
ENABLE_OLLAMA=true
OLLAMA_SERVER_URL=http://localhost:11434
OLLAMA_MODEL=llama3.1

# Enable for vision models (qwen2-vl, llava, etc.)
OLLAMA_SUPPORTS_VISION=false
```

| Variable | Description |
|----------|-------------|
| `ENABLE_OLLAMA` | Enable the Ollama integration. |
| `OLLAMA_SERVER_URL` | Ollama server URL. Defaults to `http://localhost:11434`. |
| `OLLAMA_MODEL` | Model name. Check available models with `ollama list`. |
| `OLLAMA_SUPPORTS_VISION` | Enable vision support for multimodal models like `qwen2-vl` or `llava`. |

### Step 3: Verify the connection

```bash
curl -s http://localhost:11434/api/tags | jq .
```
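
The same check can be scripted. Below is a minimal Python sketch, assuming the `/api/tags` response contains a `models` array with `name` fields; Ollama model names carry a tag suffix (e.g. `llama3.1:latest`), so the helper accepts the bare name too:

```python
import json

def model_available(tags_json: dict, model: str) -> bool:
    """Check whether `model` appears in an Ollama /api/tags response."""
    names = [m.get("name", "") for m in tags_json.get("models", [])]
    # Accept either an exact match or the name without its ":tag" suffix.
    return any(n == model or n.split(":", 1)[0] == model for n in names)

# Example against a captured response (shape assumed from the curl output above):
sample = json.loads('{"models": [{"name": "llama3.1:latest"}]}')
print(model_available(sample, "llama3.1"))  # True
```

If this returns `False` for your configured `OLLAMA_MODEL`, run `ollama pull` for that model before starting Skyvern.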

---

## Option B: LiteLLM

Use LiteLLM as an OpenAI-compatible proxy to connect any model provider.

### Step 1: Start LiteLLM

```bash
litellm --model ollama/llama3.1 --host 0.0.0.0 --port 4000
```
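
To serve more than one model, LiteLLM can also start from a config file instead of CLI flags. A minimal sketch, assuming LiteLLM's `model_list` config format (run with `litellm --config config.yaml`); adjust the model names and `api_base` to your setup:

```yaml
# config.yaml — illustrative example, not a complete configuration
model_list:
  - model_name: llama3.1
    litellm_params:
      model: ollama/llama3.1
      api_base: http://localhost:11434
```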
### Step 2: Configure Skyvern

Add to your `.env` file:

```bash
ENABLE_OPENAI_COMPATIBLE=true
OPENAI_COMPATIBLE_MODEL_NAME=llama3.1
OPENAI_COMPATIBLE_API_KEY=sk-test
OPENAI_COMPATIBLE_API_BASE=http://localhost:4000/v1
```

| Variable | Description |
|----------|-------------|
| `ENABLE_OPENAI_COMPATIBLE` | Enable the OpenAI-compatible provider. |
| `OPENAI_COMPATIBLE_MODEL_NAME` | Model identifier as exposed by the proxy. |
| `OPENAI_COMPATIBLE_API_KEY` | API key for the proxy. |
| `OPENAI_COMPATIBLE_API_BASE` | Base URL. Must end with `/v1`. |
| `OPENAI_COMPATIBLE_SUPPORTS_VISION` | Enable vision support for multimodal models. |
| `OPENAI_COMPATIBLE_REASONING_EFFORT` | Set to `low`, `medium`, or `high`. |
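
A misconfigured base URL (missing the `/v1` suffix) or an empty key are common failure modes. A small Python sketch that sanity-checks these settings before starting Skyvern; the variable names come from the table above, while the validation rules themselves are illustrative:

```python
from urllib.parse import urlparse

def check_openai_compatible(env: dict) -> list:
    """Return a list of problems found in OPENAI_COMPATIBLE_* settings."""
    problems = []
    if env.get("ENABLE_OPENAI_COMPATIBLE") != "true":
        problems.append("ENABLE_OPENAI_COMPATIBLE is not 'true'")
    if not env.get("OPENAI_COMPATIBLE_MODEL_NAME"):
        problems.append("OPENAI_COMPATIBLE_MODEL_NAME is empty")
    if not env.get("OPENAI_COMPATIBLE_API_KEY"):
        problems.append("OPENAI_COMPATIBLE_API_KEY is empty")
    base = env.get("OPENAI_COMPATIBLE_API_BASE", "")
    if urlparse(base).scheme not in ("http", "https"):
        problems.append("OPENAI_COMPATIBLE_API_BASE is not a valid URL")
    if not base.rstrip("/").endswith("/v1"):
        problems.append("OPENAI_COMPATIBLE_API_BASE must end with /v1")
    effort = env.get("OPENAI_COMPATIBLE_REASONING_EFFORT")
    if effort and effort not in ("low", "medium", "high"):
        problems.append("OPENAI_COMPATIBLE_REASONING_EFFORT must be low, medium, or high")
    return problems

# The example values from the .env snippet above pass cleanly:
print(check_openai_compatible({
    "ENABLE_OPENAI_COMPATIBLE": "true",
    "OPENAI_COMPATIBLE_MODEL_NAME": "llama3.1",
    "OPENAI_COMPATIBLE_API_KEY": "sk-test",
    "OPENAI_COMPATIBLE_API_BASE": "http://localhost:4000/v1",
}))  # []
```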
### Step 3: Verify the connection

```bash
curl -s http://localhost:4000/v1/models \
  -H "Authorization: Bearer sk-test" | jq .
```
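
As with Ollama, this check can be scripted. A sketch assuming the standard OpenAI `/v1/models` response shape, `{"data": [{"id": ...}]}`:

```python
import json

def model_listed(models_json: dict, model: str) -> bool:
    """Check whether `model` appears in an OpenAI-compatible /v1/models response."""
    return any(m.get("id") == model for m in models_json.get("data", []))

# Example against a captured response (shape assumed from the curl output above):
sample = json.loads('{"data": [{"id": "llama3.1", "object": "model"}]}')
print(model_listed(sample, "llama3.1"))  # True
```

The model id returned here should match `OPENAI_COMPATIBLE_MODEL_NAME` in your `.env`.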

---

## Step 4: Start Skyvern

After configuring your `.env`, start the server:

```bash
# With Docker
docker compose up -d

# Or locally
skyvern run server
```

---

## Troubleshooting

| Issue | Solution |
|-------|----------|
| Model not responding | Ensure `ollama serve` is running and the model exists (`ollama list`). |
| LiteLLM 401 error | Set `OPENAI_COMPATIBLE_API_KEY` to a value the proxy accepts. |
| Model not visible | Set `ENABLE_OLLAMA=true` or `ENABLE_OPENAI_COMPATIBLE=true` and restart. |
| Wrong base URL | Confirm `OPENAI_COMPATIBLE_API_BASE` ends with `/v1`. |

---

## Next steps

<CardGroup cols={2}>
  <Card
    title="API Quickstart"
    icon="rocket"
    href="/getting-started/quickstart"
  >
    Get started with Skyvern
  </Card>
  <Card
    title="Run a Task"
    icon="play"
    href="/running-automations/run-a-task"
  >
    Learn the task API
  </Card>
</CardGroup>