Hermes on Mingles Router

Run Hermes on Mingles Router

A step-by-step wizard for pointing the Hermes Agent at Mingles Router — whether you are installing it from scratch or already have it running.

Pick your model and paste your key — every command and value on this page updates live.

Endpoint: https://router.mingles.ai/v1 · key stays in your browser · no key entered? commands show <your-key>.

Install + interactive setup

1. Install Hermes

One command. Works on macOS and Linux.

shell
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash

2. Start the setup

Run hermes (or hermes setup). At the first prompt choose Full setup — bring your own keys, not Quick Setup. In the provider list, scroll to the bottom and pick Custom endpoint (enter URL manually).

shell
hermes

3. Answer the four prompts

Hermes asks for the base URL, key, compatibility mode and model — use the exact values in the table below. The endpoint is verified against /v1/models as you type the URL; you should see “3 model(s) visible”.

4. Verify & chat

hermes doctor checks the connection; then just run hermes to start a conversation.

shell
hermes doctor
hermes

Exact values Hermes asks for

API base URL https://router.mingles.ai/v1
API key <your-key>
API compatibility mode 2 — Chat Completions
Model moonshotai/Kimi-K2.6
Context length 200000

Switch model

All three models share the same endpoint, so switching is just a model id — no new key, no re-setup. Inside a chat use /model, or persist it from the shell:

hermes session
/model custom:moonshotai/Kimi-K2.6
shell
hermes config set model.default moonshotai/Kimi-K2.6

Before you file a bug: read the limits

Reasoning models spend the output budget on internal thinking, output is capped at 8192 tokens, there is no KV cache, and there are no built-in web tools. Most “it broke on Mingles Router” reports are one of these. See Model limits & behavior →

Frequently asked

Which API compatibility mode do I pick? +

Chat Completions (option 2). Mingles Router speaks the OpenAI /v1/chat/completions dialect. Do not pick Responses/Codex or Anthropic Messages.

What provider name does Hermes use for a custom endpoint? +

custom. In ~/.hermes/config.yaml that is model.provider: "custom" with model.base_url pointing at the gateway. Aliases ollama / vllm / llamacpp also map to custom.

What context length should I enter? +

Set model.context_length to 200000 for Kimi-K2.6. Hermes auto-detects context length from a provider catalog, but it does not know our custom endpoint, so set it manually. Output is separately capped at 8192 tokens.

Where is the API key stored? +

Either in ~/.hermes/config.yaml (model.api_key) or in ~/.hermes/.env as OPENAI_API_KEY — the custom provider falls back to OPENAI_API_KEY. The key never leaves your machine.

How do I switch between Kimi, Qwen and MiniMax? +

They all live on the same endpoint, so just change the model id: /model custom:<model-id> in a session, or hermes config set model.default <model-id>. No new keys needed.