NVIDIA

NVIDIA bietet eine OpenAI-kompatible API unter https://integrate.api.nvidia.com/v1 für Nemotron- und NeMo-Modelle. Authentifiziere dich mit einem API-Key von NVIDIA NGC.

CLI-Einrichtung

Exportiere den Key einmalig, führe dann das Onboarding durch und setze ein NVIDIA-Modell:

export NVIDIA_API_KEY="nvapi-..."
openclaw onboard --auth-choice skip
openclaw models set nvidia/nvidia/llama-3.1-nemotron-70b-instruct

Wenn du --token verwendest, beachte, dass es in der Shell-History und ps-Ausgabe landet; bevorzuge die Umgebungsvariable, wenn möglich.

Konfigurationsbeispiel

{
  env: { NVIDIA_API_KEY: "nvapi-..." },
  models: {
    providers: {
      nvidia: {
        baseUrl: "https://integrate.api.nvidia.com/v1",
        api: "openai-completions",
      },
    },
  },
  agents: {
    defaults: {
      model: { primary: "nvidia/nvidia/llama-3.1-nemotron-70b-instruct" },
    },
  },
}

Modell-IDs

nvidia/llama-3.1-nemotron-70b-instruct (Standard)
meta/llama-3.3-70b-instruct
nvidia/mistral-nemo-minitron-8b-8k-instruct

Hinweise

OpenAI-kompatibler /v1-Endpunkt; verwende einen API-Key von NVIDIA NGC.
Provider wird automatisch aktiviert, wenn NVIDIA_API_KEY gesetzt ist; verwendet statische Standardwerte (131.072-Token-Kontextfenster, 4.096 maximale Tokens).