LiteLLM

Office Web - Source - Docker Image - Document

LiteLLM 是一个非常实用的开源 Python 库（也是目前最流行的 LLM 网关/代理工具之一）。

使用 OpenAI 的输入/输出格式，调用 100+ 个大型语言模型

将输入转换为各个服务提供商的端点格式（包括 /chat/completions、/responses、/embeddings、/images、/audio、/batches 等）。
统一的输出格式 - 无论您使用哪个服务提供商，响应格式都保持一致。
提供重试/回退机制，可在多个部署之间切换（例如，Azure/OpenAI）- 路由器。
跟踪支出并为每个项目设置预算 - LiteLLM 代理服务器。

使用教程

环境变量

管理平台登录

bash

# 设置登录账号和密码
# https://docs.litellm.ai/docs/proxy/ui#4-change-default-username--password
UI_USERNAME=ishaan-litellm   # username to sign in on UI
UI_PASSWORD=langchain        # password to sign in on UI

# 设置入口网址
# https://docs.litellm.ai/docs/proxy/cli_sso#steps
LITELLM_PROXY_URL=https://litellm-api.up.railway.app

# 设置通知邮箱
# https://docs.litellm.ai/docs/proxy/email#1-configure-email-integration
SMTP_HOST="smtp.resend.com"
SMTP_TLS="True"
SMTP_PORT="587"
SMTP_USERNAME="resend"
SMTP_SENDER_EMAIL="notifications@alerts.litellm.ai"
SMTP_PASSWORD="xxxxx"

全部设置（文件）：config.yaml 全部设置（环境变量）：.env

添加模型

注意：使用代理配合数据库时，你也可以直接通过 UI 添加模型 （UI 可在 /ui 路由访问）。文件名为 config.yaml，相关参数查看模型配置

yaml

model_list:
  - model_name: gpt-4o
    litellm_params:
      model: azure/my_azure_deployment
      api_base: "os.environ/AZURE_API_BASE"
      api_key: "os.environ/AZURE_API_KEY"
      api_version: "2025-01-01-preview" # [OPTIONAL] litellm uses the latest azure api_version by default

model_name (字符串) - 此字段应包含接收到的模型名称。
litellm_params (字典) 查看所有 LiteLLM 参数
- model (字符串) - 指定要发送到 litellm.acompletion / litellm.aembedding 等的模型的名称。这是 LiteLLM 用于在后端将请求路由到正确的模型和提供商的标识符。
- api_key (字符串) - 身份验证所需的 API 密钥。可以从环境变量中获取，使用 os.environ/。
- api_base (字符串) - 您的 Azure 部署的 API 基础地址。
- api_version (字符串) - 调用 Azure OpenAI API 时使用的 API 版本。您可以从这里获取最新的推理 API 版本。

关于 litellm_params 下的 model 参数的详细说明：

yaml

model_list:
  - model_name: gpt-4o                       # 客户端使用的名称
    litellm_params:
      model: azure/my_azure_deployment       # <提供商>/<模型名称>
             ─────  ───────────────────
               │           │
               │           └─────▶ 发送给提供商 API 的模型名称
               │
               └─────────────────▶ LiteLLM 路由到哪个提供商

可视化分解：

model: azure/my_azure_deployment
       └─┬─┘ └─────────┬─────────┘
         │             │
         │             └────▶ 发送给 Azure 的实际模型标识符
         │                   （例如，你的部署名称或模型名称）
         │
         └──────────────────▶ 告诉 LiteLLM 使用哪个提供商
                             （azure, openai, anthropic, bedrock 等）

关键概念：

model_name: 客户端用于调用模型的别名。这是你在 API 请求中发送的内容（例如，gpt-4o）。
model (在 litellm_params 中)：格式为 <提供商>/<模型标识符>
- 提供商（/ 之前）：用于路由到正确的 LLM 提供商（例如，azure、openai、anthropic、bedrock）。
- 模型标识符（/ 之后）：发送给该提供商 API 的实际模型/部署名称。

高级配置示例：

对于兼容 OpenAI 的自定义端点（例如，vLLM、Ollama、自定义部署）：

yaml

model_list:
  - model_name: my-custom-model
    litellm_params:
      model: custom_openai/nvidia/llama-3.2-nv-embedqa-1b-v2
      api_base: http://my-service.svc.cluster.local:8000/v1
      api_key: "sk-1234"

# 或者(推荐)
model_list:
  - model_name: my-custom-model # 调用的模型名称
    litellm_params:
      model: nvidia/llama-3.2-nv-embedqa-1b-v2 # 代理源的模型名称
      custom_llm_provider: custom_openai # 固定值，openai 兼容模式
      api_base: http://my-service.svc.cluster.local:8000/v1
      api_key: "sk-1234"

上述两种方式效果是一样的，区别在于：

yaml

model: custom_openai/nvidia/llama-3.2-nv-embedqa-1b-v2

# 拆分成
model: nvidia/llama-3.2-nv-embedqa-1b-v2
custom_llm_provider: custom_openai

设置忽略不可用的参数

yaml

litellm_settings:
  drop_params: true          # 全局生效，所有模型自动丢弃不支持的参数
# 环境变量
# LITELLM_DROP_PARAMS=True

请求 /metrics 出错

bash

"GET /metrics HTTP/1.1" 404 Not Found

yaml

litellm_settings:
  callbacks:
    - prometheus

分解复杂的模型路径：

model: openai/nvidia/llama-3.2-nv-embedqa-1b-v2
       └─┬──┘ └────────────┬────────────────┘
         │                 │
         │                 └────▶ 发送给提供商 API 的完整模型字符串
         │                       （在本例中： "nvidia/llama-3.2-nv-embedqa-1b-v2"）
         │
         └──────────────────────▶ 提供商 (openai = OpenAI 兼容的 API)

关键点：/ 之后的所有内容都会原样传递给提供商的 API。

本地开发（无需 postgres 和 prometheus）

含三文件：compose.yaml, .env, config.yaml
config.yaml 文件已添加了 iflow 的支持。需要设置环境变量 IFLOW_API_BASE 和 IFLOW_API_KEY

yaml

# compose.yaml
---
# https:///ghcr.io/berriai/litellm:main-stable
services:
  litellm:
    container_name: litellm
    env_file:
      - path: ./.env
        required: false
    environment:
      IFLOW_API_BASE: "https://apis.iflow.cn/v1"
      IFLOW_API_KEY: "$IFLOW_API_KEY"
    hostname: litellm
    healthcheck:  # Defines the health check configuration for the container
      test:
        - CMD-SHELL
        - python3 -c "import urllib.request; urllib.request.urlopen('http://localhost:4000/health/liveliness')"  # Command to execute for health check
      interval: 30s  # Perform health check every 30 seconds
      timeout: 10s   # Health check command times out after 10 seconds
      retries: 3     # Retry up to 3 times if health check fails
      start_period: 40s  # Wait 40 seconds after container start before beginning health checks
    image: docker.litellm.ai/berriai/litellm:main-latest
    ports:
      - 4000:4000
    #extra_hosts:
    #  - "host.docker.internal:host-gateway"
    #network_mode: host
    restart: unless-stopped
    #########################################
    ## Uncomment these lines to start proxy with a config.yaml file ##
    volumes:
     - ./config.yaml:/app/config.yaml
    command:
     - "--config=/app/config.yaml"
    ##############################################

dotenv

# .env 文件
# CUSTOM
TZ=Asia/Shanghai

# 必须 sk- 开头
LITELLM_MASTER_KEY="sk-1234"
DISABLE_ADMIN_UI="True"

# 忽略不支持的参数
LITELLM_DROP_PARAMS=True

yaml

# config.yaml
model_list:
  - model_name: ollama/translategemma:12b
    litellm_params:
      model: ollama/translategemma:12b
#      custom_llm_provider: custom_openai
#      api_key: null
      api_base: "http://192.168.1.117:11434"

  - model_name: iflow/qwen3-coder-plus
    litellm_params:
      model: qwen3-coder-plus
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/glm-4.6
    litellm_params:
      model: glm-4.6
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/iflow-rome-30ba3b
    litellm_params:
      model: iflow-rome-30ba3b
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/qwen3-max
    litellm_params:
      model: qwen3-max
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/qwen3-vl-plus
    litellm_params:
      model: qwen3-vl-plus
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/qwen3-max-preview
    litellm_params:
      model: qwen3-max-preview
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/kimi-k2-0905
    litellm_params:
      model: kimi-k2-0905
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/kimi-k2
    litellm_params:
      model: kimi-k2
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/deepseek-v3.2
    litellm_params:
      model: deepseek-v3.2
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/deepseek-r1
    litellm_params:
      model: deepseek-r1
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/deepseek-v3
    litellm_params:
      model: deepseek-v3
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/qwen3-32b
    litellm_params:
      model: qwen3-32b
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/qwen3-235b-a22b-thinking-2507
    litellm_params:
      model: qwen3-235b-a22b-thinking-2507
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/qwen3-235b-a22b-instruct
    litellm_params:
      model: qwen3-235b-a22b-instruct
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

  - model_name: iflow/qwen3-235b
    litellm_params:
      model: qwen3-235b
      custom_llm_provider: custom_openai
      api_base: "os.environ/IFLOW_API_BASE"
      api_key: "os.environ/IFLOW_API_KEY"

触发 `/chat/completion`

LiteLLM 代理 100%兼容 OpenAI。通过 /chat/completions 途径测试你的 Azure 模型。

bash

curl -X POST 'http://0.0.0.0:4000/chat/completions' \
-H 'Content-Type: application/json' \
-H 'Authorization: Bearer sk-1234' \
-d '{
    "model": "gpt-4o",
    "messages": [
      {
        "role": "system",
        "content": "You are an LLM named gpt-4o"
      },
      {
        "role": "user",
        "content": "what is your name?"
      }
    ]
}'

生成虚拟密钥

config.yaml

yaml

model_list:
  - model_name: gpt-4o
    litellm_params:
      model: azure/my_azure_deployment
      api_base: os.environ/AZURE_API_BASE
      api_key: "os.environ/AZURE_API_KEY"
      api_version: "2025-01-01-preview" # [OPTIONAL] litellm uses the latest azure api_version by default

# 或者直接通过 .env 环境变量设置
general_settings:
  master_key: sk-1234 # 必须以 sk- 开头
  database_url: "postgresql://<user>:<password>@<host>:<port>/<dbname>" # 👈 KEY CHANGE

什么是 general_settings？

这些是 LiteLLM 代理服务器的设置。查看所有通用设置在此处。

master_key (字符串)
- 描述：
  - 设置一个 master key，这是您的代理管理员密钥。您可以使用它来创建其他密钥（🚨 必须以 sk- 开头）。
- 用法：
  - 在 config.yaml 中设置： 在 general_settings:master_key 下设置您的主密钥，例如： master_key: sk-1234
  - 设置环境变量： 设置 LITELLM_MASTER_KEY。
database_url (字符串)
- 描述：
  - 设置一个 database_url，这是连接到您的 PostgreSQL 数据库的地址，LiteLLM 使用它来生成密钥、用户和团队。
- 用法：
  - 在 config.yaml 中设置： 在 general_settings:database_url 下设置您的数据库 URL，例如： database_url: "postgresql://..."
  - 在环境变量中设置 DATABASE_URL=postgresql://<user>:<password>@<host>:<port>/<dbname>。

常见问题

本地部署（不部署 postgres、prometheus）
1. 移除 compose.yaml 中的 db 和 prometheus 服务。
2. 将 .env 文件中的 DATABASE_URL 注释。禁用数据库功能。
3. 将 .env 文件中的 DISABLE_ADMIN_UI=True 启用。禁用 UI 功能。
SSL 证书错误

bash

ssl.SSLCertVerificationError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: self-signed certificate in certificate chain (_ssl.c:1006)

yaml

# config.yaml
litellm_settings:
    ssl_verify: false # 👈 KEY CHANGE

数据库连接失败

bash

httpx.ConnectError: All connection attempts failed

ERROR:    Application startup failed. Exiting.
3:21:43 - LiteLLM Proxy:ERROR: utils.py:2207 - Error getting LiteLLM_SpendLogs row count: All connection attempts failed

bash

# 创建数据库
STATEMENT: CREATE DATABASE "litellm"

若权限有问题

bash

ERROR: permission denied to create

bash

GRANT ALL PRIVILEGES ON DATABASE litellm TO your_username;

脚本

查看所有模型

bash

curl -X 'GET' 'http://0.0.0.0:4000/models' \
  -H 'accept: application/json' \
  -H "x-litellm-api-key: $LITELLM_API_KEY" \
  | jq -r '.data[].id'

添加模型

bash

curl -X 'POST' \
  'http://0.0.0.0:4000/model/new' \
  -H 'accept: application/json' \
  -H "x-litellm-api-key: $LITELLM_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "model_name": "'$BASE/$MODEL'",
    "litellm_params": {
      "api_key": "'$ADD_API_KEY'",
      "api_base": "'$ADD_API_URL'",
      "model": "'$MODEL'",
      "custom_llm_provider": "custom_openai"
    }
  }'

从文本中批量添加模型（需要预设参数）

bash

while IFS= read -r MODEL; do echo "$MODEL"; curl -X 'PO
ST' \
 'http://0.0.0.0:4000/model/new' \
 -H 'accept: application/json' \
 -H "x-litellm-api-key: $LITELLM_API_KEY" \
 -H 'Content-Type: application/json' \
 -d '{
	   "model_name": "'$BASE/$MODEL'",
	   "litellm_params": {      
	     "api_key": "'$ADD_API_KEY'",
	     "api_base": "'$ADD_API_URL'",
	     "model": "'$MODEL'",                       
	     "custom_llm_provider": "custom_openai"  
	   }
   }'; done < add.txt

测试模型

bash

# 简单 GET-like 测试（实际是 POST）
curl -X POST http://0.0.0.0:4000/v1/chat/completions \
  -H "Authorization: Bearer $LITELLM_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "'$MODEL'",
    "messages": [
      {
        "role": "system",
        "content": "你是一个超级简洁的助手。"
      },
      {
        "role": "user",
        "content": "随机给我一首古诗。"
      }
    ],
    "temperature": 0.3,
    "max_tokens": 50
  }'| jq -r

批量测试模型

bash

while IFS= read -r MODEL; do echo "$MODEL"; # 简单 GET-like 测试（实际是 POST）                                                1 ↵
curl -X POST http://0.0.0.0:4000/v1/chat/completions \
 -H "Authorization: Bearer $LITELLM_API_KEY" \
 -H "Content-Type: application/json" \
 -d '{
   "model": "'$MODEL'",
   "messages": [
     {
       "role": "system",
       "content": "你是一个古诗历史学家。"
     },
     {
       "role": "user",
       "content": "随机给我一首古诗。"
     }
   ],
   "temperature": 0.3,
   "max_tokens": 50
 }'| jq -r  ; done < test.txt

LiteLLM ​

使用教程 ​

环境变量 ​

添加模型 ​

本地开发（无需 postgres 和 prometheus） ​

触发 /chat/completion ​

生成虚拟密钥 ​

常见问题 ​

脚本 ​

查看所有模型 ​

添加模型 ​

测试模型 ​

LiteLLM

使用教程

环境变量

添加模型

本地开发（无需 postgres 和 prometheus）

触发 `/chat/completion`

生成虚拟密钥

常见问题

脚本

查看所有模型

添加模型

测试模型