Star 历史趋势
数据来源: GitHub API · 生成自 Stargazers.cn
README.md

vLLM Skills

A collection of skills for deploying and benchmarking vLLM. This project follows the anthropics/skills template format and is installable as a Claude Code plugin marketplace.

Overview

This repository provides modular, reusable agent skills required to operate and benchmark vLLM, following the Anthropics SKILL.md specification. Each skill is a self-contained directory implementing automation, scripts, and metadata for a specific operational task.

Skills Index

SkillDescription
vllm-deploy-dockerDeploy vLLM using Docker (pre-built images or build-from-source) with NVIDIA GPU support and run the OpenAI-compatible server.
vllm-deploy-k8sDeploy vLLM to Kubernetes with GPU support, health probes, and OpenAI-compatible API endpoint.
vllm-deploy-simpleQuick install and deploy vLLM, start serving with a simple LLM, and test OpenAI API.
vllm-prefix-cache-benchBenchmark the efficiency of vLLM automatic prefix caching using fixed prompts, real datasets, or synthetic prefix/suffix patterns.
vllm-bench-random-syntheticRun vLLM performance benchmark using synthetic random data to measure throughput, TTFT, TPOT, and other key performance metrics without downloading external datasets.
vllm-bench-serveBenchmark vLLM or OpenAI-compatible serving endpoints using vllm bench serve.

Installation

Plugin Marketplace (Recommended)

Install directly from the plugin marketplace in Claude Code:

/plugin marketplace add vllm-project/vllm-skills
/plugin install vllm-skills@vllm-skills

Manual Install

Clone the repository and copy skills to your Claude Code skills directory:

git clone https://github.com/vllm-project/vllm-skills.git
cd vllm-skills

Copy to global skill folder:

cp -r plugins/vllm-skills/skills/vllm-deploy-simple ~/.claude/skills/

Or copy to the project skill folder:

cp -r plugins/vllm-skills/skills/vllm-deploy-simple .claude/skills/

Usage

Once installed, use the skills with slash commands or natural language:

/vllm-deploy-simple
Deploy vLLM with Qwen2.5-1.5B-Instruct on port 8000
Install and start a vLLM server using the vllm-deploy-simple skill

Supported Models

See vLLM documentation for the full list.

Contributing

This project follows the anthropics/skills template. When adding new skills:

  1. Create a new directory under plugins/vllm-skills/skills/ (e.g., plugins/vllm-skills/skills/your-skill/)
  2. Add a SKILL.md file with YAML frontmatter:
    ---
    name: your-skill
    description: Brief description of what this skill does
    ---
  3. Add optional scripts/, references/, and assets/ directories
  4. Update this README with your skill documentation

License

Licensed under the Apache License 2.0. See LICENSE.

Resources

关于 About

Agent skills for vLLM

语言 Languages

Shell100.0%

提交活跃度 Commit Activity

代码提交热力图
过去 52 周的开发活跃度
32
Total Commits
峰值: 15次/周
Less
More

核心贡献者 Contributors