Star 历史趋势
数据来源: GitHub API · 生成自 Stargazers.cn
README.md
vLLM Semantic Router

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Documentation | Playground | Blog | Publications | Hugging Face


About

In the LLM era, the number of models is exploding. Different models vary across capability, scale, cost, and privacy boundaries. Choosing and connecting the right models to build semantic AI infrastructure is a system problem.

vLLM Semantic Router is a signal-driven intelligent router for that problem. It helps teams build model systems that are more efficient, safer, and more adaptive across cloud, data center, and edge environments.

system

It delivers three core values:

  • Token economics: reduce wasted tokens, increase effective output, and maximize the value of every token.
  • LLM safety: detect jailbreaks, sensitive leakage, and hallucinations so agents remain controllable, trustworthy, and auditable.
  • Fullmesh intelligence: build personal AI at the edge and intelligent MaaS in the cloud by coordinating local, private, and frontier models across cost, privacy, and capability boundaries.

Getting Started

Install

curl -fsSL https://vllm-semantic-router.com/install.sh | bash

For platform notes, detailed setup options, and troubleshooting, see the Installation Guide.

[!IMPORTANT] Online playground default credentials:

  • username: love@vllm-sr.ai
  • password: vllm-sr

Latest News

Earlier announcements

More announcements are available on the Blog and Publications pages.

Community

For questions, feedback, or to contribute, please join the #semantic-router channel in vLLM Slack.

Community Meetings

We host community meetings on the first and third Tuesday of each month to sync with contributors across different time zones:

Contributing

If you want to contribute, start with CONTRIBUTING.md.

For repository-native development workflow and validation commands, use AGENTS.md as the entrypoint and docs/agent/README.md as the canonical index.

Citation

If you find Semantic Router helpful in your research or projects, please consider citing it:

@misc{semanticrouter2025,
  title={vLLM Semantic Router},
  author={vLLM Semantic Router Team},
  year={2025},
  howpublished={\url{https://github.com/vllm-project/semantic-router}},
}

Star History

Star History Chart

Sponsors

We are grateful to our sponsors who support us:


AMD provides us with GPU resources and ROCm™ software for training and researching frontier router models, enhancing E2E testing, and building the online models playground.


关于 About

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge
ai-gatewaybert-classificationfine-tuninggolanghuggingface-candlehuggingface-transformerskubernetesllmllmroutermcpmixture-of-modelsopenclawpii-detectionprompt-engineeringprompt-guardrustsemantic-routervllm

语言 Languages

Go47.0%
Python19.3%
TypeScript13.4%
Rust9.8%
CSS3.4%
Shell2.5%
TeX1.6%
Makefile1.1%
C++0.4%
ASL0.3%
JavaScript0.3%
SCSS0.2%
Dockerfile0.1%
HTML0.1%
Svelte0.1%
BibTeX Style0.1%
HIP0.1%
C0.1%
CMake0.1%
Go Template0.0%
Assembly0.0%
GLSL0.0%

提交活跃度 Commit Activity

代码提交热力图
过去 52 周的开发活跃度
1573
Total Commits
峰值: 78次/周
Less
More

核心贡献者 Contributors